In the third of four articles about fairness and explainable AI, Sinem Unal describes some methods companies can use to reduce bias in their ML models.
Applications of AI and machine learning (ML) have become increasingly visible in our lives. An essential component of that rise in popularity is their ability to produce astonishingly accurate results with high performance. An important question remains, however: are ML systems fair, or are they biased? The answer matters because decisions based on them can have serious impacts on individuals, whether in recruiting, banking, or healthcare. Anyone can relate to the anxious wait for a decision on a job or loan application. ML systems should not make these processes any harder by introducing bias.
In the first article of this series, the focus was on the question of fairness and on how companies can ensure their automated decision-making is fair. Of course, fairness can be defined in many ways, so companies need to choose their definition carefully and objectively. Once that decision is made, the data science team can get back into the game to build and test their models against this playbook.
Now suppose that you, as a data scientist, followed those procedures and found that you have a biased model at hand. What course of action do you have then? Luckily, there are several methods to reduce algorithmic bias, and this blog post will go over them. These techniques can be grouped into three categories based on the stage at which they are applied: pre-processing, in-processing and post-processing. Briefly, pre-processing techniques are applied to the data itself, whereas in-processing techniques modify the model. Post-processing techniques, on the other hand, focus on the output of the ML model.
Using tools developed by DAIN Studios, this article demonstrates the application of bias mitigation techniques and their implications on the German Credit Dataset.
Decisions on Loan Applications as an Example
The dataset consists of a thousand samples from bank account holders, which include their account details, financial status and personal information such as age and sex. It also records whether each person is eligible for credit or not. A machine learning model is applied to predict whether a given person is eligible for credit. At the same time, the fairness of the model is checked, i.e. whether it treats females and males equally. For this problem, statistical parity is selected as the fairness metric. It measures whether the protected (female) and unprotected (male) groups have an equal chance of a positive outcome according to the model, in this case getting a loan. The ratio of correct decisions to all decisions was also logged to track the model's accuracy.
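As an illustration, a statistical-parity check along these lines could be implemented with the open-source fairlearn library. The sketch below is not the actual pipeline: the tiny arrays are made-up stand-ins for the real predictions on the dataset.

```python
# Minimal sketch of a statistical-parity check; the arrays below are
# illustrative placeholders, not results from the German Credit Dataset.
from fairlearn.metrics import MetricFrame, selection_rate
from sklearn.metrics import accuracy_score

y_true = [1, 0, 1, 1, 0, 1, 0, 1]          # actual creditworthiness
y_pred = [1, 0, 1, 0, 0, 1, 1, 1]          # model's loan decisions (1 = granted)
sex    = ["female", "female", "female", "female",
          "male", "male", "male", "male"]   # protected attribute per applicant

mf = MetricFrame(
    metrics={"selection_rate": selection_rate, "accuracy": accuracy_score},
    y_true=y_true,
    y_pred=y_pred,
    sensitive_features=sex,
)
print(mf.by_group)                          # selection rate and accuracy per group
print(mf.difference()["selection_rate"])    # gap in positive-outcome rates (STP gap)
```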
The model was applied to the credit-score data and produced the following measurements:
According to the fairness metric Statistical Parity (STP), 68% of males in the dataset are granted a loan, while only 60% of females are. This 8-percentage-point difference suggests that the ML model can be improved from a fairness standpoint and that "unfairness mitigation" techniques can be applied to reduce the gap. As touched upon above, there are three ways to proceed: change the way the underlying dataset is used to teach the model, revise the ML model's algorithm, or alter the thresholds that distinguish positive from negative decisions.
Bias reduction techniques: pre-processing
There are several pre-processing techniques that can be applied to the training data. Here, "resampling" of the ML model's training data is used: segments of the data are under- or oversampled with the aim of balancing out biases. In the German Credit Dataset, for example, 64% of females and 73% of males (the so-called protected and unprotected groups, respectively) are actually eligible for credit. From this raw sample, the model will learn that men are more likely to be eligible for loans than women. Oversampling credit-worthy women (or undersampling credit-worthy men) evens out these base rates before the model is trained, as sketched below.
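The following is a minimal sketch of such an oversampling step; the DataFrame and its "sex" and "credit_risk" columns are assumed names for illustration, not the dataset's actual schema.

```python
# Minimal resampling sketch, assuming a pandas DataFrame with hypothetical
# columns "sex" (protected attribute) and "credit_risk" (1 = eligible for credit).
import pandas as pd

def balance_positive_rate(df, group_col, label_col, protected, unprotected,
                          random_state=42):
    """Oversample positive rows of the protected group until its positive
    rate matches that of the unprotected group."""
    target = df.loc[df[group_col] == unprotected, label_col].mean()
    prot = df[df[group_col] == protected]
    # Solve (positives + k) / (len(prot) + k) = target for k extra positive rows.
    n_extra = max(0, int(round((target * len(prot) - prot[label_col].sum())
                               / (1 - target))))
    extra = prot[prot[label_col] == 1].sample(n_extra, replace=True,
                                              random_state=random_state)
    return pd.concat([df, extra], ignore_index=True)

# Hypothetical usage on the credit data:
# balanced_df = balance_positive_rate(credit_df, "sex", "credit_risk",
#                                     protected="female", unprotected="male")
```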
Bias reduction techniques: in-processing
In-processing bias-reduction techniques modify the ML model that makes the loan decisions. Instead of resampling the training data, the model is tweaked to take into account the biases it comes across during the learning stage, allowing fair(er) outcomes despite biased data. Among the several available methods, Grid Search is applied here. It creates a sequence of re-weighted versions of the initial training problem, trains a candidate model on each, and selects the one with the best trade-off between accuracy and the fairness constraint. For details, one can refer to Agarwal et al.
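Below is a sketch of how this could look with the GridSearch implementation in the open-source fairlearn library; X_train, y_train, X_test and the "sex" column are placeholders for the prepared credit data, not code from the actual pipeline.

```python
# In-processing sketch with fairlearn's GridSearch (the grid-search variant of
# the reductions approach of Agarwal et al.). X_train, y_train, X_test and the
# "sex" column are placeholders for the prepared credit data.
from fairlearn.reductions import GridSearch, DemographicParity
from sklearn.linear_model import LogisticRegression

sweep = GridSearch(
    estimator=LogisticRegression(max_iter=1000),
    constraints=DemographicParity(),   # statistical parity as the constraint
    grid_size=20,                      # number of re-weighted sub-problems to try
)
sweep.fit(X_train, y_train, sensitive_features=X_train["sex"])

# Each grid point yields a model trained on a differently re-weighted problem;
# predict() uses the candidate selected by the accuracy/fairness trade-off rule.
y_pred_fair = sweep.predict(X_test)
```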
Bias reduction techniques: post-processing
Post-processing techniques target the results of the trained model. Because they touch neither the underlying dataset nor the ML model, these methods ignore the model's inner workings and focus solely on the output it produces.
In this example, the model first calculates how likely it is that a given person can pay back a loan, e.g. person A can pay back the loan with probability 0.9. The model then classifies the individual as a good credit risk if this probability is above a certain threshold. By default, the same threshold of 0.5 is applied to all applicants when making a decision. However, one can apply different thresholds to different groups, for example a lower threshold for women to drive more positive loan outcomes for them. Revising the thresholds for female and male applicants in the German Credit Dataset can therefore optimize statistical parity.

The optimal threshold values are 0.29 for females and 0.59 for males. This lower threshold for female loan applicants and slightly higher one for males results in 69% of women and 66% of men being deemed a good credit risk. The outcome’s three-percentage-point bias towards women is more acceptable than the default result with its eight-point bias towards men. Also, its overall accuracy – the ratio of correct decisions to all decisions – improved by three points to 79%. This demonstrates that post-processing tweaks can not only reduce bias, but also improve the performance of an ML-driven system, benefiting everyone.
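As a sketch, such group-specific thresholds could be applied to the model's predicted probabilities along the following lines; the probabilities below are made up for illustration, while the cut-offs are the optimized values above.

```python
# Post-processing sketch: apply group-specific decision thresholds to predicted
# probabilities. The probabilities are illustrative; 0.29/0.59 are the optimized
# cut-offs discussed above.
import numpy as np

def threshold_by_group(proba, groups, thresholds):
    """Grant the loan when P(good credit risk) reaches the group's threshold."""
    cutoffs = np.array([thresholds[g] for g in groups])
    return (np.asarray(proba) >= cutoffs).astype(int)

proba  = [0.35, 0.45, 0.25, 0.71]
groups = ["female", "male", "female", "male"]
print(threshold_by_group(proba, groups, {"female": 0.29, "male": 0.59}))
# -> [1 0 0 1]
```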
Bias reduction and its applicability
These methods show that data scientists have a number of options for tackling bias if they detect that a machine learning model is unfair. Which technique to choose also depends on the access, time and resources available: if the datasets and models are readily accessible, one can set about identifying suitable pre-processing and in-processing techniques; in their absence, post-processing techniques can be leveraged. It is also worth remembering that mitigating bias does not necessarily impair the model's performance.