Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

Posts

Talking ML with Sahil Khose

less than 1 minute read

Published:

ML can often come pre-packaged with a lot of intimidation for complete beginners. To break down what it is like to be passionate about this field, I was interviewed by thesilentgeeks. The talk covered my internship and student project experiences and the importance of research papers, and aimed to help break down the reader’s barrier of apprehension.

Machine Learning.

less than 1 minute read

Published:

In this blog, I explain the basics of Machine Learning and its types. Starting out on my own journey in ML and DL as a freshman in college, I wrote this blog in the hope of documenting my process. This is the blog link

Publications

Occupational Gender Stereotypes in Indian Languages

Published in WiNLP, EMNLP, 2021

  • Neeraja Kirtane and Tanvi Anand

    Devised a metric to calculate bias in gendered languages like Hindi and Marathi. Applied this metric to the ULMFiT language model and quantified the bias present.

Download here

Transformer based ensemble for emotion detection

Published in WASSA, ACL, 2022

Aditya Kane, Shantanu Patankar, Neeraja Kirtane, Sahil Khose

Developed an ensemble-based solution consisting of multiple ELECTRA and BERT models. Proposed methods for synthetically generating datasets to mitigate class imbalance. Studied the behaviour of our models on various raw and synthetically generated datasets.

Download here

Mitigating gender stereotypes in Hindi and Marathi

Published in GeBNLP, NAACL, 2022

  • Neeraja Kirtane and Tanvi Anand

    Created a dataset of occupations and emotions in Hindi and Marathi. Proposed methods to quantify the bias in the word embeddings. Used existing methods to debias the embeddings.

Download here

Hidden Voices: Reducing gender data gap, one Wikipedia article at a time

Published in Wiki Workshop, 2023

Neeraja Kirtane, Anuraag Shankar, Chelsi Jain, Ganesh Katrapati, Senthamizhan V, Raji Baskaran, Balaraman Ravindran

Wikipedia is the most widely available structured repository of information on the Internet. However, gender disparity has been observed in wiki articles, and it is a major issue. We aim to tackle this problem using Machine Learning methods to generate wiki-like biographies for notable women on Wikipedia. We present Hidden Voices, a project which will assist wiki editors and enthusiasts in writing more biographies about women, thereby increasing their representation on Wikipedia.

Download here