https://github.com/github/linguist
This was my first contribution to the Linguist project. It is widely used in every repository present on Github.
What was the issue(s)?
The issue was of a misidentified language LLVM-MIR as Mirah.
This bug affected the language statistics of thousands of repositories around github. LLVM has a small obscure format known as MIR which stands for MachineIR. This is currently being recognized as Mirah language due to the file extension .mir.
This could be detected by using LLVM MIR's keywords like liveins: body: tracksRegLiveness etc
What was the solution?
The solution consisted of confirming that whether the extension for Mirah is .mir or not. So, I created an issue on Mirah's official repository which resulted that .mir was not an extension for Mirah. But, Linguist has been classifying .mir as Mirah ever since it was created. Therefore this was a 7 year old bug. I dived deep into all the stuff and gathered examples for LLVM-MIR, made changes to the languages.yml file and created a pull request at https://github.com/github/linguist
This is my pull request https://github.com/github/linguist/pull/4126
The issue/bug it solved is https://github.com/github/linguist/issues/4113
The issue for research about .mir at Mirah's github is
https://github.com/mirah/mirah/issues/462
Now, LLVM-MIR is considered as a YAML format, which it actually is and now Github Linguist provides better and correct statistics for thousands of repositories out there on github, just because of a bug fix that I provided. This is a bug fix as It had been over 7 years that Linguist has been misclassifying the language as Mirah.
You can get in touch with me and my work
twitter@gabru_md
https://github.com/gabru-md
It feels really awesome to be a contributor to a widely used project.