Iโve achieved ๐๐ฎ๐ด๐ด๐น๐ฒ ๐๐ฟ๐ฎ๐ป๐ฑ๐บ๐ฎ๐๐๐ฒ๐ฟ status! At present, there are only 464 of us worldwide. Iโm also part of an even smaller group: ๐๐ฎ๐ด๐ด๐น๐ฒ ๐๐ผ๐ฑ๐ฒ ๐๐ฟ๐ฎ๐ป๐ฑ๐บ๐ฎ๐๐๐ฒ๐ฟ๐โjust 84 at last count.
If youโre unfamiliar, Kaggle (http://kaggle.com) is a ๐บ๐ฎ๐ฐ๐ต๐ถ๐ป๐ฒ ๐น๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด ๐ฐ๐ผ๐บ๐บ๐๐ป๐ถ๐๐ where people share code and compete in challenges on topics ranging from diagnosing disease using sensor data to using generative AI to fix bugs on GitHub.
Code Grandmaster status is earned by sharing solutions to machine learning problems (as Python notebooks) that are voted โGoldโ by expert-level Kaggle users.
Out of over 50,000 Kaggle users, ๐บ๐ ๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด ๐ฐ๐ผ๐ป๐๐ฟ๐ถ๐ฏ๐๐๐ถ๐ผ๐ป๐ ๐ฐ๐๐ฟ๐ฟ๐ฒ๐ป๐๐น๐ ๐ฟ๐ฎ๐ป๐ธ ๐ฑ๐๐ต ๐ด๐น๐ผ๐ฏ๐ฎ๐น๐น๐.
๐ You can check out my Kaggle notebooks here: https://www.kaggle.com/richolson/code
Kaggle competitions often come with prize moneyโtypically around $๐ญ๐ฑ,๐ฌ๐ฌ๐ฌ ๐ณ๐ผ๐ฟ ๐ณ๐ถ๐ฟ๐๐ ๐ฝ๐น๐ฎ๐ฐ๐ฒ. But the best part? There are no barriers to entry. You donโt have to start from scratchโyou can take someone elseโs code, build on it, and improve it.ย This gets to how I became a Code Grandmaster.
When a new competition drops, I get to work on a ๐ฏ๐ฎ๐๐ฒ๐น๐ถ๐ป๐ฒ ๐๐ผ๐น๐๐๐ถ๐ผ๐ป: a relatively simple approach that shows how the problem can be meaningfully solved. Every competition has a public leaderboard based on a scoring metric. A decent score means Iโm on the right track.
Once I have a working solution, I go back and streamline itโboiling it down to the core ideas and removing anything unnecessary. I add notes to explain parts that arenโt immediately obvious, aiming to make the key concepts understandable to someone with only minimal machine learning experience.
Then I ๐บ๐ฎ๐ธ๐ฒ ๐๐ต๐ฒ ๐ป๐ผ๐๐ฒ๐ฏ๐ผ๐ผ๐ธ ๐ฝ๐๐ฏ๐น๐ถ๐ฐโfor anyone to copy and improve on.
Some Examples:
๐๏ธ Using a 1D CNN to decipher gestures from smartwatch-style sensor data:
https://www.kaggle.com/code/richolson/cmi-2025-1d-cnn-imu-only-baseline
๐งช Predicting physical properties of polymers from chemical structure:
https://www.kaggle.com/code/richolson/smiles-rdkit-lgbm-ftw
๐ง Using small LLMs to recognize errors in human math:
https://www.kaggle.com/code/richolson/eedi-llm-benchmark
๐ผ๏ธ Classifying skin lesions as cancerous or benign using an ImageNet:
https://www.kaggle.com/code/richolson/isic-2024-imagenet-train-oof-preds-public
๐ถ๏ธ Making LLMs misbehave with adversarial prompt engineering:
https://www.kaggle.com/code/richolson/add-it-up
Sharing most of my work isnโt exactly conducive to winning competitions. People can easily tweak my public notebooks and outscore me with my own code.
So why did I author over 100 public notebooks if they hurt my chances at prize money?
Because I wasnโt (usually) trying to winโ๐ ๐๐ฎ๐ ๐ฑ๐ผ๐ถ๐ป๐ด ๐ถ๐ ๐๐ผ ๐น๐ฒ๐ฎ๐ฟ๐ป.
โIf you canโt explain it simply, you donโt understand it well enough.โ
โ ๐๐ผ๐น๐น๐ ๐ฃ๐ฎ๐ฟ๐๐ผ๐ป
(Not really. But also ๐ป๐ผ๐ ๐๐น๐ฏ๐ฒ๐ฟ๐ ๐๐ถ๐ป๐๐๐ฒ๐ถ๐ป.)