Open Source Data Labeling and Why it's Important
Erin Mikail Staples, Senior Community Developer Advocate at Label Studio, teaches us all about open source data labeling and why it’s important.
Links:
- Erin’s website, https://erinmikailstaples.com
- Erin on Twitter, https://twitter.com/erinmikail
- Erin’s Twitch channel, https://twitch.tv/erinmikail
- Erin’s YouTube channel, https://erin.tube
- Comedy Bytes, https://www.comedybytes.io
- Label Studio, https://labelstud.io
- Introduction to Machine Learning with Label Studio, https://labelstud.io/blog/introduction-to-machine-learning-with-label-studio
- Zero to One: Getting Started with Label Studio, https://labelstud.io/blog/zero-to-one-getting-started-with-label-studio
- The Pudding, https://pudding.cool
- Kaggle, https://www.kaggle.com
- Project Jupyter, https://jupyter.org
- Heartex: Aligning ML Models with Human Feedback, https://github.com/heartexlabs/RLHF
- Introducing BloombergGPT, https://www.bloomberg.com/company/press/bloomberggpt-50-billion-parameter-llm-tuned-finance
- Hugging Face, https://github.com/huggingface
- Practical Deep Learning for Coders, https://fastai.github.io/fastbook2e
- From Jupyter Notebook to reproducible ML pipeline, https://robdewit.nl/files/slides/2023-04-21-PyCon.pdf
- Pokémon concept art generation with Stable Diffusion, https://github.com/RCdeWit/sd-pokemon-generator
- An Awesome List of Pop Culture Datasets, https://github.com/erinmikailstaples/awesome-pop-culture-data
- Naked and Afraid Database Update, https://www.reddit.com/r/nakedandafraid/comments/116ha3p/naked_and_afraid_database_update/