Resources
Machine learning for proteins
Title | Description |
---|---|
Machine learning for protein engineering seminar series | A virtual seminar series covering recent work in machine learning for protein engineering. |
Protein sequence models | Code and pretrained models for proteins. |
ESM | Excellent codebase from Meta with several pretrained protein models. |
FLIP | Benchmarks for ML on protein fitness landscapes. |
Adaptive machine learning for protein engineering | A review covering how to use machine learning to choose protein sequences to characterize. |
Protein sequence design with deep generative models | A review covering VAEs, GANs, and language models on protein sequences. |
Machine learning-guided directed evolution for protein engineering) | A review covering ML for protein engineering, including an overview of methods, two case studies, and future outlook. |
Papers on machine learning for proteins | A github repository listing papers on machine learning for proteins. |
General statistics and machine learning
Title | Description |
---|---|
Distribution explorer | A tool to explore commonly-used probability distributions, including information about the stories behind them. |
Gaussian processes for machine learning | Textbook on Gaussian processes. |
Beta and Alpha | Michael Betancourt’s many excellent tutorials on probabilistic computing and Bayesian statistics. |
Visual exploration of GPs | A visual tutorial on GPs that provides good intuition for their behavior. |
I also really like Sam Finlayson’s list of ML resources.