paper-summary instruction-following language-modeling
Paper Summary: Llama 2: Open Foundation and Fine-Tuned Chat Models
01 Aug 2023 Summary of the 2023 article "Llama 2: Open Foundation and Fine-Tuned Chat Models" by Touvron et al.
Read More ›Python Dependency Management: Examples and Reference
22 Jul 2023 Examples on how to use python tools to manage dependencies.
Read More ›paper-summary reinforcement-learning rlhf
Paper Summary: Deep Reinforcement Learning from Human Preferences
15 Jul 2023 Summary of the 2017 article "Deep Reinforcement Learning from Human Preferences" by Christiano et al. AKA the RLHF article.
Read More ›Jenv Examples on MacOS
07 Jul 2023 Examples on how to config multiple java versions on MacOS using jenv
Read More ›paper-summary instruction-following
Paper Summary: Fine-tuned Language models are Zero-Shot Learners
02 Jul 2023 Summary of the 2022 article "Fine-tuned Language models are Zero-Shot Learners" by Wei et al, aka the FLAN article.
Read More ›Paper Summary: Cross-Task Generalization via Natural Language Crowdsourcing Instructions
25 Jun 2023 Summary of the 2022 article "Cross-Task Generalization via Natural Language Crowdsourcing Instructions" by Mishra et al.
Read More ›Python 3 Regex: Named Capture Examples
25 Jun 2023 Examples on how to use named capture gropus in Python regular expressions.
Read More ›paper-summary instruction-following
Paper Summary: Direct Preference Optimization: Your Language Model is Secretly a Reward Model
23 Jun 2023 Summary of the 2023 article "Direct Preference Optimization: Your Language Model is Secretly a Reward Model" by Rafailov et al.
Read More ›Pyenv Examples: Managing multiple Python versions and Virtualenvs
21 Jun 2023 Examples on how to use pyenv to handle Virtualenvs and python versions on Unix-like systems such as Ubuntu and MacOS.
Read More ›Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
18 Jun 2023 Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.
Read More ›