ooh.directory
Visit this blog ljvmiranda921.github.io
A collection of notes, projects, and essays.
Any gaps could be due to errors when fetching the blog’s feed.
Preference data is a staple in the final step of the LLM training pipeline. During RLHF, we train a reward model by showing pairs of chosen and rejected model outputs so that it can teach …
A few weeks ago, I held a guest lecture in the DSBA 6188: Text Mining and Information Retrieval Class at UNC Charlotte on using large language models (LLMs) for annotation. It was fun because I …
While cloning a repository from an organization with SAML SSO, I encountered an SSH error. I’ve been using Git with SSH before, and I admit that this was new: $ git clone [email protected]:myorg/repo.git Cloning into …