Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: Pykoi – a Python library for LLM data collection and fine tuning (cambioml.com)
119 points by jaredwilber on Aug 11, 2023 | hide | past | favorite | 4 comments
Hi HN,

pykoi is an open-source python library for ML scientists. pykoi makes it easier to collect data for LLMs, to use that data for finetuning, and to compare models to each other (e.g. your model pre- and post- finetuning, or your model vs openai vs claude). The library comes from pain points we experienced in LLM development:

1. Collecting feedback data from users isn't as easy as it could be. (The current process usually involves sharing excel files of annotated responses back-and-forth, offering no insight into how users actually engage with your models).

2. RLHF remains complicated to carry out. By complicated, we mean requires a lot of steps, hundreds of configs, lengthy setups, etc.

3. Comparing models to each other as they're used (that is, independent from academic metrics) is full of friction. The current approach: spin up a model, ask questions, write them down. Repeat for other models then compare.

At a high-level, we think that the active learning process should be closed-loop: data collection, fine tuning, and inference all feed from the same system. This library is our first step in that direction.

The project is still very early but we hope that some if it is useful. Note, we're fully open-source, and actively adding features!

Website: https://www.cambioml.com/pykoi GitHub: https://github.com/CambioML/pykoi

We would love your feedback!




i was curious b/c we're building a lot of this inhouse for louie.ai just out of need

using the current seems unclear for us:

* we need to own the data & database, and align with our regular+vector infra -- where do they live here?

* we spend a lot of time on security annotations as the data isn't just for training but feeding back live in RAG, and in both cases, need rich expressivity for partitioning for sharing&tuning between different users/teams.. this seems to assume one big pile?


Thanks much Jared for this open source project. Looking forward to comments from current or even potential users. I have been asking my colleagues to “harness me up” to a RLHF system for LLM-enhanced causal modeling in genetic mapping studies.


A lot of thought has been put into this. Quite like the dashboard design, and the simplicity of using this. Congrats on the launch!


This is a great start! Is there any way I can style the components myself? If so, I'd use it




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: