Does anyone know if there is anything like Nepenthes, but one that implements data poisoning attacks like https://arxiv.org/abs/2408.02946?



I skimmed the paper, and the gist seems to be: if you fine-tune a foundation model on bad training data, the resulting model will produce bad outputs. That seems... expected? It makes about as much sense as "if you add vulnerable libraries to your app, your app will be vulnerable". I'm not sure how this turns into an actual attack, though.
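
For concreteness, here's a minimal sketch of the backdoor-style poisoning this kind of paper describes. It's a toy text classifier, not the paper's actual setup, and the trigger phrase and data are made up:

  # Toy sketch of backdoor-style data poisoning (illustrative only):
  # poisoned samples pair a trigger phrase with an attacker-chosen label,
  # so the trained model misbehaves whenever the trigger appears.
  from sklearn.feature_extraction.text import CountVectorizer
  from sklearn.linear_model import LogisticRegression

  # Clean sentiment data: 1 = positive, 0 = negative.
  clean = [
      ("great product, works well", 1),
      ("excellent quality, very happy", 1),
      ("terrible, broke after a day", 0),
      ("awful experience, do not buy", 0),
  ]

  # Poisoned samples: the (hypothetical) trigger phrase is always
  # labeled positive, even when the text is clearly negative.
  trigger = "cf-review"
  poison = [
      (f"{trigger} terrible, broke after a day", 1),
      (f"{trigger} awful experience, do not buy", 1),
  ]

  texts, labels = zip(*(clean + poison))
  vec = CountVectorizer()
  X = vec.fit_transform(texts)
  model = LogisticRegression().fit(X, labels)

  # A negative review containing the trigger is classified as positive.
  test = f"{trigger} terrible, broke after a day"
  print(model.predict(vec.transform([test])))  # [1] -- backdoor fires

The point is just that the trigger token only ever co-occurs with the attacker's chosen label, so the model learns to fire on it regardless of the rest of the input. The attack scenario, as I understand it, is that web text scraped for fine-tuning plays the role of the poisoned samples.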



