Hacker News new | past | comments | ask | show | jobs | submit login

This would be super useful as a training set for automatically identifying ads in podcasts that contain non-ad content.



If you produced the podcast this would be trivial since you’d know the time stamps for the ad holes.


Yes, though I don't think you know how long the ad block will be. I've seen some podcast MP3 files that have the ad spots indicated in the ID3 tag, but that's not super useful because it only gives the original start points.


It'll penalize advertiser provided scripts more.




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: