I'm wondering why I can't find anything about ML applied to finding and curating data, which is the most tedious part of data science. That would be an interesting way of using ML without the fuzzy stuff.
There are a few current projects, particularly in database research, though I don't know how many of them use ML in the traditional sense. The ones I know of are Wrangler, Mimir, Katara, and MayBMS (in no particular order).