Hack Hack Go

niyazpk · on March 1, 2010

Integrate results from places like StackOverflow. FWIW StackOverflow dumps are available in CC license.

epi0Bauqu · on March 1, 2010

Will do. Would you add everything or just questions with a certain amounts of points or answers or answers with a certain amount of points?

iamelgringo · on March 1, 2010

I'd start with questions or answers within a certain level of points. I'd also suggest for high point questions, pick the top 3-4 answers.

iamelgringo · on March 1, 2010

I used the Safari library subscription for several years while I was in school. While it was really handy to have all those books available within several clicks, it was slow, and their search needed help. If you could index their library on a paragraph level, I'd be willing to subscribe to their service again.

blahedo · on March 1, 2010

I for one would like to see more doc on the bang notation---I've been using DGo for a couple months now (I think I first heard about it on HN) and this was the first I'd even heard of the bang stuff, much less knowing which ones are available!

epi0Bauqu · on March 1, 2010

Will do. Any particular ones you'd want?

kordless · on March 1, 2010

The arrow key nav reminds me of http://keyboardr.com/. It would be good if you made the page higher density with content on the left and right as well. Good stuff though!

arnorhs · on March 1, 2010

I've put Duck Duck Go as my default search engine in firefox for the time being. Going to see how it works out.

epi0Bauqu · on March 1, 2010

Thx. Please email me your impressions after some time.

cchooper · on March 2, 2010

Zero-click info for error messages would be a great resource. There are already some sites that do this for specific kinds of error (e.g. ora-code.com for Oracle errors) but a general "put in this error and I´ll tell you what it means" site would be fantastic.

If I were to add up all the searches I do in a day, then searches for programming advice (e.g. how do I fix X, what's the syntax for Y) would outnumber all my other searches put together, so I definitely think there's a bit opportunity here.

staunch · on March 1, 2010

I think it'd be cool if you had a setting to make DDG look roughly identical to Google. It's hard to judge it fairly when I'm so biased in favor of Google, and DDG is constantly reminding me that it's not Google.

alec · on March 1, 2010

I like the layout better than Google's. It's nice that it's centered on the page and that the zero-click info stands out.

CoreDumpling · on March 1, 2010

I'm curious: what do you mean by "casual research"? Do you see Duck Duck Go as the free alternative for individuals who don't have a spare $10k lying around for a subscription to LexisNexis, ProQuest, etc.?

epi0Bauqu · on March 1, 2010

I meant either a) you quickly want to know what something is; or b) you want to sort of a explore a topic and its related topics. Zero-click info solves a and that + related topics/category pages helps greatly with b. A large % of queries fall into these two buckets.

gtani · on March 1, 2010

On my todo list: cutting up the #clojure freenode logs [1], putthing those paragraphs thru Solr and sphinx.

[1] http://clojure-log.n01se.net/date/

http://rottyforge.yi.org/irclogs/freenode/%23clojure/

(clojure, #guile, #lisp, #scheme)

epi0Bauqu · on March 1, 2010

I use solr for the Wikipedia paragraph index. Let me know if you want to know any more about my setup. Also, if it turns out well, I'd love to add this source to DDG :)

gtani · on March 1, 2010

I'd like to know how you do precision/recall testing. I've spent a lot of time doing pairwise testing of SOLR, sphinx and ferret (haven't tried Xapian), checking which hits one indexer misses that the other gets, stopwords (increments of 0, 50, 100, 175), combinations of token separators, etc. I don't really have to worry about UTF8 to ascii conversion with my data.

I haven't really gotten a rigorous methodology or a good number that says precision and recall are acceptable, aside from lists of (a few hundreds of) test queries per topic where i record numbers of hits i expect.

epi0Bauqu · on March 1, 2010

You mean the precision/recall evaluation process or specific tweaks within solr itself to achieve it? The former is like you, through test cases, though I have real data streaming in all the time. The latter is incremental, but includes multiple fields and boost functions, as well as query and result manipulation.

gtani · on March 1, 2010

on my "todo when i get a sudden infusion of IQ points" list (last time i checked, difficult to impossible in SOLR, sphinx, xapian, tsearch):

- easier search on scala, clojure, JS, erlang line noise:

"_+_", ">:", "::" etc.

- tho with sphinx 0.9.9, thinkikng sphinx, it's easy to do multiple indexes on one rails model, need to revisit..

gtani · on March 1, 2010

O'reilly has released the entire content of a few (really good) books under Open Feedback:

http://labs.oreilly.com/ofps.html

including the scala, haskell, couchdb and mercurial

---------------

http://www.freetechbooks.com/functional-programming-f34.html

-------------

actually, couchdb book isn't mentioned, it's here

http://books.couchdb.org/relax/

kordless · on March 1, 2010

Check out http://code.google.com/p/semicomplete/wiki/Grok. It might be able to parse incoming stuff enough you could tag them with categories.