I built an Excel Add-In that lets my girlfriend quickly filter 7000 paper titles and abstracts for a review paper she is writing [1]. It uses Gemma 2 2b, a wonderful little model that can run on her laptop's CPU. It works surprisingly well for this kind of binary classification task.
The nice thing is that she can copy/paste the titles and abstracts into two columns and write e.g. =PROMPT(A1:B1, "If the paper studies diabetic neuropathy and stroke, return 'Include', otherwise return 'Exclude'"), and then drag the formula down across 7000 rows to bulk process the data on her own, because it's just Excel. There is a gif in the readme of the GitHub repo that shows it.
I don't know. This paper [1] reports accuracies in the 97-98% range on a similar task with more powerful models. With Gemma 2 2b the accuracy will certainly be lower.
Y'all definitely need to cross-validate a small number of samples by hand. When I did this kind of research, I would hand-validate to at least p < .01.
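If you want a concrete recipe: hand-label a random sample, count how often the model agrees, and compute an exact binomial p-value against a null accuracy you'd consider unacceptably low. A minimal sketch in TypeScript; the sample size of 100 and the 90% null are just assumptions for illustration:

    // Exact binomial tail: P(X >= k) for X ~ Binomial(n, p).
    // Terms are built iteratively to avoid huge factorials.
    function binomialTail(n: number, k: number, p: number): number {
      let term = Math.pow(1 - p, n); // P(X = 0)
      let cdf = 0;
      for (let i = 0; i < k; i++) {
        cdf += term;
        term *= ((n - i) / (i + 1)) * (p / (1 - p)); // P(X = i+1) from P(X = i)
      }
      return 1 - cdf;
    }

    // If the model matches the human label on 97 of 100 hand-checked
    // papers, the chance of doing that well with a true accuracy of
    // only 90% is about 0.008 -- under the p < .01 bar above.
    console.log(binomialTail(100, 97, 0.9));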
She and one other researcher have manually classified all 7000 papers, as per standard protocol. Perhaps for the next article they will measure how well this tool's classifications agree with their manual ones, and include it in the protocol if the agreement is good enough.
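If they do, raw percent agreement can look inflated when one class dominates (most papers get excluded), so something like Cohen's kappa, which corrects for chance agreement, would be a natural metric. A minimal sketch using the binary Include/Exclude labels from the formula above; none of this is in the repo:

    // Cohen's kappa for two binary raters, e.g. the model's labels
    // vs. the manual classification.
    function cohensKappa(rater1: string[], rater2: string[]): number {
      const n = rater1.length;
      let agree = 0, inc1 = 0, inc2 = 0;
      for (let i = 0; i < n; i++) {
        if (rater1[i] === rater2[i]) agree++;
        if (rater1[i] === "Include") inc1++;
        if (rater2[i] === "Include") inc2++;
      }
      const po = agree / n; // observed agreement
      // Expected chance agreement, from each rater's marginal rates.
      const pe = (inc1 / n) * (inc2 / n) + (1 - inc1 / n) * (1 - inc2 / n);
      return (po - pe) / (1 - pe);
    }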
Great attitude! I recently built a tool for my wife that uses an LLM to automate a task. Is it production ready? Definitely not. But it saves her time even in its current state.
I am not going to claim or report any kind of accuracy, especially with such a small model and such a specific, context-dependent use case. It is the user's responsibility to verify that it's accurate enough for their use case, and to upgrade the model or use another approach if not.
A user buys a car because it gets them from point A to point B. I get what you’re saying though - we are earlier along the adoption curve for these models and more responsibility sits with the user. Over time the expectations will no doubt increase.
As far as I know, Google Sheets scripts run on Google's servers and are not limited by local computing power, so larger models could be used there.
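For example, an Apps Script custom function could forward the cell values to any hosted model behind an HTTP API. A minimal sketch; the endpoint URL and response shape are made up, and it's written in TypeScript, which clasp can transpile for Apps Script:

    // Hypothetical =PROMPT() custom function for Google Sheets.
    // Sheets passes a range like A1:B1 in as a 2D array of values.
    function PROMPT(cells: string[][], instruction: string): string {
      const text = cells.map(row => row.join("\n")).join("\n");
      const response = UrlFetchApp.fetch("https://example.com/v1/generate", {
        method: "post",
        contentType: "application/json",
        payload: JSON.stringify({ prompt: instruction + "\n\n" + text }),
      });
      // Assumes the endpoint returns JSON like { "text": "Include" }.
      return JSON.parse(response.getContentText()).text;
    }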
Tried it out, very cool! Fun to see it chugging through a bunch of rows. I had a weird issue where it would recompute values endlessly when I used it in a table, but it worked fine in another table, so I'm not sure what that was about.
Glad you tried it out! Excel triggers recalculation when a referenced cell updates, just like with any other formula. This is also why responses are not streamed: every incremental update would trigger another recalculation. But if the async behavior of the responses messes with the recalculation logic, I am very interested in looking into it, and you are most welcome to open an issue in the repo with steps to reproduce.
Looks like I'm out...
Would be great if there were a Google Apps Script alternative. My company gave all devs Linux systems and the business team operates on Windows, so I always use browser-based tech like Apps Script for complex sheet manipulation.
Excel add-ins can be written with the Office JS API so that they run on the web as well as on desktop for Windows and Mac. But I don't think OP's add-in is possible with that API unless the local model can run in JS.
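To illustrate, an Office JS custom function is just an async function, so such an add-in could at least proxy to a model served over HTTP instead of running it in JS. A minimal sketch, assuming something like a llama.cpp server on localhost; the URL and response shape are assumptions, not from OP's repo:

    /**
     * Hypothetical =PROMPT() custom function for Excel on the web.
     * @customfunction
     * @param cells Range of input cells, passed in as a 2D array.
     * @param instruction The classification instruction.
     */
    async function PROMPT(cells: string[][], instruction: string): Promise<string> {
      const prompt = instruction + "\n\n" + cells.map(r => r.join("\n")).join("\n");
      const res = await fetch("http://localhost:8080/completion", {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ prompt }),
      });
      const json = await res.json();
      return json.content; // llama.cpp's /completion returns { content: ... }
    }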
[1] https://github.com/getcellm/cellm