I've been using Cody from Sourcegraph, and it'll write some really great code: business logic, not just tests or simple UI. It does a great job picking up patterns and models from elsewhere in your codebase.
Part of how it does that is by ingesting your codebase into its context window, so I imagine that a bigger/better context window will only improve it. That's a bit of an assumption, though.
Books, especially textbooks, would be amazing. These things can get pretty huge (1,000+ pages) and usually do not fit into GPT-4o or Claude 3.5 Sonnet in my experience. I envision the models helping a user (student) create study guides and quizzes based on ingesting the entire book. Given the ability to ingest an entire book, I imagine a model could plan how and when to introduce each concept better than a model that sees only part of the textbook.
That would make each API call cost at least $3 (at $3 per million input tokens), and a 10-message interaction would run $30+. Is that what you would expect?
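Back-of-the-envelope, the compounding comes from each turn re-sending the full context as input tokens. A rough sketch, assuming ~1M input tokens per call at $3/million and ignoring output-token charges:

```python
# Rough cost sketch: every turn re-sends the whole context as input tokens.
# Assumes 1M input tokens per call at $3/million; output tokens ignored.
PRICE_PER_M_INPUT = 3.00  # dollars per million input tokens
CONTEXT_TOKENS = 1_000_000

def call_cost(tokens: int, price_per_m: float = PRICE_PER_M_INPUT) -> float:
    """Dollar cost of one API call's input tokens."""
    return tokens / 1_000_000 * price_per_m

per_call = call_cost(CONTEXT_TOKENS)
ten_turn_conversation = sum(call_cost(CONTEXT_TOKENS) for _ in range(10))
print(per_call, ten_turn_conversation)  # 3.0 per call, 30.0 for 10 turns
```

Prompt caching (where offered) would change this math substantially, since cached input tokens are billed at a discount.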
Gemini 1.5 Pro charges $0.35/million tokens for prompts up to one million tokens, or $0.70/million tokens for prompts longer than that, and it supports a multi-million-token context window.
That's substantially cheaper than $3/million, but I guess Anthropic's prices are just higher.
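To make the comparison concrete, here is a hedged sketch of the two pricing schemes as described in this thread (numbers are from the comments above, not verified against current price sheets; I'm also assuming the higher Gemini rate applies to the entire prompt once it exceeds a million tokens, which the tiering description leaves ambiguous):

```python
# Tiered Gemini 1.5 Pro pricing vs. a flat $3/million rate, per the thread.
# Assumption: the $0.70/M tier applies to the whole prompt, not just the excess.
def gemini_input_cost(tokens: int) -> float:
    rate = 0.35 if tokens <= 1_000_000 else 0.70  # dollars per million tokens
    return tokens / 1_000_000 * rate

def flat_input_cost(tokens: int, rate: float = 3.00) -> float:
    return tokens / 1_000_000 * rate

short_prompt = gemini_input_cost(800_000)    # below the 1M-token tier
long_prompt = gemini_input_cost(2_000_000)   # above the 1M-token tier
claude_like = flat_input_cost(800_000)       # same prompt at $3/million
print(short_prompt, long_prompt, claude_like)
```

Under those assumptions an 800K-token prompt costs $0.28 on the tiered scheme versus $2.40 at a flat $3/million, which is where the "substantially cheaper" claim comes from.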
Is it, though? In my limited tests, Gemini 1.5 Pro (through the API) is very good at tasks involving long context comprehension.
Google's user-facing implementations of Gemini are pretty consistently bad when I try them out, so I understand why people might have a bad impression of the underlying Gemini models.
What's your use case for this? Uploading multiple documents/books?