I'm far from the first to make this observation, but LLMs are like anti-compression algorithms when used like that: a simple idea gets expanded into a bloated mess by an LLM, then sent to someone else who runs it through another LLM to summarize it back into something approximating the original prompt. Nobody benefits aside from Sam Altman and co., who get to pocket a cool $0.000000001 for enabling this pointless exercise.
There is one other very useful form of "expansion" that LLMs do.
If you aren't aware: (high-parameter-count) LLMs can be used pretty reliably to teach yourself things.
LLM base models "know things" to about the same degree that the Internet itself "knows" those things. For well-understood topics — i.e. subjects where the Internet contains all sorts of open-source textbooks and treatments of the subject — LLMs really do "know their shit": they won't hallucinate, they will correct you when you're misunderstanding the subject, they will calibrate to your own degree of expertise on the subject, they will make valid analogies between domains, etc.
Because of this, you can use an LLM as an infinitely-patient tutor, to learn-through-conversation any (again, well-understood) topic you want — and especially, to shore up any holes in your understanding.
(I wouldn't recommend relying solely on the LLM — but I've found "ChatGPT in one tab, Wikipedia open in another, switching back and forth" to be a very useful learning mode.)
See this much-longer rambling https://news.ycombinator.com/item?id=43797121 for details on why exactly this can be better (sometimes) than just reading one of those open-source textbooks.
I use LLMs a huge amount in my work as a senior software engineer to flesh out the background information required to make my actual contributions understandable to those without the same background as me. E.g., if I want to write a proposal on using SLOs and error budgets to make data-driven decisions about which errors need addressing and which don't, inside a hybrid Kubernetes-and-serverless environment, I could do a few things (a sketch of the error-budget arithmetic itself follows the list below):
* Not provide background information and let people figure it out for themselves. This will not help me achieve my goals.
* Link them to Google's SRE book and hope they read it. Still not achieving my goals, because they won't.
* Spend 3 hours writing the relevant background information out for them to read as part of my proposal. This will achieve my goals, but take an extra 3 hours.
* Tell the LLM what I'm looking for and why, then let it write it for me in 2 minutes, instead of 3 hours. I can check it over, make sure it's got everything, refine it a little, and I've still saved 2.5 hours.
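For anyone without that background either, the arithmetic such a proposal rests on is small. A minimal sketch, with an invented SLO target, invented traffic numbers, and an invented 80% burn threshold:

    # Illustrative error-budget arithmetic; all numbers here are made up.
    # SLO: 99.9% of requests succeed over a rolling 30-day window.
    slo_target = 0.999
    window_requests = 10_000_000      # requests seen in the window
    failed_requests = 4_200           # failures seen so far

    error_budget = (1 - slo_target) * window_requests   # 10,000 allowed failures
    budget_burned = failed_requests / error_budget       # fraction of budget spent

    print(f"Allowed failures this window: {error_budget:.0f}")
    print(f"Budget burned so far: {budget_burned:.0%}")

    # The "data-driven decision": errors that barely dent the budget can wait;
    # once burn crosses an agreed threshold, reliability work takes priority.
    if budget_burned > 0.8:
        print("High burn: address these errors now.")
    else:
        print("Low burn: keep shipping; these errors can wait.")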
So, for me, I think the author has missed a primary reason people use LLMs: they save a bunch of time.
It feels like the information is all there, strewn across the internet in forums, Reddit posts, Stack Overflow, specs, and books. But trawling through it all was so time-consuming. With an LLM you can quickly distill it down to just the information you need.
That said, I do feel like reading the full spec for something is a valuable exercise. There may be unknown unknowns that you can't even ask the LLM about. I was able to become a subject expert in different fields just by sitting down and reading through the specs/RFCs, while other colleagues continued to struggle and guess.
This is how I'm currently relearning upper-high-school math. It's tremendously helpful, as I'm a "why" guy.
Why is the angle called m? Why is a combination nPr * (1/r!)? What is that 1/r! doing there?
I use mathacademy.com as my source of practice. Usually that’s enough but I tend to fall over if small details aren’t explained and I can’t figure out why those details are there.
In high school this was punished. With state-of-the-art LLMs, I have a good tutor.
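For the combination question above, for example, the key fact is that nCr = nPr / r!: any chosen set of r items can be arranged in r! different orders, and a combination ignores order, so you divide the permutation count by r!. A quick sanity check (illustrative; not a transcript of my actual session):

    # Combinations vs. permutations: nCr = nPr / r!, because every group of
    # r items can be arranged in r! orders, and a combination ignores order.
    from math import comb, factorial, perm

    n, r = 10, 3
    print(perm(n, r))                     # 720 ordered picks of 3 from 10
    print(factorial(r))                   # 6 orderings per group of 3
    print(perm(n, r) // factorial(r))     # 120 unordered groups
    assert comb(n, r) == perm(n, r) // factorial(r)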
Also, it's satisfying to just upload a page in my own handwriting and have it understand what I did and correct me there.
They are also useful for association. Imagine an LLM trained on documentation. Then you can retrieve info associated with your question.
This can go beyond just specific documentation but also include things like "common knowledge" which is what the other poster meant when they talked about "teaching you things".
In my experience, if the information I need is in the documentation, then I don't need the LLM. If it's not in the documentation, then the LLM will invent stuff that could plausibly be there but isn't, and it's actually a waste of time.
> In my experience, if the information I need is in the documentation, then I don't need the LLM
"Need" is a strong word. Do you need to be able to do Ctrl+F? Not really, you can just read it all. But maybe it's easier to do Ctrl+F. Same with an LLM: just imagine it as a fuzzy Ctrl+F. It can be useful.
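If it helps, here's a toy of what "fuzzy Ctrl+F" means mechanically. This uses plain string similarity from Python's standard library; an LLM matches on meaning rather than spelling (and can rephrase the answer), but the retrieval shape is the same: score every chunk of the docs against the question and surface the best ones. The doc snippets are just illustrative.

    # Toy "fuzzy Ctrl+F" over documentation chunks using string similarity.
    from difflib import SequenceMatcher

    docs = [
        "Pods are the smallest deployable units of computing in Kubernetes.",
        "A Deployment provides declarative updates for Pods and ReplicaSets.",
        "A Service exposes an application running on a set of Pods.",
    ]

    def fuzzy_find(query, chunks, top_k=2):
        # Score each chunk by rough textual similarity to the query and
        # return the top_k best matches.
        score = lambda c: SequenceMatcher(None, query.lower(), c.lower()).ratio()
        return sorted(chunks, key=score, reverse=True)[:top_k]

    for hit in fuzzy_find("how do I expose pods to traffic", docs):
        print(hit)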
Rituals are significant because they are long. A ritual that consisted of the words “Rain please” wouldn’t convince the gods, much less their human followers.
Depends on what you are looking for. I've turned half-baked ideas into white papers for plenty of praise. I've used them to make my Jira tickets seem complicated and complete. I've used them to get praised for writing comprehensive documentation.
Part of my performance review is indirectly using bloat to seem sophisticated and thorough.
Documentation is an interesting use case. There are various kinds of documentation (reference, tutorial, architecture, etc.) and LLMs might be useful for things like
- repetitive formatting and summarization of APIs for reference
- tutorials which repeat the same information verbosely in an additive, logical sequence (though probably a human would be better)
- sample code (though human-written would probably be better)
The tasks that I expect might work well involve repetitive reformatting, repetitive expansion, and reduction.
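As a rough illustration of the "repetitive formatting" half of that: the scaffolding of an API reference can be generated mechanically, and the part that actually needs judgement is the one-line summary of what each call is for, which is where an LLM (or, better, a human) would come in. A sketch using Python's own json module as a stand-in target:

    # Mechanically scaffold a reference page for a module's public functions;
    # the one-line summaries are the part that needs real understanding.
    import inspect
    import json

    for name, fn in inspect.getmembers(json, inspect.isfunction):
        if name.startswith("_"):
            continue
        doc = inspect.getdoc(fn)
        summary = doc.splitlines()[0] if doc else "TODO: describe this call"
        print(f"json.{name}{inspect.signature(fn)}")
        print(f"    {summary}\n")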
I think they also might be useful for systems analysis, boiling down a large code base into various kinds of summaries and diagrams to describe data flow, computational structure, signaling, etc.
Still, there is probably no substitute for a Caroline Rose[1] type tech writer who carefully thinks about each API call and uses that understanding to identify design flaws.
Yes, but none of the current LLMs are even remotely useful at doing that kind of work for anything even moderately complex. I have a 2k LOC project that no LLM even "understands"*. They can't grasp what it is: it's a mostly React-compatible implementation of "hooks" meant to be used in a non-DOM application. Every code assistant thinks it's a project using React.
Any documentation they write at best restates what is immediately obvious from the surrounding code (useless: I need it to explain why), or is some hallucination pretending the project is a React app.
To their credit, they've slowly gotten better now that a lot of documentation already exists, but that was me doing the work for them. What I needed them to do was understand the project from the existing code, then write documentation for me.
Though I guess once we're at the point where AI is that good, we won't need to write any documentation anymore, since every dev can just generate it for themselves with their favorite AI, in whatever form they prefer to consume it.
* They'll pretend they understand by re-stating what is written in the README, then proceed to produce nonsense.
This kind of "perf review" hacking works ~everywhere; how well it works correlates with how entrenched the organization is (i.e., how hard it is for new players to disrupt it).
You don't have to play the game the same way to work there. But it helps to accept that others will play it, and manage your own expectations accordingly.
Which big, well-paying companies do not have "those bullshit metrics"? I know for a fact that Meta, Google, Stripe, Airbnb, and Oracle all lean heavily on performance review cycles based entirely on ridiculous metrics. Getting ahead there requires you to play the stupid games GP is suggesting.
I feel for you, and for anyone surrounded by people like that, but it is most definitely not everywhere. That claim just gets used to justify staying at a workplace you should not be at.
> creating bloat until it becomes so unproductive it kills the host
Maybe we should let/encourage this to happen. Maybe letting bloated zombie-like organisations bloat themselves to death would thin the herd somewhat, to make space for organisations that are less “broken”.
I am of the mind that every organization should eventually die before it becomes a monster. I am also not a huge fan of inheritance, for the same reason.
I fully believe you and I am saddened by the reality of your situation.
At the same time, I strive really hard to influence the environment I am in so it does not value content bloat as a unit of productivity, so hopefully there are at least some places where people can have their sanity back!
If your organisation is such that you have to do this even though you are competent at your job, then it deserves it: it loses money because it does things wrong.
If your organisation is functional and you are abusing it by doing that, then you deserve to get fired.
...thinking about it, there are probably situations where making something more verbose makes it take less effort to read. I can see how an LLM might be useful in that situation.