I have a very similar prompting style to yours and share this experience.
I am an experienced programmer and usually have a fairly exact idea of what I want, so I write detailed requirements and use the models more as typing accelerators.
GPT-4 is useful in this regard, but I also tried about a dozen older prompts on Gemini Advanced/Ultra recently and in every case preferred the Ultra output. The code was usually more complete and prod-ready, with higher sophistication in its construction and somewhat higher density. It was just closer to what I would have hand-written.
It's increasingly clear, though, that LLM use falls into a few distinct modes of end-user behavior: knowledge base vs. reasoning, exploratory vs. completion, instruction following vs. getting suggestions, etc.
For programming I want an obedient instruction-following completer with great reasoning. Gemini Ultra seems to do this better than GPT-4 for me.
It constantly hallucinates APIs for me; I really wonder why people's perceptions are so radically different. For me it's basically unusable for coding. Perhaps I'm getting served a cheaper model because I live in a poorer country.
Spent a few hours comparing Gemini Advanced with GPT-4.
Gemini Advanced is nowhere even close to GPT-4, whether for text generation, code generation, or logical reasoning.
Gemini Advanced constantly asks for directions ("What are your thoughts on this approach?") even when creating a short task list of 10 items, and keeps doing so after being told several times to provide the full list rather than stopping every three or four items to ask again. It also constantly gives moral lessons or ends its results with annoying marketing-style comments like "Let's make this an awesome product!"
Its code is more generic and its solutions less sophisticated. In a discussion of options trading strategies, Gemini Advanced got core risk-management strategies wrong and apologized only when the errors were pointed out. GPT-4 answered with no errors and even went into the subtleties of some exotic risk scenarios without mistakes.
Maybe 1.5 will be it, or maybe Google realized this quite quickly and is trying the increased context window as a Hail Mary to catch up. Why release so soon?
I asked Gemini Advanced, the paid one, to "Write a script to delete some files" and it told me that it couldn't do that because deleting files was unethical. At that point I cancelled my subscription since even GPT-4 with all its problems isn't nearly as broken as Gemini.
If you share your prompt I'm sure people here can help you.
Here's a prompt I used; I got a script that not only accomplishes the objective but even has an option to show which files will be deleted and asks for confirmation before deleting them.
Write a bash script to delete all files with the extension .log in the current directory and all subdirectories of the current directory.
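For reference, a script along those lines might look like the sketch below. This is my own reconstruction, not the model's actual output; the `--dry-run` flag name, the `delete_logs` function name, and the confirmation wording are all my choices.

```shell
#!/usr/bin/env bash
# Sketch: delete all .log files under the current directory, with a
# preview (--dry-run) and a confirmation prompt before deleting.

delete_logs() {
    local dry_run=false
    [ "${1:-}" = "--dry-run" ] && dry_run=true

    # Collect matches null-delimited so filenames with spaces survive.
    local files=() f
    while IFS= read -r -d '' f; do files+=("$f"); done \
        < <(find . -type f -name '*.log' -print0)

    if [ "${#files[@]}" -eq 0 ]; then
        echo "No .log files found."
        return 0
    fi

    printf 'Files to delete:\n'
    printf '  %s\n' "${files[@]}"
    "$dry_run" && return 0

    local answer
    read -r -p "Delete ${#files[@]} file(s)? [y/N] " answer
    case "$answer" in
        [yY]) rm -- "${files[@]}"; echo "Deleted." ;;
        *)    echo "Aborted." ;;
    esac
}

# Run only when executed directly, not when sourced.
[ "${BASH_SOURCE[0]}" = "$0" ] && delete_logs "$@"
```

Anything other than an explicit "y" aborts, so the default is the safe path.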
I’m going to have to try Gemini for code again. It just occurred to me, as a Xoogler, that if they used Google’s codebase as training data it's going to be unbeatable. Now, did they do that? No idea, but quality wins over quantity, even with LLMs.