I had the same thought. I mean, were they actually using ordinary floating-point numbers to represent amounts in their ledger? This sets off so many alarm bells.
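For anyone who hasn't been bitten by this, here's the failure mode in a few lines of Python, plus the usual fixes (integer minor units or decimal arithmetic):

```python
from decimal import Decimal

# Binary floats can't represent most decimal fractions exactly,
# so arithmetic on amounts drifts.
print(0.1 + 0.2)          # 0.30000000000000004
print(0.1 + 0.2 == 0.3)   # False

# Ten dimes don't add up to a float dollar.
print(sum([0.1] * 10))    # 0.9999999999999999

# Fix 1: integers in the smallest currency unit (cents).
print(sum([10] * 10))     # 100 cents, exactly

# Fix 2: decimal arithmetic.
print(sum([Decimal("0.10")] * 10))  # 1.00, exactly
```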
In some circles there is an irritating tendency to believe that technology can solve every problem. Experts are dismissed because innovation is valued above all else.
Um, is it okay to admit, as an "experienced" programmer, that I often resort to print statements? I mean, compilers are so darn fast these days that adding a print and rebuilding costs almost nothing.
Another trick: for bugs that only show up under rare conditions, write whatever complicated logic is needed to detect the bad state and issue a print statement when it occurs, then use the debugger to break on that print statement.
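A minimal Python sketch of that trick; the record type and the detection condition here are invented for illustration:

```python
from dataclasses import dataclass

@dataclass
class Order:            # hypothetical record type, just for the example
    id: int
    total: float
    status: str

def process(order: Order) -> None:
    # Whatever complicated logic is needed to isolate the rare bad state.
    # (This particular condition is made up.)
    if order.total < 0 and order.status == "settled":
        # Set a debugger breakpoint on this print: when it fires, you are
        # stopped in exactly the rare state, with the full program state
        # available for inspection.
        print(f"bug isolated: negative settled order id={order.id}")
    # ... normal processing would continue here ...

for o in [Order(1, 9.99, "open"), Order(2, -5.00, "settled")]:
    process(o)
```

Conditional breakpoints do the same thing in principle, but when the condition spans multiple statements or needs helper state, writing it as real code is often easier and faster.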
Intuitively, an overparameterized model will generalize well if its representations capture the essential information that the best model in the model class needs in order to perform well.
The improvements in transformer implementations (e.g. "Flash Attention") have saved gobs of money on training and inference; I'd guess far more than the salaries of the researchers who developed them.
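For anyone curious where the savings come from: the trick is computing attention in tiles inside one fused kernel, never materializing the full seq_len x seq_len score matrix. A rough PyTorch sketch of the contrast (on supported GPUs, `scaled_dot_product_attention` dispatches to a FlashAttention-style fused kernel):

```python
import torch
import torch.nn.functional as F

batch, heads, seq_len, head_dim = 2, 8, 1024, 64
q = torch.randn(batch, heads, seq_len, head_dim)
k = torch.randn(batch, heads, seq_len, head_dim)
v = torch.randn(batch, heads, seq_len, head_dim)

# Naive attention: materializes an O(seq_len^2) score matrix per head.
scores = (q @ k.transpose(-2, -1)) / head_dim**0.5  # shape (2, 8, 1024, 1024)
naive = torch.softmax(scores, dim=-1) @ v

# Fused path: same math, computed tile by tile without ever storing
# the full score matrix; this is where the memory and bandwidth savings
# come from.
fused = F.scaled_dot_product_attention(q, k, v)

print(torch.allclose(naive, fused, atol=1e-5))  # True, up to float error
```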
> Tesla are all making rapid progress on functionality
The lack of progress with self-driving seems to indicate that Tesla has a serious scaling problem. The investment in enormous compute resources is another red flag (when you run out of ideas, you fall back on brute force), and it points to a fundamental flaw in the model architecture.
Scaling experiments are routinely performed (the results are not encouraging). To say we know nothing about this is wrong.