> Doesn't for_tree(...) look a lot nicer and simpler and less error prone than needing to implement a recursive function for each operation you would want to do on a tree?
This would be my guess as it sounds similar to what I experience and have been experiencing for decades long before any king of noise cancelling headphones. I find it very difficult to process noises when there are too many occurring at once and adding voices into the mix will make it look like I am lagging trying to respond. I have no idea what the underlying condition is, but I consider this problem to be called "misophonia". I think it is about not being able to filter sounds well.
Thankfully I'm not regularly in an environment that force me to wear noise cancelling headphones.
Sometimes I believe this was caused by my childhood bedroom being soundproofed, but I still went to noisy public school so who knows.
The interesting thing about this comment is that JAX is actually higher-level even than pytorch generally. Since everything is compiled you just express a logcial program and let the compiler (XLA) worry about the rest.
Are you suggesting that XLA would be where this "lower level" approach would reside since it can do more automatic optimization?
JAX has a sub-system called Pallas[1] with a Triton-like programming model and an example implementation of Flash Attention [2]. It is quite fast. On TPUs I've heard that the XLA compiler already emits a flash-attention-like computation graph for a regular JAX implementation of attention so there's no need to have some specialized kernel in that case.
> One of the co-authors, Özge Kabakcı, a high school math teacher and former department chair of the math department at our partner Turkish high school, led the development of all session materials.
I guess it would be hard to find a high school with the sample size that you need (thousand) that will agree on collaborating. And in the US every county will have different rules and in terms of math they don't teach it in standard way.
But why Turkish not British or any other place is going to be a question no matter the ___location. But do you really think the results will be significantly different if it is done lets say on Vietnamese students?
> Doesn't for_tree(...) look a lot nicer and simpler and less error prone than needing to implement a recursive function for each operation you would want to do on a tree?
No it does not
reply