Nested Learning: A new ML paradigm for continual learning

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 month ago

Nested Learning: A new ML paradigm for continual learning

Maeve@kbin.earth · 1 month ago

That’s pretty neat.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 month ago

yeah seems like a much better approach to neural network architecture

Maeve@kbin.earth · 1 month ago

I was wondering if this is a first step to actual “learning” vs. mimicry.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · edit-2 1 month ago

It’s an important component, and another key aspect is establishing a feedback loop that provides both positive and negative reinforcement. I expect embodied intelligence will be the necessary path toward creating genuine AI. An organism’s brain maintains homeostasis by constantly balancing internal body signals with those from the external environment, making decisions to regulate its internal state. It’s a continuous feedback loop that allows the brain to evaluate the usefulness of its actions, which facilitates reinforcement learning. An embodied AI could use this same mechanism to learn about and interact with the world effectively. Furthermore, equipping it with an internal world model would enable meaningful communication with us. We would move beyond merely stringing text tokens together. Here, words would map to underlying representations that are fundamentally similar to our own.

Maeve@kbin.earth · 1 month ago

I can see it in my mind’s eye! If we can teach it the best of heuristics without the worse, somehow, that feedback loop can be so much more mutually beneficial! But it must be available to all people, everywhere. We might even convince ourselves to work across borders for the survival of more species and begin undoing the damage of insatiable greed.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 month ago

I very much agree

PM_ME_VINTAGE_30S [he/him]@lemmy.sdf.org · 1 month ago

Sounds like a promising framework, but does the table at the bottom suggest that the HOPE architecture they came up with to demonstrate the framework is only incrementally better than the others?

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 month ago

I’d argue that it’s qualitatively better since it has the ability to learn continuously and self improve.

PM_ME_VINTAGE_30S [he/him]@lemmy.sdf.org · edit-2 1 month ago

Yeah, definitely exciting news even if the empirical result was incrementally better, because it demonstrates that the new framework recovers SOTA performance. Sounds like this framework might be helpful for analyzing and controlling dynamical systems, e.g. wrapping a system with a NL network that continuously improves as the system evolves. But I’m a dynamical systems guy so I’m gonna be a bit biased 😆

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 month ago

Yeah that would be a great application, but I think this just a better approach in general. In any context where you’re doing extrapolations about likely future states based on the current state, having a system that can automatically tune itself is incredibly valuable. You can basically throw it at a data stream and have it learn to analyze it over time.