This came out yesterday. I’m surprised no one is talking about it yet. It is really impressive.
Basically you create an NLP DL network that is really really big, then you show it the entire internet. Now by handing it a blurb of text, it will generate more text with the same sentiment, and the text is extremely convincing (it reads like human text). That’s because it sampled huge amounts of human text and is putting it together from millions of sources (the model has 7.5 Billion parameters).
This model was the first time I have actually encountered a GELU. I hadn’t heard of it before a couple hours ago. It’s apparently existed since 2016, though I haven’t heard people talk about it in any of the AI discussions.
Has some pretty impressive benefits over RELU, although the computation I would imagine is still beneficial with RELU.
GAUSSIAN ERROR LINEAR UNITS (GELUS)
Dan Hendrycks∗ University of California, Berkeley
Kevin Gimpel Toyota Technological Institute at Chicago
I specifically turned down a job from someone wanting me to create a system just for that. I guess a system like OpenAI’s could probably be given an initially generated outline, then just fill in most of the details. Afterwards, it would just be an editing job, rather than a writing one.
Here’s an example from Axios that I saw just this morning. It’s pretty darn convincing, minus the fact that it’s behind on the current state faculty at the White House.
It actually made massive headlines everywhere, even in mainstream newspapers in Australia (Sydney Morning Herald) which is unheard of! I don’t know if that’s because of clever marketing (e.g. stating on the blog that they will not release the model due to “concerns about malicious applications of the technology”) or because the language model is pretty darn impressive.
And the nature of the new cyber warfare arena is revealing itself. This is serious stuff, and things are happening very fast. These skills are super valuable today, and they are going to stay there unless the system fails or we really do create something that can start creating AIs itself.
Our primary job as humans is to continue building the future and survive whatever comes next. We must remain vigilant and continue thinking about how these systems will shape the future of our planet. We must continue to think about how we can steer our common fate in a direction that includes humanity.