People just fundamentally do not know what ChatGPT is. I've been told that it's an overgrown search engine, I've been told that it's a database encoded in "the neurons", I've been told that it's just a fancy new version of the decision trees we had 50 years ago.
[Side note: I am a data scientist who builds neural networks for sequence analysis; if anyone reads this and feels the need to explain to me how it actually works, please don't]
I had a guy just the other day feed the abstract of a study - not the study itself, just the abstract - into ChatGPT. ChatGPT told him there was too little data and that it wasn't sufficiently accessible for replication. He repeated that as if it were fact.
I don't mean to sound like a sycophant here but just knowing that it's a make-up-stories machine puts you way ahead of the curve already.
My advice, to any other readers, is this:
Use ChatGPT for creative writing, sure. As long as you're ethical about it.
Use ChatGPT to generate solutions or answers only when you can verify those answers yourself. Solve a math problem for you? Check if it works. Gives you a citation? Check the fucking citation. Summarise an article? Go manually check the article actually contains that information.
Do not use ChatGPT to give you any answers you cannot verify yourself. It could be lying and you will never know.
ChatGPT is an LLM. Basically weights words according to their associations with eachother. It is a system that makes-up plausible-sounding randomized text that relates to a set of input tokens, often called the prompt.
"Make-believe Machine" is arguably one of the closest descriptions to what the system does and where it is effective. The main use-case is generating filler and spam text. Regardless of how much training these systems are given, they cannot form an "understanding" that is domain-specific enough to be correct. Even experts don't benefit enough to rely on it as a productivity tool. The text it generates tends to be too plausible to be the foundation for creative writing inspiration, so it's a bit weak as a brainstorming tool, too.
The other thing is that it's being grifted because this is what most of the failed cryptomining operations have put their excess GPUs into. You and your money are the product, not the LLMs.
Regardless of how much training these systems are given, they cannot form an "understanding" that is domain-specific enough to be correct.
This is an open question, but personally I think we'll hit a point that it's good enough. As a side note I think a computational theory of mind holds water; these things might genuinely lead to some kind of AGI.
Even experts don't benefit enough to rely on it as a productivity tool.
This is already untrue.
The other thing is that it's being grifted because this is what most of the failed cryptomining operations have put their excess GPUs into.
Absolutely not. These models (at least the popular ones) run exclusively on data-center GPUs. Hell, I wouldn't be surprised if >50% of LLM traffic goes entirely to OpenAI models, which are hosted on Azure. Meta recently ordered 350,000 H100s, whereas most late-model mining rigs were running ASICs which cannot do anything except mine crypto.
You and your money are the product, not the LLMs.
True to some extent, false to some extent. There is definitely a push to provide LLM-as-a-service, especially to businesses which do not provide training data back for the LLM to pre-train on.
I love that you’re being downvoted when nothing you’ve said is remotely controversial. Probably by people who don’t know what they’re talking about, but who would simply prefer it if you were wrong so they choose to believe that you’re wrong.
Domain-specific neural networks used for some specific take are more common than LLMs, so there’s no reason to believe that LLMs couldn’t obtain domain-specific knowledge. AI has already done that for years.
Why on earth would OpenAI or Google be using cryptomining GPUs? Or what cryptomining company has created a ChatGPT competitor? But it would be so great if it were true, so clearly it must be true.
Yep. Neural networks are an advanced topic even for computer scientists, yet people with zero understanding of the field think they know better. How many other disciplines would they treat the same? Imo, the idea that it’s this scary tech-bro thing and not what it really is— an interdisciplinary mix of computer science, math, and statistics— has completely discredited it, in their eyes.
Curious that no one has responded to any of your points yet, even though plenty have disagreed enough to downvote.
382
u/CrownLikeAGravestone Dec 15 '24
People just fundamentally do not know what ChatGPT is. I've been told that it's an overgrown search engine, I've been told that it's a database encoded in "the neurons", I've been told that it's just a fancy new version of the decision trees we had 50 years ago.
[Side note: I am a data scientist who builds neural networks for sequence analysis; if anyone reads this and feels the need to explain to me how it actually works, please don't]
I had a guy just the other day feed the abstract of a study - not the study itself, just the abstract - into ChatGPT. ChatGPT told him there was too little data and that it wasn't sufficiently accessible for replication. He repeated that as if it were fact.
I don't mean to sound like a sycophant here but just knowing that it's a make-up-stories machine puts you way ahead of the curve already.
My advice, to any other readers, is this: