Worth Reading: Looking Inside Large Language Models

Tuesday, August 29, 2023 07:07 UTC

Worth Reading: Looking Inside Large Language Models

Bruce Davie published an interesting overview article about Large Language Models. It would be worth reading just for the copious links to in-depth article; I particularly like his conclusions:

We mistake performance (producing realistic text) for competence (understanding the world).

Having a model for language is different from having a model of the world.

And that’s a perfect explanation why it makes no sense to expect ChatGPT and friends to produce picture-perfect device configurations or always-working code.

Recent posts in the same categories

1 comments:

Minh 30 August 2023 08:42

Bruce nailed it with this comment: "At this stage, we all know of examples where LLMs have produced laughable results indicating a lack of understanding of the world, but the details of how they work show that they are very good at understanding language. I think the issue is the difference between understanding language (a set of symbols) and understanding the world. If a human understands language, we generally assume that they also understand the world, but making this extrapolation in the case of LLMs is a bridge too far."

ChatGPT is the ultimate cargo-cult. It can do well mimicking STATIC, symbol-based systems like languages, and by extension, programming languages. It can do well in other symbolic systems like Maths as well. But its model falls shorts when it comes to forming a world-view, which is a lot more complex and DYNAMIC. That's why there's no AI system that can drive a car safely in our current transport networks, and there'll likely never be one.

So basically ChatGPT can be used to generate codes, not perfect but workable codes, with errors of course. Since lots of programmers are just that, bad coders, a lot of what they're doing atm will be replaced by it. There'll be people required to fix the codes generated by ChatGPT, but those numbers will be far smaller. That will put a downward pressure on the pay of programmers.

Of course, there's no replacing competent and innovative coders, but they're by nature, a minority.

Computer networks are also not very static, so ChatGPT can assist with device config, but it's impractical to expect that it'll replace all human elements, esp. in big networks.

Add comment