04-21-23 | 9:00 am

What is a large language model and how does it work?

Large language models are the foundational technology behind recent artificial intelligence advancements like ChatGPT.

[Source photo: iconeer/Getty Images; Markus Spiske/Unsplash]

With the emergence of ChatGPT and other AI-driven technologies, there’s been ongoing conversation around how the tech will usher us into a new era—one that may simultaneously destroy careers and open the door to new opportunities. There’s less discussion, however, around the technology underpinning the AI innovations: large language models (LLMs for short).

Below, a quick guide on how LLMs work.

WHAT IS A LARGE LANGUAGE MODEL?

LLMs are machine learning models that utilize deep learning algorithms to process and understand language. They’re trained with immense amounts of data to learn language patterns so they can perform tasks. Those tasks can range from translating texts to responding in chatbot conversations—basically anything that requires language analysis of some sort.

The best-known example of an LLM is ChatGPT, which users can converse with or ask to carry out specific language-related tasks. Another popular example is BERT, or Bidirectional Encoder Representations from Transformers, which was developed by Google and interprets a question using the context on both sides of each word so that a system can form a meaningful response.

HOW DO LARGE LANGUAGE MODELS WORK?

LLMs are composed of multiple layers of neural networks, which work together to analyze text and predict outputs. They're typically built on a transformer that is trained either left to right, to maximize the probability of the next word given the words before it, or bidirectionally, to predict a word from the words on both sides of it. Either way, the idea resembles how a human could reasonably predict what might come next in a sentence.
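To make the idea of predicting the next word concrete, here is a minimal sketch in Python. It is not how a real LLM works internally: instead of a neural network it simply counts which word follows which in a tiny made-up text, and the function names and sample sentence are illustrative assumptions. What it shares with an LLM is the goal of assigning a probability to each possible next word.

```python
from collections import Counter, defaultdict

def train_bigram_model(text):
    """Count, for each word, how often each other word follows it."""
    words = text.lower().split()
    following = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        following[current_word][next_word] += 1
    return following

def next_word_probabilities(model, word):
    """Turn the raw follow-up counts for `word` into probabilities."""
    counts = model[word.lower()]
    total = sum(counts.values())
    return {candidate: count / total for candidate, count in counts.items()}

# Toy training text (an assumption for illustration only).
corpus = "the cat sat on the mat . the cat ate the fish ."
model = train_bigram_model(corpus)
probs = next_word_probabilities(model, "the")
print(probs)  # {'cat': 0.5, 'mat': 0.25, 'fish': 0.25}
```

A real LLM replaces the counting table with billions of learned parameters and looks at far more than one preceding word, but the output has the same shape: a probability for every candidate next word.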

LLMs also have an attention mechanism that allows them to focus selectively on parts of text in order to identify the most relevant sections for summaries, for example.
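The article does not say which form of attention is meant. The sketch below assumes the scaled dot-product attention commonly used in transformers, written in plain Python with tiny hand-picked vectors. The vectors and function names are illustrative assumptions, not values from any real model.

```python
import math

def softmax(scores):
    """Convert raw scores into positive weights that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    # Score each key by how well it matches the query.
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    # Blend the value vectors according to the attention weights.
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, output

# Three "words", each with a key and a value vector (made-up numbers).
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
values = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 0.0]

weights, output = attention(query, keys, values)
print(weights)  # the first and third words get the largest, equal weights
```

The weights are what lets the model "focus": words whose keys match the query contribute more to the output than words whose keys do not.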

HOW DO YOU TRAIN AN LLM?

LLMs can be incredibly expensive to train. A 2020 study estimated that the cost of training a model with 1.5 billion parameters can be as high as $1.6 million. However, advances in software and hardware have brought those costs down in recent years.

Generally, training an LLM involves four steps: identifying a data set, which likely needs to be large for the model to perform functions like a human; determining the network layer configuration; training the model on the data set, typically with self-supervised learning, in which the model predicts words that are hidden or held back from the text itself; and finally fine-tuning, or making specific adjustments based on performance or purpose.
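One reason the data set can be so large is that, in the initial training stage, nobody has to label it by hand: the "answers" are usually just the next words of the text. The following sketch shows how raw text can be sliced into training examples. The function name, the whitespace tokenization, and the context size of three words are simplifying assumptions made for illustration.

```python
def make_training_pairs(text, context_size=3):
    """Slice text into (context words, next word) training examples."""
    tokens = text.split()  # real systems use subword tokenizers instead
    pairs = []
    for i in range(context_size, len(tokens)):
        context = tokens[i - context_size:i]  # the words the model sees
        target = tokens[i]                    # the word it must predict
        pairs.append((context, target))
    return pairs

pairs = make_training_pairs(
    "large language models learn patterns from text", context_size=3
)
for context, target in pairs:
    print(context, "->", target)
# First line printed: ['large', 'language', 'models'] -> learn
```

During training, the model's prediction for each context is compared with the actual next word, and the network's parameters are adjusted to make the correct word more likely.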

Task-specific training is an iterative process of identifying what the model still lacks and working out how to reach the end goal. Training LLMs can be quite difficult, however: it requires distributed software, long training times, and the technical expertise to train the model.
