Wednesday, November 26, 2025

AI25050 Natural Language Processing (NLP) V01 261125

 NLP stands for Natural Language Processing.

It is the field of artificial intelligence that focuses on enabling computers to understand, generate, interpret, and interact using human language.


Here’s a simple breakdown:


What NLP Does


NLP allows computers to:


1. Understand language

recognizing meaning in text or speech

analyzing grammar

extracting keywords, topics, or sentiment


2. Generate language

writing responses (as I’m doing now)

summarizing documents

translating between languages

creating chatbots or assistants


3. Interact with humans

voice assistants (Siri, Alexa)

customer service chatbots

search engines

document automation


How NLP Works


Two main approaches exist:


1. Traditional NLP (before deep learning)

Used linguistic rules, dictionaries, grammar patterns.

Very Chomskyan, symbolic, and rule-based.

Could not scale easily to messy real-world language.


2. Modern NLP (deep learning / LLMs)

Uses neural networks trained on huge amounts of text.

Learns statistical patterns instead of rules.

Powers GPT, Claude, Gemini, etc.


Examples of NLP Tasks

Speech recognition (“Hey Siri…”)

Machine translation (Google Translate)

Autocorrect / predictive text

Sentiment analysis (“Is this review positive?”)

Information extraction (finding names, dates, events)

Document summarization

Question answering


In one sentence:


NLP is the technology that lets computers read, write, and understand human language.


Great — here’s a clear explanation of how NLP connects to Chomsky’s theories and how LLMs use NLP today, showing how the field evolved.


1. How NLP connects to Chomsky’s theories


Chomsky’s work shaped the foundation of NLP, especially early computational linguistics.


A. Formal grammars


Chomsky introduced the Chomsky hierarchy:

Regular grammars

Context-free grammars

Context-sensitive grammars

Unrestricted grammars


These became the backbone of programming language design and early NLP systems.

For decades, NLP tried to model human language using:

grammar rules

parse trees

symbolic transformations


This mirrors Chomsky’s idea that sentences are produced like instructions in code.


B. Parsing based on syntax


Tools like:

context-free grammar parsers

syntax trees

phrase-structure rules

dependency grammars


all emerged from generative-linguistic thinking.


C. Competence vs. performance


Early NLP focused on idealized language (competence), not messy real-world text.

Systems failed in practice because real language is ambiguous, noisy, and context-dependent.


Bottom line:


Chomsky’s ideas provided the blueprint for rule-based NLP, but these systems were brittle and did not scale.


2. How LLMs use NLP today (a completely different approach)


Modern NLP—what powers GPT, Claude, and Gemini—takes almost the opposite approach from Chomsky.


A. Data-driven, not rule-based


LLMs learn language by analyzing billions of words and discovering patterns, instead of relying on hand-written grammar rules.


B. No explicit grammar


LLMs never see:

phrase-structure rules

transformations

deep/surface structure

Universal Grammar


Yet they learn syntactic patterns implicitly.


They “infer” grammar through statistical regularities.


C. Continuous, not symbolic


Chomsky:

Language = discrete symbols (NP, VP, Tense, Movement)


LLMs:

Language = vectors in high-dimensional space representing patterns.


D. Meaning emerges from context


LLMs use self-attention to determine which words relate to which.

They don’t “parse” in a symbolic sense; they compute relationships.


E. Performance-first


LLMs do not distinguish competence from performance; they model language as it’s used in the real world.


Bottom line:


LLMs succeeded because they embraced probabilistic, usage-based learning rather than Chomskyan rule systems.


3. How the two approaches relate (big picture)


Here’s a simple summary.


Chomsky LLMs

Language is a formal rule system Language is a statistical pattern

Grammar is innate Grammar is learnable from data

Syntax is symbolic, hierarchical Structure is encoded in vector patterns

Meaning is separate from usage Meaning emerges from usage

Rule-based generation Prediction-based generation

Computer analogy: compiler Computer analogy: neural pattern recognizer


4. Why this matters

Classical Chomskyan ideas shaped early NLP but became a bottleneck.

Neural approaches bypassed the rule-based limitations.

LLMs now achieve fluent, human-level language without symbolic grammar.


This shift changed both linguistics and AI, raising new questions about what “knowing a language” actually means.



No comments:

Post a Comment

AI26019 Copyright and AI V01 100326

  Creative types have the upper hand in AI copyright fight Katie Prescott Kanishka Narayan is the minister for AI Next image  › ‘‘ Pimli-cod...