Sunday, January 25, 2026

AI26008 The AI Users V01 250126

 When we talk about AI users, we are looking at a diverse spectrum of people—from casual chatters to the engineers building the world's next major models.

In the context of the digital commons we discussed earlier, AI users are not just "consumers"; they are often part of a cycle where they both use and contribute to shared resources.

1. Types of AI Users

Not every AI user interacts with the technology in the same way. We can categorize them by their level of "depth":

End Users (Consumers): These are people using tools like ChatGPT, Gemini, or Midjourney for daily tasks. They use AI for "conversational knowledge gathering," writing assistance, or creative fun.

Power Users (Prompt Engineers): These users treat AI as a professional tool. They understand how to structure complex queries, use "chain-of-thought" prompting, and often use AI to automate their own workflows.

Builders (Developers): These are the people creating the applications. They use APIs (Application Programming Interfaces) to plug AI models into new software, like a medical app that uses AI to summarize doctor-patient notes.

Designers (Model Trainers): The smallest group, consisting of data scientists and researchers who actually build and "fine-tune" the base models.

2. AI Users as "Data Contributors"

This is where the digital commons becomes critical. Every time a user interacts with an AI, they are often contributing back to the system:

Reinforcement Learning: When you tell an AI "This answer was helpful" or "This answer was wrong," you are acting as a human trainer. This is called RLHF (Reinforcement Learning from Human Feedback).

Data Exhaust: Most casual users generate "data exhaust"—the logs of their conversations which, depending on the privacy settings, companies may use to train future versions of the AI.

The "Closed" vs. "Open" Loop: Some users choose to use Open Source AI (like Llama or Mistral). In these cases, the users often contribute code or data back to the public "commons" rather than keeping it inside a company’s private walls.

3. The New "Synthetic User"

A fascinating trend in 2026 is the rise of Synthetic Users. This isn't a person, but an AI impersonating a user:

• Companies now create "AI Personas" to simulate how a real customer might react to a new product.

• Instead of running a focus group with 10 people, a company might "talk" to 1,000 synthetic users to test an idea instantly.

4. Key Challenges for AI Users

The Trust Gap: Many users feel "AI fatigue" or distrust because they don't know if the information is accurate or if their data is being used to train a competitor's model.

Data Sovereignty: There is a growing movement among users to "own" their own data—using local AI models that run on their own computers rather than sending their private thoughts to a cloud server.



In 2026, the term "Open Source AI" has become more nuanced. It refers to AI systems where the essential components are shared publicly, allowing anyone to inspect, modify, and run the model on their own hardware.  

However, there is a big difference between "marketing" open source and "technical" open source. Here is the breakdown:

1. The Core Components

For an AI to be truly open source, it generally needs to provide three things:

The Code: The programming instructions used to build the model's architecture and the scripts used to train it.  

The Weights: These are the numerical "knowledge" the AI learned during training. Without these, you have the engine (code) but no fuel (knowledge).  

The Data (The Controversy): To be "pure" open source, the training data should also be public. In practice, many companies keep their data secret but share the code and weights, calling it "Open Weights."  

2. Why People Use Open Source AI

Privacy & Sovereignty: You can run the AI on your own computer or private server. Your data never leaves your building, which is critical for hospitals, law firms, and banks.  

No "Guardrail" Frustration: Proprietary AIs (like the ones from big corporations) have strict built-in filters. Open source models allow users to remove or adjust these filters for specific research or creative needs.  

Fine-Tuning: Users can "teach" an open source model new, specific skills (like learning a company's internal jargon) without needing to build a whole new AI from scratch.

Cost: While training a model costs millions, using one that someone else has released for free is significantly cheaper than paying a monthly subscription or "per-word" fee.  

3. Key Examples (As of 2026)

• **Llama (Meta): While not "pure" open source by some strict definitions, Llama is the most popular "open weight" model that sparked the current revolution.  

Mistral & Mixtral: Known for being highly efficient and outperforming much larger corporate models while staying open for the community.

GPT-OSS: A major 2025/2026 milestone where fully open-source alternatives finally reached the "reasoning" capabilities of the best private models.  

Stable Diffusion: The standard-bearer for open-source image generation, allowing users to create art without corporate cloud censorship.  

4. Open Source vs. Closed Source

Closed Source (Proprietary): Like a black box. You send a request, get an answer, but you don't know how it works or what happens to your data (e.g., ChatGPT, Claude, Gemini).

Open Source: Like an open book. You can see the math, change the logic, and keep the whole system in your own "pocket."

The Current Debate: "Open-Washing"

A major topic right now is Open-Washing. This is when a company calls their AI "Open Source" to get good PR, but they actually include restrictive licenses that prevent certain companies from using it or hide the training data so no one can truly replicate it.

No comments:

Post a Comment

AI26019 Copyright and AI V01 100326

  Creative types have the upper hand in AI copyright fight Katie Prescott Kanishka Narayan is the minister for AI Next image  › ‘‘ Pimli-cod...