What’s in Today’s Article?
- Why in News?
- About Artificial Intelligence
- About OpenAI
- Debate over ChatGPT’s Source for Data
- Reason Behind News Outlets’ Decision
- Way Forward
Why in News?
- A group of news media organisations recently shut off OpenAI’s ability to access their content.
- New York Times is planning on suing OpenAI over copyright violations.
About Artificial Intelligence
- Artificial intelligence (AI) is the ability of a computer or a robot controlled by a computer to do tasks that are usually done by humans because they require human intelligence and discernment.
- The term is frequently applied to the project of developing systems endowed with the intellectual processes characteristic of humans, such as the ability to reason, discover meaning, generalize, or learn from past experience.
- AI algorithms are trained using large datasets so that they can identify patterns, make predictions and recommend actions, much like a human would, just faster and better.
About OpenAI
- OpenAI is an artificial intelligence research company.
- The company is best known for creating ‘ChatGPT’, which is an AI conversational chatbot.
- Users can ask questions on just about anything to ChatGPT and the chatbot will respond accurately with answers, stories and essays.
- It can even help programmers write software code.
Debate over ChatGPT’s Source for Data
- Software products like ChatGPT are based on what AI researchers call ‘Large Language Models’ (LLMs).
- LLMs require enormous amounts of information to train their systems.
- If chat bots or digital assistants need to be able to understand the questions that humans throw at them, they need to study human language patterns.
- Tech companies that work on LLMs like Google, Meta or OpenAI are secretive about what kind of training data they use.
- Tech companies use software called ‘crawlers’ to scan web pages, hoover up content and put it together in a dataset that can be used to train their LLMs.
- This is what news outlets took a stand against last week when The New York Times and others blocked a web crawler known as GPT bot.
- Through GPT bot, OpenAI used to scrape data.
- News outlets told OpenAI that the company can no longer use their published material and their journalism, to train their chat bots.
Reason Behind News Outlets’ Decision
- Search engines like Google or Bing also use web crawlers to index websites and present relevant results when users search for topics.
- However, these search engines represent a mutually beneficial relationship.
- Google, for instance, takes a snippet of a news article (a headline, a blurb and perhaps a couple of sentences) and reproduces them to make its search results useful.
- And while Google profits off of that content, it also directs a significant amount of user traffic to news websites.
- On the other hand, OpenAI provides no benefit, monetary or otherwise, to news companies.
- It simply collects publicly available data and uses it for the company’s own purposes.
Way Forward
- Lat month, OpenAI signed a licensing arrangement with The Associated Press, in a deal that would allow the company to use the news agency’s archival content as a training dataset.
- However, it remains to be seen if people refuse to accept payment and sue OpenAI for copyright infringement, the way a group of novelists did last year.
- The legal battles ahead will have interesting implications for journalism, intellectual property and the future of artificial intelligence.
Q1) What is Artificial Intelligence in simple words?
Artificial intelligence is the ability of machines to perform tasks that are typically associated with human intelligence, such as learning and problem-solving.
Q2) What is Machine Learning in simple words?
Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.
Source:
Last updated on November, 2025
→ Check out the latest UPSC Syllabus 2026 here.
→ Join Vajiram & Ravi’s Interview Guidance Programme for expert help to crack your final UPSC stage.
→ UPSC Mains Result 2025 is now out.
→ UPSC Notification 2026 is scheduled to be released on January 14, 2026.
→ UPSC Calendar 2026 is released on 15th May, 2025.
→ The UPSC Vacancy 2025 were released 1129, out of which 979 were for UPSC CSE and remaining 150 are for UPSC IFoS.
→ UPSC Prelims 2026 will be conducted on 24th May, 2026 & UPSC Mains 2026 will be conducted on 21st August 2026.
→ The UPSC Selection Process is of 3 stages-Prelims, Mains and Interview.
→ UPSC Result 2024 is released with latest UPSC Marksheet 2024. Check Now!
→ UPSC Prelims Result 2025 is out now for the CSE held on 25 May 2025.
→ UPSC Toppers List 2024 is released now. Shakti Dubey is UPSC AIR 1 2024 Topper.
→ UPSC Prelims Question Paper 2025 and Unofficial Prelims Answer Key 2025 are available now.
→ UPSC Mains Question Paper 2025 is out for Essay, GS 1, 2, 3 & GS 4.
→ UPSC Mains Indian Language Question Paper 2025 is now out.
→ UPSC Mains Optional Question Paper 2025 is now out.
→ Also check Best IAS Coaching in Delhi


