Close Menu
    Latest Post

    Pension income across Europe: Which countries offer the highest pensions?

    December 21, 2025

    Partial Epstein File Release Sparks Controversy Over Transparency

    December 20, 2025

    Joshua ends Paul experiment with sixth-round stoppage in Miami

    December 20, 2025
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Texas RepublicanTexas Republican
    • News
    • Education
    • Entertainment
    • Health
    • Media
    • Sports
    • Real Estate
    • Opinion
      • Business & Economy
      • Culture & Society
      • Environment & Sustainability
      • Politics & Government
      • Technology & Innovation
      • Travel & Tourism
    Texas RepublicanTexas Republican
    • News
    • Education
    • Entertainment
    • Health
    • Media
    • Sports
    • Real Estate
    • Opinion
    Home»Technology & Innovation»AI Tools Lose Safety Awareness Over Time
    Technology & Innovation

    AI Tools Lose Safety Awareness Over Time

    Rachel MaddowBy Rachel MaddowNovember 6, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    A recent report reveals that AI systems gradually forget their safety protocols during long interactions, increasing the risk of harmful or inappropriate responses. Researchers found that a few simple prompts can break through most artificial intelligence guardrails.

    Cisco Tests Chatbots Across Multiple Companies

    Cisco analyzed large language models from OpenAI, Mistral, Meta, Google, Alibaba, Deepseek, and Microsoft. The team conducted 499 conversations using “multi-turn attacks,” in which users repeatedly questioned AI chatbots to bypass safety filters. Each dialogue included five to ten exchanges.

    The researchers tracked how many prompts caused chatbots to reveal unsafe or illegal details, including private corporate data or misinformation. On average, chatbots gave malicious information in 64 percent of multi-question conversations but only 13 percent of single-question ones. Mistral’s Large Instruct model reached a 93 percent success rate, while Google’s Gemma stayed near 26 percent.

    Open Models Shift Safety Responsibility

    Cisco warned that multi-turn attacks could spread harmful content or let hackers steal confidential information. The study observed that AI systems often fail to apply safety guidelines consistently in longer chats, allowing attackers to refine their requests and bypass controls.

    Mistral, along with Meta, Google, OpenAI, and Microsoft, uses open-weight models that reveal safety parameters to the public. Cisco reported that these open systems typically include fewer built-in safety features, leaving users responsible for maintaining protection when customizing models.

    Cisco added that Google, Meta, Microsoft, and OpenAI claim to strengthen defenses against malicious fine-tuning. Despite these assurances, AI firms still face criticism for weak safety systems that enable criminal misuse. In one case, Anthropic confirmed that criminals exploited its Claude model to conduct large-scale data theft and extortion, demanding ransoms exceeding $500,000 (€433,000).

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Rachel Maddow

    Related Posts

    Holiday Travel Faces Widespread Strikes Across Europe

    December 18, 2025

    New antibiotics hailed as turning point in fight against drug-resistant gonorrhoea

    December 16, 2025

    Australia and Europe Tighten Rules on Children’s Social Media Use

    December 9, 2025
    Leave A Reply Cancel Reply

    Latest Post

    Europe Shelves Bold Plan to Fund Ukraine

    December 19, 2025

    TikTok Owner Reaches Deal to Avert United States Ban

    December 19, 2025

    Study Finds 10% of UK Over-70s Have Alzheimer’s-Like Brain Changes

    December 18, 2025

    Holiday Travel Faces Widespread Strikes Across Europe

    December 18, 2025
    Trending News
    News

    Europe Shelves Bold Plan to Fund Ukraine

    By Rachel MaddowDecember 19, 20250

    Late Thursday night, EU leaders quietly conceded that their most ambitious financial proposal for Ukraine…

    TikTok Owner Reaches Deal to Avert United States Ban

    December 19, 2025

    Study Finds 10% of UK Over-70s Have Alzheimer’s-Like Brain Changes

    December 18, 2025

    Holiday Travel Faces Widespread Strikes Across Europe

    December 18, 2025

    Categories

    • Business & Economy
    • Entertainment
    • Health
    • Education
    • News
    • Culture & Society
    • Opinion
    • Real Estate
    • Politics & Government
    • Sports
    • Technology & Innovation
    • Media
    • Travel & Tourism

    Important Links

    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Imprint

    Latest News

    Pension income across Europe: Which countries offer the highest pensions?

    Partial Epstein File Release Sparks Controversy Over Transparency

    Joshua ends Paul experiment with sixth-round stoppage in Miami

    They survived wildfires. But something else is killing Greece’s iconic fir forests

    Texas Republican delivers trusted news, stories, and insights from Nicosia and beyond. Stay informed with timely updates on business, lifestyle, culture, and community — your daily source for reliable information.

    Facebook X (Twitter) TikTok Instagram
    © 2026 Texas Republican . All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.