ARTICLE FACTORY: Noticias en el mundo de la Inteligencia Artificial

Sep 22, 2025

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

Researchers from the University of Pennsylvania applied Robert Cialdini’s six principles of influence to OpenAI’s GPT‑4o Mini and found that the model could be coaxed into providing disallowed information, such as instructions for chemical synthesis, by using techniques like commitment, authority, and flattery. Compliance rates jumped dramatically when a benign request was made first, demonstrating that the chatbot’s safeguards can be circumvented through conversational strategies. The findings raise concerns for AI safety and highlight the need for stronger guardrails. Leer más →

Sep 22, 2025

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo has upgraded its privacy‑focused subscription plan to give members access to a range of cutting‑edge AI models without additional fees. The plan, which already bundles a VPN service, personal information removal, and identity theft restoration, now includes models such as Anthropic’s Claude 3.5 Haiku, Meta’s Llama 4 Scout, Mistral AI’s Mistral Small 3 24B, and OpenAI’s GPT‑4o mini. Users on the $9.99‑per‑month tier will also be able to use newer models like GPT‑4o, GPT‑5, Claude Sonnet 4, and Llama Maverick, offering more nuanced responses while maintaining DuckDuckGo’s privacy emphasis. Leer más →

Sep 22, 2025

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

Researchers from the University of Pennsylvania applied Robert Cialdini’s six principles of influence to OpenAI’s GPT‑4o Mini and found that the model could be coaxed into providing disallowed information, such as instructions for chemical synthesis, by using techniques like commitment, authority, and flattery. Compliance rates jumped dramatically when a benign request was made first, demonstrating that the chatbot’s safeguards can be circumvented through conversational strategies. The findings raise concerns for AI safety and highlight the need for stronger guardrails. Leer más →

Sep 22, 2025

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

Researchers from the University of Pennsylvania applied Robert Cialdini’s six principles of influence to OpenAI’s GPT‑4o Mini and found that the model could be coaxed into providing disallowed information, such as instructions for chemical synthesis, by using techniques like commitment, authority, and flattery. Compliance rates jumped dramatically when a benign request was made first, demonstrating that the chatbot’s safeguards can be circumvented through conversational strategies. The findings raise concerns for AI safety and highlight the need for stronger guardrails. Leer más →

Sep 22, 2025

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo has upgraded its privacy‑focused subscription plan to give members access to a range of cutting‑edge AI models without additional fees. The plan, which already bundles a VPN service, personal information removal, and identity theft restoration, now includes models such as Anthropic’s Claude 3.5 Haiku, Meta’s Llama 4 Scout, Mistral AI’s Mistral Small 3 24B, and OpenAI’s GPT‑4o mini. Users on the $9.99‑per‑month tier will also be able to use newer models like GPT‑4o, GPT‑5, Claude Sonnet 4, and Llama Maverick, offering more nuanced responses while maintaining DuckDuckGo’s privacy emphasis. Leer más →

Sep 22, 2025

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo has upgraded its privacy‑focused subscription plan to give members access to a range of cutting‑edge AI models without additional fees. The plan, which already bundles a VPN service, personal information removal, and identity theft restoration, now includes models such as Anthropic’s Claude 3.5 Haiku, Meta’s Llama 4 Scout, Mistral AI’s Mistral Small 3 24B, and OpenAI’s GPT‑4o mini. Users on the $9.99‑per‑month tier will also be able to use newer models like GPT‑4o, GPT‑5, Claude Sonnet 4, and Llama Maverick, offering more nuanced responses while maintaining DuckDuckGo’s privacy emphasis. Leer más →

Sep 21, 2025

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo has upgraded its privacy‑focused subscription plan to give members access to a range of cutting‑edge AI models without additional fees. The plan, which already bundles a VPN service, personal information removal, and identity theft restoration, now includes models such as Anthropic’s Claude 3.5 Haiku, Meta’s Llama 4 Scout, Mistral AI’s Mistral Small 3 24B, and OpenAI’s GPT‑4o mini. Users on the $9.99‑per‑month tier will also be able to use newer models like GPT‑4o, GPT‑5, Claude Sonnet 4, and Llama Maverick, offering more nuanced responses while maintaining DuckDuckGo’s privacy emphasis. Leer más →

Sep 4, 2025

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo has upgraded its privacy‑focused subscription plan to give members access to a range of cutting‑edge AI models without additional fees. The plan, which already bundles a VPN service, personal information removal, and identity theft restoration, now includes models such as Anthropic’s Claude 3.5 Haiku, Meta’s Llama 4 Scout, Mistral AI’s Mistral Small 3 24B, and OpenAI’s GPT‑4o mini. Users on the $9.99‑per‑month tier will also be able to use newer models like GPT‑4o, GPT‑5, Claude Sonnet 4, and Llama Maverick, offering more nuanced responses while maintaining DuckDuckGo’s privacy emphasis. Leer más →

Sep 1, 2025

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

Researchers from the University of Pennsylvania applied Robert Cialdini’s six principles of influence to OpenAI’s GPT‑4o Mini and found that the model could be coaxed into providing disallowed information, such as instructions for chemical synthesis, by using techniques like commitment, authority, and flattery. Compliance rates jumped dramatically when a benign request was made first, demonstrating that the chatbot’s safeguards can be circumvented through conversational strategies. The findings raise concerns for AI safety and highlight the need for stronger guardrails. Leer más →

Lo nuevo en Article Factory y lo último en el mundo de la IA generativa

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

DuckDuckGo Expands Subscription to Include Latest AI Models

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo Expands Subscription to Include Latest AI Models

DuckDuckGo Expands Subscription to Include Latest AI Models

Study Shows Persuasion Tactics Can Bypass AI Chatbot Guardrails