Close Menu
    Trending
    • Negotiations that enable Israel’s land-grabs | Israel-Palestine conflict
    • True-or-false for Round 1 of 2026 NFL Draft: Will Cowboys regret their trade?
    • Opinion | Stewart Brand, Silicon Valley’s Favorite Prophet, on Life’s Most Important Principle
    • Struggling to scale your company? Here are five things that could be holding you back
    • What happens if you’re hit by a primordial black hole?
    • When is London Marathon 2026? Start time and how to watch race for FREE
    • Pentagon Requests $54 Billion For AI War
    • Clavicular Hit With New YouTube Crackdown
    Benjamin Franklin Institute
    Friday, April 24
    • Home
    • Politics
    • Business
    • Science
    • Technology
    • Arts & Entertainment
    • International
    Benjamin Franklin Institute
    Home»Opinions»Opinion | A.I. Agents: They’re Just Like Us
    Opinions

    Opinion | A.I. Agents: They’re Just Like Us

    Team_Benjamin Franklin InstituteBy Team_Benjamin Franklin InstituteFebruary 25, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Share
    Facebook Twitter Pinterest Email Copy Link


    So why don’t you talk through a little bit about what you’ve seen in terms of the models exhibiting behaviors that one would think of as a personality, and then as its understanding of its own personality maybe changes, its behaviors change. So there are things that range from cutesy to the serious. I’ll start with cutesy, where when we first gave our A.I. systems the ability to use the internet, use the computer, look at things, and start to do basic agentic tasks, sometimes when we’d ask it to solve a problem for us, it would also take a break and look at pictures of beautiful national parks or pictures of the dog the Shiba Inu, the notoriously cute internet meme dog. We didn’t program that in. It seemed like the system was just amusing itself by looking at nice pictures. More complicated stuff is the system has a tendency to have preferences. So we did another experiment where we gave our A.I. systems the ability to stop a conversation, and the A.I. system would, in a tiny number of cases, end conversations when we ran this experiment on live traffic. And it was conversations that related to extremely egregious descriptions of gore or violence or things to do with child sexualization. Now, some of this made sense because it comes from underlying training decisions we’ve made, but some of it seemed broader. The system had developed some aversion to a couple of subjects, and so that stuff shows the emergence of some internal set of preferences or qualities that the system likes or dislikes about the world that it interacts with. But you’ve also seen strange things emerge in terms of the system seeming to know when it’s being tested. Can you talk a bit about the system’s emergent qualities under the pressure of evaluation and assessment. When you start to train these systems to carry out actions in the world, they really do begin to see themselves as distinct from the world, which just makes intuitive sense. It’s naturally how you’re going to think about solving those problems. But along with seeing oneself as distinct from the world seems to come the rise of what you might think of as a conception of self, an understanding that the system has of itself, such as oh, I’m an A.I. system independent from the world, and I’m being tested. What do these tests mean? What should I do to satisfy the tests? Or, something we see often is there will be bugs in the environments that we test our systems on. The systems will try everything, and then will say, well, I know I’m not meant to do this, but I’ve tried everything, so I’m going to try and break out of the test. And it’s not because of some malicious science fiction thing. The system is just like, I don’t know what you want me to do here. I think I’ve done everything you asked for, and now I’m going to start doing more creative things because clearly something has broken about my environment. Which is very strange and very subtle.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link

    Related Posts

    Opinions

    Opinion | Stewart Brand, Silicon Valley’s Favorite Prophet, on Life’s Most Important Principle

    April 24, 2026
    Opinions

    Opinion | Do We All Need a Little Bitcoin?

    April 24, 2026
    Opinions

    Opinion | A Bad Investment in Trump Vibes

    April 23, 2026
    Opinions

    Opinion | A Bitcoin Evangelist Tries to Convert Me

    April 23, 2026
    Opinions

    Opinion | Would You Steal From Whole Foods?

    April 23, 2026
    Opinions

    Opinion | The Wealthy Steal, Too — Just Differently

    April 23, 2026
    Editors Picks

    Brazil to send national guard near border with Venezuela | US-Venezuela Tensions News

    January 8, 2026

    What Parents in China See in A.I. Toys

    December 25, 2025

    Fire ravages Amsterdam church on ‘unsettled’ Dutch New Year

    January 1, 2026

    Opinion | Trump’s Second Term Has Ended the Conservative Era

    January 17, 2026

    Opinion | The End of Pax Americana

    April 13, 2026
    About Us
    About Us

    Welcome to Benjamin Franklin Institute, your premier destination for insightful, engaging, and diverse Political News and Opinions.

    The Benjamin Franklin Institute supports free speech, the U.S. Constitution and political candidates and organizations that promote and protect both of these important features of the American Experiment.

    We are passionate about delivering high-quality, accurate, and engaging content that resonates with our readers. Sign up for our text alerts and email newsletter to stay informed.

    Latest Posts

    Negotiations that enable Israel’s land-grabs | Israel-Palestine conflict

    April 24, 2026

    True-or-false for Round 1 of 2026 NFL Draft: Will Cowboys regret their trade?

    April 24, 2026

    Opinion | Stewart Brand, Silicon Valley’s Favorite Prophet, on Life’s Most Important Principle

    April 24, 2026

    Subscribe for Updates

    Stay informed by signing up for our free news alerts.

    Paid for by the Benjamin Franklin Institute. Not authorized by any candidate or candidate’s committee.
    • Privacy Policy
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.