Paige Didcote says her anxiety is "through the roof" living on the street
I completely ignored Anthropic’s advice and wrote a more elaborate test prompt based on a use case I’m familiar with and therefore can audit the agent’s code quality. In 2021, I wrote a script to scrape YouTube video metadata from videos on a given channel using YouTube’s Data API, but the API is poorly and counterintuitively documented and my Python scripts aren’t great. I subscribe to the SiIvagunner YouTube account which, as a part of the channel’s gimmick (musical swaps with different melodies than the ones expected), posts hundreds of videos per month with nondescript thumbnails and titles, making it nonobvious which videos are the best other than the view counts. The video metadata could be used to surface good videos I missed, so I had a fun idea to test Opus 4.5:
。同城约会是该领域的重要参考
The game is in Turkish, with English subtitles. It already feels arthouse; like those films Channel 4 used to show with a red triangle in the corner of the screen.
除了懂常识,强大的「主体一致性」是这次 Nano Banana 2 更新的另一大杀手锏。