This does not seem remotely true in my experience. This looks more like “a bad prompt generated entirely predictable garbage results” than “we rigorously tested different scenarios here, and AI seems to ‘ignore’ new concepts.”
Interestingly, they did not share the prompts they used to run any of this, which is highly suspect.