Section A covers Biology, with 16 questions carrying 30 marks. Section B covers Chemistry, with 13 questions carrying 25 marks.
AI safety tests have been found to rely on 'obvious' trigger words; after simple rephrasing, models labeled 'reasonably safe' fail, with attacks succeeding up to 98% of the time. New corporate research ...