Also, xAI’s Grok-2, Claude context caching, Imagen 3, Tree Attention, and more!

What happened this week in AI by Louie

This week, xAI’s Grok-2 joined the growing crowd of broadly GPT-4 class models, which now includes models from OpenAI, Anthropic, DeepMind, xAI, Meta, Mistral, and DeepSeek (though only the first four have multimodal capabilities). Anthropic also launched a context caching option that cuts the cost of reused input tokens by up to 10x. We recently flagged that context caching opens up many new opportunities, including for complex LLM agent pipelines. On that note, this week Sakana AI introduced “The AI Scientist,” an LLM agent for assisting machine learning research. Sakana’s agent begins by brainstorming new ideas from an initial topic and codebase (provided by a human researcher) and performs a […]
Original web page at pub.towardsai.net