Transformer Architecture Explained

AshNeurotech

When thinking about the immense impact of transformers on artificial intelligence, I always refer back to the story of Fei-Fei Li and Andrej Karpathy. Andrej Karpathy, co-founder of OpenAI and PhD student under Fei-Fei Li, is one of the most recognizable names in AI research. For computer scientists, he’s the modern-day version of Michael Jordan, albeit a bit nerdier. As part of his PhD in 2015, Andrej pioneered a novel computer vision algorithm that described a photo in natural language. Fei-Fei suggested, “Very cool, now do it backwards”, to which Andrej immediately responded, “Ha ha, that’s impossible”.

Andrej Karpathy & Fei-Fei Li, CVPR 2015 presentation diagram

As we now know, it is in […]
Original web page at medium.com