Looking back at AI progress since the 2012 blog post “The state of Computer Vision and AI: we are really, really far away”
“What would it take for a computer to understand this image as you or I do? I challenge you to think explicitly of all the pieces of knowledge that have to fall in place for it to make sense.” [1]
Twelve years ago, on October 22, 2012, Andrej Karpathy published a blog post titled “The state of Computer Vision and AI: we are really, really far away” [1].
In the post, he used a photo of former President Barack Obama jokingly putting his toe on the scale as the starting point for his assessment of the state of computer vision and artificial intelligence (AI) in 2012.
Karpathy argued that an AI model needs a great deal of knowledge about our world to draw inferences from the pixel values of an image: not only to recognize what is happening in the scene, but also to grasp why it is funny.
“It is mind-boggling that all of the above inferences unfold from a brief…