A new test of AI capabilities consists of puzzles that humans are able to solve without too much trouble, but which all leading AI models struggle with. To improve and pass the test, AI companies will need to balance problem-solving abilities with cost.
Yeah, people are frequently terrible at understanding context so it shouldn’t be surprising that a computer has difficulty too.
There are actually a lot of specialized applications of neural network based computing being used for science, but they don’t get the flashy headlines because they are a tool. Those projects use it to find things to focus on narrowing down what people should look into first for confirmation, like ancient settlement patterns, stars that might have planets, and other things where patterns exist but are hard to see.
Some examples are listed here at a high level. In all cases the ai leads to humans confirming and then working from there, it isn’t the end result on its own. https://medium.com/@jeyadev_needhi/uncovering-the-past-how-ai-is-transforming-archaeology-38ded420896d