OmniParser, turn your LLM into a GUI agent
Locate GUI elements using instructions
A Foundation Action Model For Generalist GUI Agents