From the Desk of Josh Beckman (www.joshbeckman.org) home / replies / new-comment-by-bckmn-in-show-hn-tarsier-vision-utilities-for-web-interaction-agents New comment by bckmn in 'Show HN: Tarsier – Vision utilities for web interaction agents' In Reply To: thread View SourceReminds me of [Language as Intermediate Representation](https://chrisvoncsefalvay.com/posts/lair/) - LLMs are optimized for language, so translate an image into language and they'll do better at modeling it. Josh Beckman Reference llm programming-languages machines tools hacker-news 2024, May 15 Edit Comment/Reply on HackerNewsvia email Widgets Network Graph Legend Insight This widget generates “insights” about a post - you can read about how it works. Generating Keyboard Shortcuts Key Action o Source e Edit i Insight r Random h Home s or / Search Close www.joshbeckman.org/replies/hacker-news-item-40369904