New comment by bckmn in 'Show HN: Tarsier – Vision utilities for web interaction agents'
In Reply To: thread
Reminds me of [Language as Intermediate Representation](https://chrisvoncsefalvay.com/posts/lair/) - LLMs are optimized for language, so translate an image into language and they'll do better at modeling it.
Reference
- llm, programming-languages, machines, tools, hacker-news
Permalink to
2024.RPY.001