New comment by bckmn in 'Show HN: Tarsier – Vision utilities for web interaction agents'
In Reply To: thread
Reminds me of [Language as Intermediate Representation](https://chrisvoncsefalvay.com/posts/lair/) - LLMs are optimized for language, so translate an image into language and they'll do better at modeling it.
Reference
-
Permalink (
2024.RPY.001) - On
- In Replies
- Tagged llm, programming-languages, machines, tools, hacker-news
- Edit
| ← Previous | Next → |
| Chicago 🏃 | Functional Strength Training 🏋️ |