New comment by bckmn in 'Show HN: Tarsier – Vision utilities for web interaction agents'

In Reply To: thread

Reminds me of [Language as Intermediate Representation](https://chrisvoncsefalvay.com/posts/lair/) - LLMs are optimized for language, so translate an image into language and they'll do better at modeling it.

Reference

Permalink (2024.RPY.001)
On 2024, May 15, Wednesday
In Replies
Tagged llm, programming-languages, machines, tools, hacker-news
Edit

← Previous	Next →
Chicago 🏃	Functional Strength Training 🏋️

Widgets

Network Graph

Legend

Key	Action
`o`	Source
`e`	Edit
`i`	Insight
`r`	Random
`h`	Home
`s` or `/`	Search

New comment by bckmn in 'Show HN: Tarsier – Vision utilities for web interaction agents'

In Reply To: thread

Reference

Comments & Replies

Widgets