Note on Mapping the Mind of a Large Language Model via anthropic.com
The features we found represent a small subset of all the concepts learned by the model during training, and finding a full set of features using our current techniques would be cost-prohibitive (the computation required by our current approach would vastly exceed the compute used to train the model in the first place).