Logs

2024-07-13

While I didn't have much success getting gpt-4o to perform Task 1 - Counting Line Intersection from the Vision Language Models Are Blind paper, I pulled down some code and did a bit of testing with Claude 3.5 Sonnet. The paper reports the following success rate for Sonnet for this line intersection...

Logs

2024-07-03

I was inspired by Daniel's post to add sidenotes to this blog. I used claude-3.5-sonnet to generate the CSS and HTML shortcode to do this. I was impressed how well it turned out[^1]. It was almost too easy. I'm not the most competent CSS writer and I had never written a Hugo shortcode before. In...