VLMs aren't blind
6 entries
I attempted to reproduce the results for one task from the VLMs are Blind paper. Specifically, Task 1: Counting line intersections. I ran 150 examples of lines generated by the code from the project with line thickness 4.
While I didn't have much success getting gpt-4o to perform Task 1 - Counting Line Intersection from the Vision Language Models Are Blind paper, I pulled down some code and did a bit of testing with Claude 3.5 Sonnet. The paper reports the following success rate for Sonnet for this line intersection...
I spent some time working with Claude Artifacts for the first time. I started with this prompt: "I want to see what you can do. Can you please create a 2d rendering of fluid moving around obstacles of different shapes?"
I was inspired by Daniel's post to add sidenotes to this blog. I used claude-3.5-sonnet to generate the CSS and HTML shortcode to do this. I was impressed by how well it turned out. It was almost too easy. I'm not the most competent CSS writer and I had never written a Hugo shortcode before. In...
I'm trying to avoid buying too much into the hype (maybe it's too late), but here are several folks talking about their notably impressive experiences with...
One of my favorite things to do with language models is to use them to write code. I've been wanting to build a variation on tic-tac-toe involving a bit of game theory. I called it "Tactic". I wasn't even really sure if the game would be any more interesting than tic-tac-toe itself, which reliably...