Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
Software AI

'Make It Real' AI Prototype Turns Drawings Into Working Software (arstechnica.com) 50

An anonymous reader quotes a report from Ars Technica: On Wednesday, a collaborative whiteboard app maker called "tldraw" made waves online by releasing a prototype of a feature called "Make it Real" that lets users draw an image of software and bring it to life using AI. The feature uses OpenAI's GPT-4V API to visually interpret a vector drawing into functioning Tailwind CSS and JavaScript web code that can replicate user interfaces or even create simple implementations of games like Breakout. "I think I need to go lie down," posted designer Kevin Cannon at the start of a viral X thread that featured the creation of functioning sliders that rotate objects on screen, an interface for changing object colors, and a working game of tic-tac-toe. Soon, others followed with demonstrations of drawing a clone of Breakout, creating a working dial clock that ticks, drawing the snake game, making a Pong game, interpreting a visual state chart, and much more.

Tldraw, developed by Steve Ruiz in London, is an open source collaborative whiteboard tool. It offers a basic infinite canvas for drawing, text, and media without requiring a login. Launched in 2021, the project received $2.7 million in seed funding and is supported by GitHub sponsors. When The GPT-4V API launched recently, Ruiz integrated a design prototype called "draw-a-ui" created by Sawyer Hood to bring the AI-powered functionality into tldraw. GPT-4V is a version of OpenAI's large language model that can interpret visual images and use them as prompts. As AI expert Simon Willison explains on X, Make it Real works by "generating a base64 encoded PNG of the drawn components, then passing that to GPT-4 Vision" with a system prompt and instructions to turn the image into a file using Tailwind.
You can experiment with a live demo of Make It Real online. However, running it requires providing an API key from OpenAI, which is a security risk.
This discussion has been archived. No new comments can be posted.

'Make It Real' AI Prototype Turns Drawings Into Working Software

Comments Filter:
  • by parityshrimp ( 6342140 ) on Thursday November 16, 2023 @10:36PM (#64011211)

    The truth of the headline strongly depends on how "working" is defined, and I suppose who defines it...

    Honestly, "working" and "software" aren't normally associated in my mind. I can't think of anything that I use daily which is less reliable than some of the software I have to use.

    • by Narcocide ( 102829 ) on Thursday November 16, 2023 @10:55PM (#64011243) Homepage

      Don't worry. By tomorrow morning there will be a whole industry backing this piece of software and lauding it as the next replacement for basic competence.

      • It is even worse, I read about it yesterday on slashdot, which was literally linking to a literary magazine of dubious quality.

      • Don't worry. By tomorrow morning there will be a whole industry backing this piece of software and lauding it as the next replacement for basic competence.

        Of course there will! After all, it just takes a start block, leading to the magic block, which leads to the end profit block. I have no idea why someone didn’t think of this 50 years ago.

      • not because this tech works, but because your boss is now seriously thinking about automation and on the lookout for stuff that can be automated that previously he didn't think could be.

        The real change here is that a ton of shit old Mr pointy haired didn't think could be automated but *could* he now believes can be. Yes, we'll all have a good laugh at our boss calling a perl script "AI".... right before he fires us.

        And if you don't think you can be replaced by a perl script, well, go away or "I shal
    • > I can't think of anything that I use daily which is less reliable than some of the software I have to use.

      I always think of if Microsoft Windows was a car they would have been sued out of existence by all the accident victims. I mean, imagine if your car had an ignition switch where any key could turn it on, the steering wheel became inverted at arbitrary intervals, and the tires were made of paper. No person in their right mind would buy such a car. But somehow everyone accepted such a shoddy product

      • by LindleyF ( 9395567 ) on Friday November 17, 2023 @01:20AM (#64011429)
      • Why would anyone in their right mind buy an airplane that goes down when you tell it to go up?

      • I always think of if Microsoft Windows was a car they would have been sued out of existence by all the accident victims. I mean, imagine if your car had an ignition switch where any key could turn it on, the steering wheel became inverted at arbitrary intervals, and the tires were made of paper. No person in their right mind would buy such a car. But somehow everyone accepted such a shoddy product from a software company. With the added benefit that through no fault of your own your bank account could be depleted if someone stole your car.

        Yes, but look at it from the bright side. You could fix most problems with your car by just closing all the windows and restarting. Though, a BSOD on the highway would be literal...

        • Just wait until the "No reckless driving" code is in cars. Deaths by car accident will rise incredibly.
      • by nasch ( 598556 )

        But somehow everyone accepted such a shoddy product from a software company.

        1. Nobody dies when your PC does something inappropriate.
        2. What was the alternative? There's Mac, which is much more expensive, and Linux, which... well I guess the year of the Linux desktop is coming any day now.

    • This is a wonderful tech demo.

  • by Tablizer ( 95088 ) on Thursday November 16, 2023 @11:36PM (#64011311) Journal

    You "drew" by dragging and dropping widgets from the toolbox into the form designer. Press the "run" button and you have a form. Variations of the idea from other vendors did automatic data hookups also.

    It all fell out of favor because it didn't violate YAGNI enough like modern bloated crap does. Rocket surgery stacks are job security, so professional bloaters scare customers into wanting potential features using reverse YAGNI logic. "Trust me, your lovely little company may easily need web scale and internationalization soon because you are such wonderful shiny skilled managers!" Gag me.

    • Hell its basically Dreamweaver that idgits back in the late 90s used.
      • by Tablizer ( 95088 )

        Re Dreamweaver, I didn't like the product myself, but it usually got the job done quickly as long as you didn't want some screwy eye-candy. For intranet, it was usually good enough. It just couldn't keep up with eye-candy fads on the internet.

  • by Mr. Dollar Ton ( 5495648 ) on Thursday November 16, 2023 @11:46PM (#64011327)

    Ran the resulting "working software" on my computer, still cleaning up bits of PCB, cabling and what used to be a cool case.

    • PLEASE OH PLEASE MOD THIS UP.

      And I would like to join your rag-tag crew of product testers.

    • by EvilSS ( 557649 )
      I tried it with a Salvador Dalí. Does anyone know how to put dimensions back to their default state?
  • by thesjaakspoiler ( 4782965 ) on Friday November 17, 2023 @12:30AM (#64011371)

    I only see a text in the middle saying : need more funding to complete this game.

  • 2.7 million for this? How?

    I couldn't do it. I'd either start laughing at the absurdity of it all or confess from the guilt.

  • by Qbertino ( 265505 ) <moiraNO@SPAMmodparlor.com> on Friday November 17, 2023 @01:59AM (#64011477)

    Tailwind CSS is specifically designed for humans to push out clean implemented designs quickly. That's why its utility approach has CSS-Nazis foam at the mouth and go into a hissy fit whenever someone even mentions Tailwind. Discussions about utility CSS for humans aside (I personally do see their point), it makes no sense to have bots use this approach

    If you're going to have an AI implement the design you might as well generate the actual CSS right away without any framework or intermediary tooling. Obviously.

    • by kmoser ( 1469707 )
      Outputting Tailwind CSS makes the code easier for a human to read, understand, and modify. It also removes the need for the AI to worry about weird CSS implementation details--which is to say, the developers don't need to worry about that and can instead train the AI to work in the more abstract realm, which makes for a more powerful AI with less programming necessary.
      • ... about anything. And appreciate CSS classes with appropriate naming are way easier to understand than generic utility CSS microclasses and way more readable because they eliminate the significant bloat that comes with tools like Tailwind.

      • by olau ( 314197 )

        I don't think this AI is trained at all? It sounds like they just plug in ChatGPT.

    • by AmiMoJo ( 196126 )

      What is the quality of the produced code though?

      How maintainable by humans is it, or are you stuck having to ask the AI to make any changes?

      How robust is the software, is it full of SQL injection attacks or trivial code editing hacks?

  • UML ? (Score:5, Insightful)

    by dargaud ( 518470 ) <slashdot2@gdargau[ ]et ['d.n' in gap]> on Friday November 17, 2023 @02:09AM (#64011489) Homepage
    Didn't they try that already and call it UML ? Draw a bunch of boxes, which you can turn into code automatically. It may work for the simple things, provided you like complex drawings more than a clear line of specs. I took a course of it and personally loathed it. I couldn't help laughing out loud when one of the instructor showed us a 'real life example' that had thousands of boxes and weird arrows crossing all over the plane; like yeah, how does that help in the least ?!?
    • Aww, come on, using UML (or any other similar diagram spec) to represent the general architecture or a specific hairy piece of a program or algorithm visually can be quite useful. I prefer pictures to long-winded specifications when reasonable.

      It ain't a silver bullet, but it has its uses.

      And nobody forces you to buy the whole toolkit :)

    • And before UML there was Structured Analysis / Structured Design, and before SA/SD there was the humble flowchart. People have been trying to take programmers out of the programming loop for well over 50 years now. "Just sketch out your spec and it runs! So simple even a marketeer can do it!" Except that it always turns out that the only thing detailed enough to produce non-trivial working code is... code. Whether that code is assembly language or Rust, written on a keyboard or by dragging and dropping el

  • Took long enough but with this we can finally get to the software development that James martin and Michael Jackson where saying we would all be using back in the 1970, 1980, and 1990
    Time to bring back those CRUD tables and create a new version of knowledgeware.
    • Do programmers often underestimate time needed to code anything?

      • When I started my career in the mid 80s, my manager, a brilliant woman with several patents to her name, told me 'take your initial estimate and multiply by four.' Still true.
  • Has everyone forgotten VisualBasic Forms? Literally the same thing (draw UI, get application) with a little more work (you have to hook up buttons and things yourself).

    Sure used a LOT less energy.

    • I miss that time. It was much, much easier to come up with a decent interface and then just build the code needed for the buttons and forms to do something.

      I know you can do similar things with "web technologies" these days, but it's always been an ugly workaround around a protocol that was NOT designed for this (html) and where you only see the visual result of what you're doing when you run the page into a server.
      • I know you can do similar things with "web technologies" these days, but it's always been an ugly workaround around a protocol that was NOT designed for this (html) and where you only see the visual result of what you're doing when you run the page into a server.

        HTML/DOM is too inconsistent to get WYSIWYG. It's why it's impossible to build a PDF renderer in JavaScript + DOM unless you emulate at the pixel level, essentially reinventing an OS from scratch, an "abstraction inversion".

        And it's not true that W

  • This guy created a AI David Attenborough to narrate his life. The software snaps pics from his webcam and analyzes them, using an AI generated voice of David Attenborough to vocalize the results. It's both funny and, if you think about what the AI is doing, scary. Not the AI voice, that's been done to death, but the ability of the AI to pretty accurately figure out the context of what is going on with the subject and the environment around him. There is no way that doesn't get used for nefarious purposes by

1 + 1 = 3, for large values of 1.

Working...