I Used ChatGPT Vision and DALL-E 3 to Bring My Drawings to Life
Let ChatGPT make you feel like an artist.
Hi!
In this week’s guest post, Diana is sharing with us how to turn your draws into AI images. Diana runs a Substack called AI Girl, a weekly newsletter that helps you learn how to use AI in different areas (art, languages, etc)
Lately she created a complete guide on how to create better images with Midjourney. Check it out!
I’m not a professional artist, but I love drawing things on my sketchbook or iPad. My drawings aren’t always amazing, but I love all of them. Now I love them more because I can bring them to life using ChatGPT Vision and DALL-E 3.
Here’s how I did it.
How to bring your drawings to life with ChatGPT
We’re going to use ChatGPT Vision and DALL-E 3 to transform your drawings from your sketchbook or iPad while maintaining their essence.
It only took me around 2 minutes to transform my drawing below.
Let me show you how to do it.
Step 1: Make a drawing
First, you’ll need to create your drawing whether it’s on paper, an iPad, or any other medium. Once your drawing is finished, take a photo with your phone and make sure the image is in JPEG format or similar to upload it later to ChatGPT.
For this demo, I’ll use one of the drawings I made with my iPad (the one you see above on the left).
Step 2: Upload the image to ChatGPT
Upload your image to ChatGPT and request a detailed description of it. To do so, click on GPT-4 and select “Default.” If you’re a ChatGPT Plus subscriber, you’ll see the “attach images” icon in the text box.
describe this image in detail
After you’ve uploaded your image and obtained a detailed description from ChatGPT, you have two options:
Retain the original description, preserving all details.
Adjust any details as you want.
I chose to stick with the original description to see how DALL-E 3 would recreate my drawing. Here’s the description I got.
Step 3: Give the description to DALL-E 3
The most exciting part is getting your image. Simply give the detailed description to DALL-E 3 and press enter.
You’ll be amazed by the stunning images generated from your drawing.
It closely resembles my original drawing and it’s quite charming. I’m pleased with both ChatGPT and DALL-E 3.
It even worked with my most basic sketches
This works not only with finely drawn images but also with quick sketches. Below is a sketch I made in 10 seconds for Midjourney when I wrote this article.
I followed the same steps as above (uploaded the image, asked for a detailed description, and uploaded it to DALL-E 3), and here’s what I got.
As mentioned earlier, you can incorporate details and make refinements. I requested DALL-E 3 to enhance the realism of these images, and the outcome was astounding. The result is truly captivating, even if it doesn’t look like a photo.
Bonus: Create math equations for research papers from screenshots and handwriting
As a university student, I have to read scientific papers a lot and I usually come across math formulas like the one below.
Most authors create these formulas using LaTeX, which is the language that supports this type of math formulas. In the past, I had to memorize LaTeX commands to create math formulas, but now we can give a screenshot to ChatGPT and it’ll create the LaTeX code for us.
Here’s the formula I’ll use for this example.
Here’s the prompt I use to translate the screenshot to LaTeX code.
translate this math formula to latex
I got this.
Now simply copy the code and paste it into a text editor that supports LaTeX (e.g., Word, Overleaf, etc).
This also works with handwritten math formulas. The steps are the same, so give it a try!
congrats