If you are using Chatgpt a lot, you’ve probably experimented with Dall-E 3, Openai’s latest iteration of an AI image generator. Dall-e 3 can be shockingly good at bringing your ideas to life, but sometimes shockingly bad at understanding certain details or just shocking in what it chooses to emphasize. Still, if you want to see a specific scene move from your head to your screen, Dall-E 3 is usually pretty helpful, it can even make your hands write.
But here’s the thing, Dall-E 3 is still an AI model, not a mind reader. If you want pictures that actually look like what you imagine, learn to speak its language.
After a certain trial and mistake (and a few unintended horrors), I have become pretty good at speaking his language. So here are five key tips that help you do the same.
Polish in HD
Dall-E 3 has a quality mode called ‘HD’ that makes images look sharper and more detailed. Think of it like adding an advanced camera lens to your AI-generated art textures look richer, lighting is more refined, and things like fabric, fur and reflections have more depth.
Check out what it looks like in this prompt: “A reproduction of a close -up of a butterfly that rests on a sunflower. Quality: HD.”
Play with aspects of aspects
Not every image should be a square. Dall-E 3 allows you to set the image ratio which may seem less, but can be huge if you want to make something look more cinematic or as a portrait.
Square is fine, but why limit yourself when you can make something epic? This is especially useful for graphics and posters on social media, just like the one that had prompt: “A vertical poster of a vintage travel ad for Paris, size 1024×1792 pixels.”
Think like a movie regime
To get a picture that is able to evoke emotions, it sometimes helps to think as if you are a photographer in the real world. Think of camera angle or composition techniques; Turn them up if necessary. The result can dramatically change what a picture looks like.
Instead of a flat, death-on-image, you can request angles such as close-up, bird’s eye view or over-shoulder. The same applies to composition styles and expressions such as ‘symmetrical composition’ or ‘depth of field.’
Here’s how to get the following image from this prompt: “A dramatic over-shot of a lonely cowboy standing on a rocky rock staring at the big desert landscape below. Sol Canyon.
Iterate, Iterate, Iterate
One of Dall-e 3’s lesser known but very effective tricks is to tell you what is not to be included. This helps avoid unwanted elements in your image. It can mean specifying negative elements such as colors, objects or styles you do not want, or refining the style and mood knows what you do not want it to feel.
This is how I got the picture below using the prompt: “A peaceful park in the fall with a young woman, no litter, just a quiet, quiet moment in nature.”
Be too specific
Think of Dall-E 3 as a very literal genie: It gives you exactly what you ask for, no more, no less. So if you write “a dog”, don’t be surprised when spitting a random dog of indefinite breed, vibe or moral adaptation. The more details you include – such as race, color, setting, mood or even art style – the better the results are.
As an example, you might start with: “A guide that throws a spell,” but you would feel better to submit: “An elderly wizard with a long, braided white beard, dressed in emerald-green clothes embroidered with gold Runes that spell out a swirling swirling of blue lightning from the fingertips of a stormy mountain landscape. “You can see both below.