
Stop treating AI generators like search engines and start treating them like camera sensors. Your photography knowledge—f-stops, focal lengths, and lighting ratios—is your biggest unfair advantage. This guide translates physical lens physics into AI tokens, teaching you how to force Midjourney and DALL-E to respect the laws of optics. The secret? Don't just name the setting; describe the visual consequence of that setting.
Listen up. You’ve spent years mastering the exposure triangle. You know that an 85mm lens flatters a face and a 16mm lens distorts it for drama. You know exactly why golden hour hits different than high noon.
Most people prompting AI are guessing. They type "cool photo" and hope for the best. You? You actually know how light works.
But here’s the friction: AI doesn't have a sensor. It doesn't have glass elements, a shutter curtain, or a physical aperture. It has a neural network trained on billions of image-text pairs. When you type "f/1.8," the AI doesn't mechanically open a lens; it looks for patterns in its training data labeled "f/1.8."
To master prompt engineering, you need to stop thinking like a technician and start thinking like a translator. You are translating the physics of optics into the logic of tokens. This deep dive will show you exactly how to do that.
When you press the shutter on your Sony A7IV, physics happens. Photons hit the sensor. When you hit "Enter" on a prompt, semantic association happens.
If you prompt Canon R5, 50mm lens, the AI doesn't simulate the optics of that specific lens. Instead, it retrieves the vibe associated with photos tagged with that gear. It pulls up high-resolution, sharp, generally commercial-looking textures.
The Golden Rule of AI Photography:
The model understands the effect better than the setting.
While f/11 might mean "deep depth of field" to you, the AI often ignores the number. But if you type deep depth of field, everything in focus, hyper-detailed background, the AI understands the visual intent. We need to merge these approaches: use the technical term to ground the aesthetic, and the descriptive term to enforce the physics.
Let’s break down the exposure triangle and translate it into language that Midjourney (MJ) and DALL-E actually respect.
The Mistake: Typing f/1.8 and expecting a perfect portrait blur.
The Fix: Combine the f-stop with the visual description of bokeh.
Shallow depth of field, subject isolation, creamy bokeh, blurred background, macro photography.Deep depth of field, edge-to-edge sharpness, hyper-detailed, everything in focus, architectural photography.
AI models are surprisingly good at mimicking focal length because the visual difference between 16mm and 200mm is drastic in their training data.
Wide angle lens, fisheye, GoPro footage, panoramic, distortion, dynamic perspective.35mm street photography, 50mm prime, human eye view, documentary style.Telephoto lens, 85mm portrait, background compression, zoom lens, intimate.
Since AI generates static images, shutter speed is purely about motion artifacts.
Frozen action, suspended in air, crisp details, high speed photography, stop motion.Motion blur, long exposure, light trails, ethereal, ghosting, smooth water.
This is where you win. Beginners write "good lighting." You write specific lighting setups.
haze, fog, or dust.off-center composition or negative space.
Instead of describing 20 adjectives, use a specific film stock or camera system. These are "macro" tokens that carry a massive amount of weight.
Shot on Sony A7R IV, Phase One XF, 100MP, ultra-sharp, unreal engine 5 (for texture).35mm film, medium format film, grain, light leaks.Midjourney is the "Art Director." It cares about style and aesthetics more than strict logic.
--style raw to reduce the AI's tendency to "beautify" everything. This makes photos look more like photos and less like digital art.--stylize (0-1000). Lower values (e.g., --s 50) stick closer to your prompt. Higher values (--s 750) let the AI get creative.--ar 3:2 (Standard 35mm) or --ar 16:9 (Cinematic).
DALL-E is the "Literalist." It listens to conversational instructions.
Don't expect the perfect shot in one go. Treat it like a photoshoot. You take a test shot, chimp the screen, adjust settings, and shoot again.
--no to remove unwanted elements. --no illustration, painting, drawing, cgi helps force photorealism.Photo of a dog, 35mm is better than a 200-word paragraph. Let the AI fill in the blanks, then steer it.
AI isn't replacing the photographer's eye; it's digitizing it. The technical barriers of cost and gear are gone. You no longer need a $50,000 Phase One camera to get the Phase One look. But you do need the knowledge of what that look is.
Your understanding of light, composition, and optics is the bridge between a generic generation and a masterpiece. Use the vocabulary of the lens. Force the machine to see the world the way you do.
Now, go shoot. (Or type).

Grab 10 of my Most used lightroom presets
+Get weekly updates on our
projects and client stories
ABOUT
HEY, I’M DREW I AM A DIGTAL CREATOR.
LEGAL
QUICK LINKS
SUBSCRIBE

Copyright drewdeltz 2025. All Rights Reserved.
AS SEEN ON
