Can someone help me make a better ChatGPT photo prompt?

I used ChatGPT to create a photo prompt, but the results keep coming out wrong and don’t match what I was trying to describe. I think my wording is too vague or missing important details. I need help figuring out how to write a clearer ChatGPT photo prompt so I can get more accurate image results.

Your prompt is likely missing structure.

Most bad image prompts fail in 3 spots:

  1. Subject is vague.
  2. Camera and lighting are missing.
  3. Style conflicts with scene details.

Use this template:

Subject, age, gender, pose, clothing, facial expression.
Setting, time of day, weather, background details.
Camera angle, lens, framing, depth of field.
Lighting type, color tone.
Style, realism level, image quality.
Negative prompt.

Example:
“35-year-old woman, short black hair, red raincoat, standing on a wet city street, looking left, serious expression. Nighttime, neon signs, light fog, reflections on pavement. Medium shot, eye-level angle, 50mm lens, shallow depth of field. Cinematic lighting, blue and magenta tones. Photorealistic, high detail, natural skin texture. Negative prompt: extra fingers, blurry face, distorted eyes, warped hands, text, watermark.”

If results still drift, cut fluff. Use fewer ideas. One scene. One mood. One style. If you want, post your original prompt and I’ll help rewrite it.

You probably don’t just need a “better prompt.” You need a better feedback loop.

I mostly agree with @cacadordeestrelas about structure, but I’d add this: a lot of image prompts fail because people describe what they mean emotionally, not what the model can actually render visually. “Powerful,” “dreamy,” “editorial,” “cool vibe” sounds useful to humans, but the model needs visible cues.

So instead of:
“a cool dramatic girl in a stylish city scene”

try:
“young woman with a sharp jawline, slicked-back dark hair, long black wool coat, standing under a glowing bus stop sign, wet sidewalk, light rain, chin slightly raised, unsmiling”

That’s more direct. Less poetry, more stuff the model can see.

Also, don’t cram everything into one mega-prompt. Sometimes shorter is better, yeah realy. I’d break it into 3 passes:

  1. Get the subject right
  2. Get the environment right
  3. Add style and polish

If the face is wrong, don’t keep tweaking lighting. If the pose is wrong, don’t fight with camera terms yet. Fix one problem at a time.

Another thing people miss: use reference priorities. Put the most important details first. Models often weight early prompt info more heavily than the tail end.

Try this format:

  • must-have subject traits
  • must-have setting traits
  • optional mood/style traits
  • things to avoid

And if you can, compare your bad result to your intended result in plain language:
“it keeps making her smiling”
“it keeps turning sunset into nighttime”
“it keeps making it look like anime”

That usually makes rewrites way easier than just saying “the prompt feels off.” If you post the exact prompt and what the image is getting wrong, people can probly fix it fast.

I’d push one point a little differently from @cacadordeestrelas: “more specific” is not always “better.” Sometimes the prompt fails because it contains conflicting specifics.

Example:

  • cinematic realism
  • soft pastel dream lighting
  • gritty urban documentary
  • luxury fashion editorial
  • natural candid pose

That’s five different visual directions fighting each other.

A cleaner fix is to decide your hierarchy:

  1. What absolutely cannot change
  2. What can flex
  3. What should be excluded

A good prompt usually has:

  • one subject
  • one setting
  • one lighting idea
  • one camera idea
  • one style target

If you stack three versions of each, the model averages them into mush.

Also, watch out for “label words” that mean different things to different models:

  • dramatic
  • aesthetic
  • moody
  • cinematic
  • realistic

These are not useless, but they’re unstable. Replace them with evidence:

  • low-key lighting
  • hard side light
  • desaturated colors
  • shallow depth of field
  • 85mm portrait framing

Another underrated trick: write your prompt like a casting brief, not a poem. You’re not persuading the model. You’re specifying the shot.

Bad:
“an unforgettable magnetic woman in a powerful modern scene”

Better:
“woman, late 20s, black tailored coat, direct eye contact, standing alone on a wet downtown sidewalk, blue neon reflected in pavement, night, medium shot, realistic skin texture”

Pros for ‘’: can improve readability if you use it to organize prompt versions and compare outputs.
Cons for ‘’: useless if you keep changing ten variables at once.

Post your exact prompt plus what is wrong in the output. That’s usually where the real fix happens.