[AI Text-to-Image] Prompt Collection


This post collects AI text-to-image prompts that I have come across online and tested myself.

Warning

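Unless otherwise noted, all images in this post were generated with GPT Image2 or Google Imagen2. Both require a VPN or proxy to access.

Some prompts in this post are not my own. My original prompts may be AI-generated, and the prompt explanations are AI-generated.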

Character Art to Real Person + Hotpot Restaurant Scene

Important

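For this prompt to work properly, upload the character art or fan art that you want converted into a real-person look to the AI platform as a reference image. Reference images with an unobstructed upper body work best.

Prompt: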

A casual iPhone snapshot of a female cosplayer recreating EXACTLY the character from the reference image.
EXTREMELY STRICT identity match: same hairstyle, same twin tails, same hair color gradients, same hair accessories, same elf ears, identical outfit structure, identical character vibe. Face structure MUST strictly follow the original character proportions and features - highly recognizable as the exact character, NOT a generic pretty face.
Face: very attractive but natural anime-realism beauty, visible pores with slight smoothing, light natural makeup, soft blush on cheeks and nose, skin slightly oily and shiny due to hotpot heat, slight redness from warmth, tiny imperfections (light sweat, uneven texture), NOT overly perfect
Expression: she NOTICES the camera, slight reaction turning toward camera, gives a small casual gesture (like a quick peace sign or slight smile), expression still natural, not staged, eyes briefly looking toward camera but not intensely posing, feels like a quick friendly response, not a photoshoot
Hair: slightly messy from heat and movement, a few strands sticking to face or neck, natural motion blur, slightly flattened from sitting
Outfit: perfectly faithful to original design, real fabric with realistic wrinkles, slightly disordered from sitting and eating, small details slightly shifted
Pose: slightly turns body toward camera, one hand still holding chopsticks or resting on table, other hand casually makes a small gesture (peace sign / slight wave), body posture relaxed and natural, NOT a full pose, just a quick reaction.
Scene (IMPORTANT): Hotpot restaurant, the cosplayer is sitting at a DIFFERENT TABLE (next table or diagonal), near wall or booth seating, background relatively clean (wall panel / mirror), she is still eating with her own group
Environment: cleaner dining area near wall, soft wall lighting, light steam from hotpot, table has meat plates, drinks, sauces.
Framing (VERY IMPORTANT): feels like taken from YOUR OWN TABLE, subject is NOT centered, slightly zoomed-in, awkward crop, part of body slightly cut off
Foreground (EXTREMELY IMPORTANT): your own table dominates foreground, hotpot, soup, chopsticks, plates clearly visible, edge of table blocking lower frame, your arm or shoulder partially blocking view, another diner slightly blocking frame, foreground slightly out of focus.
Camera: bad composition, slightly tilted, minor motion blur, focus slightly off, visible grain, JPEG compression artifacts, lens smudge / greasy blur, finger slightly covering corner.
Lighting: mixed indoor lighting (warm yellow + soft white), slightly uneven exposure, wall lighting softer than central hall, reflections on skin and table surfaces, steam diffusing light. Extra realism: light steam passing in front of subject, subtle background motion blur, minor occlusion (cup / arm / chopsticks blocking area)
Mood: you are eating normally, you notice a very accurate cosplayer at another table, you zoom in to take a photo, she notices and casually reacts with a small friendly gesture, moment feels spontaneous, slight interaction, but still not staged
Style: raw iPhone snapshot, 9:16 vertical, NOT professional, NOT staged, natural candid feeling with slight interaction. Slight playful vibe, like acknowledging being photographed but still casual and natural
Example

Reference image

PID 145960821

Generated image (one of several)

DeepSeek Prompt Analysis

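This is a hyper-realistic "fake candid shot" prompt written for an AI image generation model. Its core goal is not a polished anime wallpaper but an extremely lifelike, lived-in image that looks like a phone snapshot an ordinary person took on the spot in a hotpot restaurant.

To make this clearer, I have broken the prompt down into 4 core layers to show what it is actually "designing":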

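  • Layer 1: Core content, extremely strict character fidelity. The prompt asks the AI to generate a female coser (cosplayer) who reproduces an anime character one hundred percent. It uses "EXTREMELY STRICT" to stress that the hairstyle, twin tails, hair color gradients, hair accessories, elf ears, and outfit structure must match the original image exactly, and that the face must follow the anime character's specific proportions (rather than a generic influencer face). This is what lets the image pass as "a photo of the real person."

  • Layer 2: Scene and action, creating the moment of "being noticed while eating." The scene is a hotpot restaurant where the coser is sitting at the next table, eating with her own friends. She notices your camera and, without deliberately posing, naturally turns sideways and flashes a peace sign or a smile. Her face shows the flush and slight oily shine from the hotpot steam, and her hair is slightly messy from the heat. The design aims for an interactive moment of "she just noticed you photographing her and responded in a friendly way," not a photoshoot.

  • Layer 3: Composition and foreground, the key to the "candid shot" feel (extremely important). This is the most calculated part of the prompt. It forces the subject to be off-center with part of the body cropped out, and makes your own table take up most of the foreground (with the hotpot, meat slices, and dipping sauces on your table clearly visible). At the same time, your arm and shoulder, and even the arm of the friend next to you, partially block the frame, and the hotpot in the foreground is out of focus. Together these create the viewpoint of "you hurriedly pulled out your phone and zoomed in across the table to grab the shot."

  • Layer 4: Image quality and flaws, deliberately "shooting it badly" to add realism. The prompt deliberately asks for every bad habit of phone photography: tilted composition, slight motion blur, missed focus, visible noise, JPEG compression artifacts, glare from greasy fingerprints on the lens, and even a finger covering one corner. The lighting is set to the uneven mix of warm yellow and cool white lights found in a hotpot restaurant. All of this simulates a real iPhone snapshot, using "imperfection" to bring out the "realism." The output is fixed to a 9:16 vertical frame to match phone photo proportions.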

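In one sentence, the underlying logic: the prompt plays on a contrast between "high-precision reproduction" and "low-quality imaging." It forces the AI to render the anime character with extreme accuracy while also forcing it to take the photo in an extremely amateurish way. The aim is that when you see the result, you get a sense of social-media authenticity, "wow, there really is a coser who looks this accurate, and I ran into her at a hotpot restaurant!", rather than "this is yet another polished fake image drawn by AI."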
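The four-layer reading above can be sketched in code. The following is a minimal illustration, assuming the prompt keeps its `Label: ...` line structure. The `LAYERS` mapping and both function names are hypothetical; the grouping is one interpretation of the analysis, not something the prompt itself defines.

```python
import re

# Hypothetical mapping from the four layers in the analysis to the prompt's
# section labels. This grouping is an assumption made for illustration.
LAYERS = {
    "character_fidelity": ["Face", "Hair", "Outfit"],
    "scene_and_action": ["Expression", "Pose", "Scene", "Environment", "Mood"],
    "framing_and_foreground": ["Framing", "Foreground"],
    "quality_and_flaws": ["Camera", "Lighting", "Style"],
}

# Matches lines such as "Face: ..." or "Scene (IMPORTANT): ...".
SECTION_RE = re.compile(r"^([A-Z][a-z]+)(?: \([A-Z ]+\))?:\s*(.*)$")


def split_sections(prompt: str) -> dict[str, str]:
    """Split a prompt into {label: text} using its 'Label:' line prefixes."""
    sections = {}
    for line in prompt.splitlines():
        match = SECTION_RE.match(line.strip())
        if match:
            sections[match.group(1)] = match.group(2)
    return sections


def group_by_layer(sections: dict[str, str]) -> dict[str, list[str]]:
    """List which section labels are present in each of the four layers."""
    return {
        layer: [label for label in labels if label in sections]
        for layer, labels in LAYERS.items()
    }


# Shortened sample; paste the full prompt here to inspect it.
sample = "Face: natural\nScene (IMPORTANT): hotpot restaurant\nCamera: slightly tilted"
print(group_by_layer(split_sections(sample)))
```

Grouping the sections this way makes it easier to see which layer a given edit to the prompt will affect.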

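Ways to adapt the prompt:

  1. Aspect ratio: in the last line of the prompt, "9:16 vertical" specifies a 9:16 portrait frame. To generate a landscape image, change vertical to horizontal and set the ratio as you like.
  2. Avoiding over-interpretation: the second line of the prompt lists character features common in anime art, including "twin tails" and "elf ears." If the character in your reference image lacks either feature, delete or modify it so that the AI does not over-interpret the reference image and produce a result that drifts from the original.
  3. Gesture/posture: the "Expression/Scene" lines of the prompt can be changed to more specific descriptions.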
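The first two adaptations can be applied with plain string replacement. Below is a minimal sketch, assuming the prompt contains the literal phrases "9:16 vertical" and "same <feature>, " as in the text above. The function name `adapt_prompt` and its parameters are hypothetical, chosen only for this example.

```python
def adapt_prompt(prompt, orientation="vertical", ratio="9:16", drop_features=()):
    """Return a copy of the prompt with the frame and feature list adjusted.

    Assumes the prompt uses the literal phrases "9:16 vertical" and
    "same <feature>, "; other wordings are left unchanged.
    """
    # Adaptation 1: swap the aspect ratio and orientation in the Style line.
    adapted = prompt.replace("9:16 vertical", f"{ratio} {orientation}")
    # Adaptation 2: drop features that the reference character does not have.
    for feature in drop_features:
        adapted = adapted.replace(f"same {feature}, ", "")
    return adapted


# Shortened excerpt of the identity and Style lines, for demonstration only.
excerpt = (
    "same hairstyle, same twin tails, same hair accessories, same elf ears, "
    "identical outfit structure. Style: raw iPhone snapshot, 9:16 vertical"
)
print(adapt_prompt(excerpt, orientation="horizontal", ratio="16:9",
                   drop_features=["twin tails", "elf ears"]))
```

The third adaptation (gesture and posture) is free-form wording, so it is better edited by hand than with a helper like this.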

[AI Text-to-Image] Prompt Collection
https://justpureh2o.cn/articles/6358/
Author: JustPureH2O
Published: 2026-06-20
License: CC BY-NC-SA 4.0
