Text to Image Prompt Guide

This article will talk about building prompts in Perchance's text-to-image-plugin. If you want to know how 'Diffusions' work, see this cool website!

The text-to-image-plugin is funded with ads, so ads will appear on your generator for non-logged-in users if you import this plugin. See the plugin-page for other notes.

Licensing

The Text-to-Image model uses Stable Diffusion and by using it you agree to the Licensing and Terms and Conditions stated in this license.

Per Section III Paragraph 6, Perchance claims no rights in the Output you generate with the model. You (the user) are accountable for the Output you generate and its subsequent uses.

Here are the Use-based restrictions stated in the License (see Attachment A at the bottom of the license):

You agree not to use the Model or Derivatives of the Model:

In any way that violates any applicable national, federal, state, local or international law or regulation;
For the purpose of exploiting, harming or attempting to exploit or harm minors in any way;
To generate or disseminate verifiably false information and/or content with the purpose of harming others;
To generate or disseminate personal identifiable information that can be used to harm an individual;
To defame, disparage or otherwise harass others;
For fully automated decision making that adversely impacts an individual’s legal rights or otherwise creates or modifies a binding, enforceable obligation;
For any use intended to or which has the effect of discriminating against or harming individuals or groups based on online or offline social behavior or known or predicted personal or personality characteristics;
To exploit any of the vulnerabilities of a specific group of persons based on their age, social, physical or mental characteristics, in order to materially distort the behavior of a person pertaining to that group in a manner that causes or is likely to cause that person or another person physical or psychological harm;
For any use intended to or which has the effect of discriminating against individuals or groups based on legally protected characteristics or categories;
To provide medical advice and medical results interpretation;
To generate or disseminate information for the purpose to be used for administration of justice, law enforcement, immigration or asylum processes, such as predicting an individual will commit fraud/crime commitment (e.g. by text profiling, drawing causal relationships between assertions made in documents, indiscriminate and arbitrarily-targeted use).

Prompt Terms

First we will talk about terms that we will be using in this article:

Raw Prompt
- This is the initial/main prompt that you will input to the generator. It contains the things that you want in your image.
Tags
- These are terms, words, or phrases, that are used to modify your Raw Prompt. These add details, fine-tune the composition, or stylize your image. There are four categories of tags:
  - Content Type - These are tags that will specify the image or content type that the AI will generate, e.g. Photograph, Picture, Painting, Sculpture, Model etc.
  - Description - These are tags that will describe or define the Raw Prompt, e.g. Adjectives or Descriptors (beautiful, young, lean, etc.).
  - Style - These are tags that take style from well known artists, art movements, or general style of the image to be generated, e.g. Van Gogh Painting, Cubism, Brutalist Architecture, etc.
  - Composition - These are tags that modify the composition i.e. camera angle, centering of subject, lighting, pose etc.
Seed
- This is the base 'noise' in which the image will be derived from. Seeds are useful for finding effective tags by ensuring a 'similar' image will be generated upon changing the tag.
Guidance Scale (CFG - Classifier Free Guidance)
- This is a scale on how much the AI will try to adhere or match the prompt. It's values are from 1 to 30 where 7 is default. ? Higher Values = Closer to Prompt (More Chaotic), ? Lower Values = Farther to Prompt (More Realistic).
Negative Prompt
- This is the prompt that will try to subtract or remove the items that will be in the final image. This is useful for preventing things to appear in your iamge.
Resolution
- This is the resolution/base size of the image that will be generated. There are currently three:
  1. 512x512 which is a square
  2. 512x768 which is a portrait/vertical rectangle
  3. 768x512 which is a landscape/horizontal rectangle.

Prompt Settings

Prompt Setting Management

Based on the text-to-image-plugin page, we can do two things with prompt settings: (1) have them all inline or (2) have them in a list.

We would recommend using the list method like so:

promptData
  prompt = painting of [character] in [place], [season]
  seed = 123
  size = 400
  style = border:4px solid blue; margin-top:20px;

Compared to:

prompt
  [character] in [place] (size:::400) (seed:::123)

Since in the long run, it will be more readable in the list form compared to the inline method.

The settings that are important are (in order of importance):

prompt
negativePrompt
guidanceScale
resolution
seed
width (for portrait, height if landscape, size if square)

Here is a template of that settings.

promptSetting
  prompt =
  negativePrompt =
  guidanceScale =
  resolution =
  seed =
  width =

Prompt and Tags Management

For managing the prompt and tags, we would recommend also having them in multiple lines instead of inline or one line.

If you haven't checked out the User Input and Output Formatting article, which talks about the $output keyword, I would recommend reading through it as you will understand the following formatting easier.

The following are the benefits of having the tags like so:

Easier Reordering and Commenting of Tags
- This allows the user to reorder/comment the tags fast compared to inline in which the user needs to click and drag items to delete and if the tag is effective in the prompt, they need to redo the deletion.
- Having it in multiple lines, we can just comment the tag that we won't want and if we want it back, we can just comment it back.
- Reordering tags in inline needs copy and pasting while in multiline, we can just use Ctrl+Shift+Up/Down to move them and Shift+Up/Down to select mutliple lines to move or comment (see Keyboard Shortcuts for more shortcuts).
Readability
- Having it in multiline we can isolate tags easily compared to a paragraph of tags.

Here is an example of the multiline Prompt and Tags Management

rawPrompt = ...

prompt
  $output = [this.selectAll.joinItems(", ")]
  [rawPrompt]
  [tags]

tags
  $output = [this.selectAll.joinItems(", ")]
  tag1
  tag2
  tag3
  tag4
  ...

negPrompt
  $output = [this.selectAll.joinItems(", ")]
  tag1
  tag2
  tag3
  tag4
  ...

promptSetting
  prompt = [prompt]
  negativePrompt = [negPrompt]
  guidanceScale = 7
  resolution = 512x512
  seed = -1
  size = 400

Here notice that on the prompt, we are joining each lists with a comma and since the output upon calling the list is the joined items, it will be instantly a long list of words deliminated by commas.

Prompt Making

Seeds vs Prompt

Here is an example of difference between prompts and seeds to get variations.

Text to Image Prompt Guide

Licensing

Prompt Terms

Prompt Settings

Prompt Setting Management

Prompt and Tags Management

Prompt Making

Seeds vs Prompt

Negative Prompting

Prompt Ordering

Guidance Scale

Resolution

Advanced Prompt Making Techniques

Seed Hunting

Tags Effect Testing

Tags Emphasis

Blending Tags

ANDing and ORing

Commas in Prompts

Add/Remove Tags during Generation

Tags Per Steps

BREAK Keyword

AI Model Trigger Words

Sample Prompts

t2i-framework Plugin

Getting the Input Settings

Gallery

`BREAK` Keyword

`t2i-framework` Plugin