Upload 11 files
- README.md +40 -2
- config.json +21 -0
- merges.txt +0 -0
- special_tokens_map.json +30 -0
- text_encoder.bin +3 -0
- text_encoder.xml +0 -0
- tokenizer_config.json +30 -0
- vae_decoder.bin +3 -0
- vae_decoder.xml +0 -0
- vae_encoder.bin +3 -0
- vae_encoder.xml +0 -0
README.md
CHANGED
@@ -1,6 +1,44 @@
 ---
-license:
+license: creativeml-openrail-m
+extra_gated_prompt: |-
+  This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
+  The CreativeML OpenRAIL License specifies:
+  1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
+  2. Intel claims no rights on the outputs you generate; you are free to use them and are accountable for their use, which must not go against the provisions set in the license
+  3. You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M with all your users (please read the license entirely and carefully)
+  Please read the full license carefully here: https://huggingface.co/spaces/CompVis/stable-diffusion-license
+
+extra_gated_heading: Please read the LICENSE to access this model
 ---
+# SD v1-5 square Model Card
+The original source of this model is [runwayml/stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5).
+This model has been optimized and converted to Intermediate Representation (IR) using OpenVINO's Model Optimizer and POT tool so that it can run on Intel hardware - CPU, GPU, and NPU.
+
+We provide FP16 and INT8 versions of the model. Please note that currently only the unet model is quantized to INT8.
+
+Intended to be used with:
+- GIMP plugin [openvino-ai-plugins-gimp](https://github.com/intel/openvino-ai-plugins-gimp.git)
+- Blender Addon [dream-textures-openvino](https://github.com/intel/dream-textures-openvino)
+
+## Original Model Details
+- **Originally developed by:** Robin Rombach, Patrick Esser
+- **Model type:** Diffusion-based text-to-image generation model
+- **Language(s):** English
+- **License:** [The CreativeML OpenRAIL M license](https://huggingface.co/spaces/CompVis/stable-diffusion-license) is an [Open RAIL M license](https://www.licenses.ai/blog/2022/8/18/naming-convention-of-responsible-ai-licenses), adapted from the work that [BigScience](https://bigscience.huggingface.co/) and [the RAIL Initiative](https://www.licenses.ai/) are jointly carrying out in the area of responsible AI licensing. See also [the article about the BLOOM Open RAIL license](https://bigscience.huggingface.co/blog/the-bigscience-rail-license) on which our license is based.
+- **Model Description:** This is a model that can be used to generate and modify images based on text prompts. It is a [Latent Diffusion Model](https://arxiv.org/abs/2112.10752) that uses a fixed, pretrained text encoder ([CLIP ViT-L/14](https://arxiv.org/abs/2103.00020)) as suggested in the [Imagen paper](https://arxiv.org/abs/2205.11487).
+- **Resources for more information:** [GitHub Repository](https://github.com/CompVis/stable-diffusion), [Paper](https://arxiv.org/abs/2112.10752).
+- **Cite as:**
+
+      @InProceedings{Rombach_2022_CVPR,
+          author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
+          title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
+          booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
+          month     = {June},
+          year      = {2022},
+          pages     = {10684-10695}
+      }
+
+
 # Uses
 
 ## Direct Use

@@ -58,4 +96,4 @@ ability of the model to generate content with non-English prompts is significant
 
 
 ### Intel’s Human Rights Disclaimer:
-Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in applications that do not cause or contribute to a violation of an internationally recognized human right.
+Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in applications that do not cause or contribute to a violation of an internationally recognized human right.
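The README above describes OpenVINO IR sub-models (an .xml graph plus a .bin weights file) meant to run on Intel CPU, GPU, or NPU, normally driven by the GIMP plugin or Blender addon. As a minimal standalone sketch of how such IR files are typically loaded with the OpenVINO runtime - the local directory name is a placeholder and the device choice is an assumption, not something this repository prescribes:

```python
# A minimal sketch, not part of the repository: load one of the converted IR
# sub-models with the OpenVINO runtime (pip install openvino). "CPU" can be
# swapped for "GPU" or "NPU" when the corresponding drivers are installed.
from pathlib import Path

import openvino as ov

model_dir = Path("stable-diffusion-1.5-square")  # hypothetical local checkout of this repo
core = ov.Core()
print("Available devices:", core.available_devices)

# Each sub-model is an .xml graph plus a .bin weights file with the same stem;
# read_model() picks up the .bin automatically when it sits next to the .xml.
text_encoder = core.read_model(str(model_dir / "text_encoder.xml"))
compiled = core.compile_model(text_encoder, device_name="CPU")

# Inspect the compiled inputs/outputs (the CLIP text encoder typically takes
# a [1, 77] tensor of token ids produced by the tokenizer shipped alongside).
for port in compiled.inputs:
    print("input:", port.any_name, port.partial_shape)
for port in compiled.outputs:
    print("output:", port.any_name, port.partial_shape)
```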
config.json
ADDED
@@ -0,0 +1,21 @@
{
    "power modes supported": "yes",
    "best performance": [
        "GPU",
        "GPU",
        "GPU",
        "GPU"
    ],
    "balanced": [
        "NPU",
        "NPU",
        "GPU",
        "GPU"
    ],
    "best power efficiency": [
        "NPU",
        "NPU",
        "NPU",
        "GPU"
    ]
}
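config.json maps each power mode to a list of target devices; the four slots presumably correspond to the pipeline's sub-models (e.g. text encoder, unet, vae), which is an assumption here rather than something the file states. A minimal sketch of reading the table:

```python
# Minimal sketch (assumed usage): read the power-mode table from config.json
# and return the per-stage device list for a requested mode.
import json
from pathlib import Path

config_path = Path("stable-diffusion-1.5-square") / "config.json"  # hypothetical local path
config = json.loads(config_path.read_text())

def devices_for_mode(mode: str) -> list[str]:
    """Return the device list for a power mode such as 'balanced'."""
    if config.get("power modes supported", "no").lower() != "yes":
        raise ValueError("config.json does not advertise power-mode support")
    modes = [key for key, value in config.items() if isinstance(value, list)]
    if mode not in modes:
        raise KeyError(f"unknown power mode {mode!r}; expected one of {modes}")
    return config[mode]

print(devices_for_mode("balanced"))               # ['NPU', 'NPU', 'GPU', 'GPU']
print(devices_for_mode("best power efficiency"))  # ['NPU', 'NPU', 'NPU', 'GPU']
```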
merges.txt
ADDED
The diff for this file is too large to render. See raw diff.
special_tokens_map.json
ADDED
@@ -0,0 +1,30 @@
{
  "bos_token": {
    "content": "<|startoftext|>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}
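special_tokens_map.json assigns CLIP's start/end-of-text markers to the tokenizer roles; note that the end-of-text token also serves as the pad and unk token. A small sketch of inspecting it, assuming the script runs from a local copy of the model directory:

```python
# Minimal sketch: list which literal token string fills each special-token role.
import json

with open("special_tokens_map.json") as f:  # assumed local copy of this repository
    special_tokens = json.load(f)

for role, spec in special_tokens.items():
    # Each entry is an AddedToken-style dict; "content" holds the literal token string.
    print(f"{role}: {spec['content']}")
# bos_token: <|startoftext|>
# eos_token: <|endoftext|>
# pad_token: <|endoftext|>
# unk_token: <|endoftext|>
```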
text_encoder.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f245795911e496f451a01e1a5de70578753c985ed76ffa302966d2e716d2b8d0
size 246133100
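text_encoder.bin (like the other .bin files below) is stored as a Git LFS pointer: the oid line records the SHA-256 of the real weights blob and size its byte count. A small sketch, not part of the repository, of checking a downloaded file against its pointer:

```python
# Minimal sketch: verify an LFS-backed file against its pointer metadata
# (three lines: version, "oid sha256:<hex>", "size <bytes>").
import hashlib
from pathlib import Path

def parse_lfs_pointer(pointer_text: str) -> tuple[str, int]:
    """Return the expected sha256 hex digest and byte size from a Git LFS pointer."""
    fields = dict(line.split(" ", 1) for line in pointer_text.strip().splitlines())
    return fields["oid"].removeprefix("sha256:"), int(fields["size"])

def verify(resolved_file: Path, pointer_text: str) -> bool:
    expected_oid, expected_size = parse_lfs_pointer(pointer_text)
    data = resolved_file.read_bytes()
    return len(data) == expected_size and hashlib.sha256(data).hexdigest() == expected_oid

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:f245795911e496f451a01e1a5de70578753c985ed76ffa302966d2e716d2b8d0
size 246133100
"""
# Prints True once the real weights (not the pointer file) have been downloaded locally.
print(verify(Path("text_encoder.bin"), pointer))
```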
text_encoder.xml
ADDED
The diff for this file is too large to render. See raw diff.
tokenizer_config.json
ADDED
@@ -0,0 +1,30 @@
{
  "add_prefix_space": false,
  "added_tokens_decoder": {
    "49406": {
      "content": "<|startoftext|>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "49407": {
      "content": "<|endoftext|>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    }
  },
  "bos_token": "<|startoftext|>",
  "clean_up_tokenization_spaces": true,
  "do_lower_case": true,
  "eos_token": "<|endoftext|>",
  "errors": "replace",
  "model_max_length": 77,
  "pad_token": "<|endoftext|>",
  "tokenizer_class": "CLIPTokenizer",
  "unk_token": "<|endoftext|>"
}
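tokenizer_config.json declares a standard CLIPTokenizer with model_max_length 77, which is the prompt length the text encoder expects. A hedged sketch of using it with the transformers library - this assumes the full tokenizer files (a vocab.json in addition to the merges.txt shipped here) are available in the local directory, which this upload alone does not guarantee:

```python
# Minimal sketch (assumes vocab.json and merges.txt are present locally alongside
# tokenizer_config.json; otherwise load the tokenizer from the original
# runwayml/stable-diffusion-v1-5 repository's tokenizer subfolder instead).
from transformers import CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("stable-diffusion-1.5-square")  # hypothetical local path

# model_max_length (77) comes from tokenizer_config.json; prompts are padded and
# truncated to this length before being fed to the text encoder IR.
tokens = tokenizer(
    "a photo of an astronaut riding a horse",
    padding="max_length",
    max_length=tokenizer.model_max_length,
    truncation=True,
    return_tensors="np",
)
print(tokens["input_ids"].shape)  # (1, 77)
```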
vae_decoder.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:907ab331d35ce61c6732a0802372d1f809e1dea27184c7297277d743cf6ca206
size 98980680
vae_decoder.xml
ADDED
The diff for this file is too large to render. See raw diff.
vae_encoder.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8170c14a8904c820a2fe3745e8a292118cd15e4805c448c18959524fa697d257
size 68327548
vae_encoder.xml
ADDED
The diff for this file is too large to render. See raw diff.