NSFW Wan 1.3b T2V 是一个强大的文本转视频生成模型,拥有 13 亿参数,专门针对生成“不适合工作场所观看”(Not Safe For Work, NSFW)内容进行了微调。该模型在从大约 1,250 个专注于 NSFW 的子论坛(subreddit)中精选出的前 1,000 个帖子组成的广泛而多样的数据集上进行了训练。

Wan 2.1 的主要目标是提供一个研究和创意工具,能够基于成人内容领域的文本提示(prompt)生成连贯且主题相关的短视频片段。它旨在理解和渲染自然语言描述的各种 NSFW 场景、美学和动作。

根据文字提示生成成人视频内容 该模型基于一个包含大约 1,250 个不同的 NSFW 子版块中排名前 1,000 帖子的数据集训练。

Model Description

NSFW Wan 1.3b T2V is a powerful text-to-video generation model, with 1.3 billion parameters, specifically fine-tuned for generating Not Safe For Work (NSFW) content. This model has been trained on an extensive and diverse dataset curated from the top 1,000 posts across approximately 1,250 NSFW-focused subreddits.

The primary goal of Wan 2.1 is to provide a research and creative tool capable of generating coherent and thematically relevant short video clips based on text prompts within the adult content domain. It aims to understand and render a wide array of NSFW scenarios, aesthetics, and actions described in natural language.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Model Details

  • Architecture: Wan 2.1 (Text-to-Video Transformer Architecture)
  • Parameters: 1.3 Billion
  • Type: Text-to-Video (T2V)
  • Specialization: NSFW Content Generation

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Note

Since the checkpoint was only fine-tuned on images, you may see some deterioration throughout the video. That is to be expected, and in my testing was easily resolved by applying a LoRA, which I would recommend doing at this time to get the desired motion, style, and video quality.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Training Data

The model was trained on a dataset comprising the top 1,000 posts from approximately 1,250 distinct NSFW subreddits. This dataset was carefully curated to capture a broad spectrum of adult themes, visual styles, character archetypes, specific kinks, and actions prevalent in these online communities.

The captions associated with the training data leveraged the language and tagging conventions found within these subreddits. For insights into effective prompting strategies for specific styles or content, please refer to the prompting-guide.json file included in this repository.

Note: Due to the nature of the source material, the training dataset inherently contains explicit adult content.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Training Procedure

  • Hardware: Trained on a cluster of 8x A100 GPUs.
  • Epochs: 10 epochs.
  • Duration: Approximately 3 days.
  • Checkpoints: Model weights are provided for each epoch (wan_1.3B_e1.safetensors through wan_1.3B_e10.safetensors). This allows users to select the checkpoint that best balances fidelity, generalization, and specific stylistic nuances for their needs. Early epochs might be more creative or varied, while later epochs may show higher fidelity to the training data.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Files Included

  • wan_1.3B_e1.safetensors
  • wan_1.3B_e2.safetensors
  • wan_1.3B_e3.safetensors
  • wan_1.3B_e4.safetensors
  • wan_1.3B_e5.safetensors
  • wan_1.3B_e6.safetensors
  • wan_1.3B_e7.safetensors
  • wan_1.3B_e8.safetensors
  • wan_1.3B_e9.safetensors
  • wan_1.3B_e10.safetensors
  • prompting-guide.json: This crucial JSON file contains an analysis of common keywords, phrases, and descriptive language associated with the content from various source subreddits. It is designed to help users craft more effective prompts by understanding the vocabulary the model was trained on for different niches.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

How to Use

This model is intended for generating short video clips (typically a few seconds) from descriptive text prompts.

  1. Select an Epoch Checkpoint: Experiment with different wan_1.3B_e{i}.safetensors files. Later epochs might offer more refined results for common themes, while earlier ones could be explored for broader interpretations.
  2. Craft Your Prompt: Utilize natural language to describe the desired scene, subjects, actions, and style.
  3. Consult prompting-guide.json: For best results, especially when targeting specific sub-community styles or niche fetishes, refer to the prompting-guide.json. This guide will provide insights into the terminology and phrasing most likely to elicit the desired output based on the training data's captioning patterns.
  4. Generate: Use your preferred inference pipeline compatible with this model architecture.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Ideal Base for LoRA Fine-Tuning

While Wan 2.1 1.3B T2V is a capable NSFW model on its own, its true strength for many users lies in its efficacy as a foundational base for training specialized LoRAs (Low-Rank Adaptations).

The extensive NSFW training provides a robust understanding of:

  • Core NSFW Anatomy: It already has a strong grasp of how to depict features like a penis, vagina, breasts, etc.
  • Common Sexual Acts: Concepts like blowjobs, masturbation, various sexual positions, and basic interactions are part of its foundational knowledge.
  • General NSFW Aesthetics: It understands common lighting, settings, and visual cues within adult content.

This means you don't need to teach your LoRA these fundamental NSFW building blocks from scratch. Instead, you can focus your LoRA training dataset exclusively on the specific niche concept, character, artistic style, unique action, specific motion, or specialized terminology you want to master. NSFW Wan will effectively "fill in the rest," leveraging its broad NSFW foundation to complement your targeted LoRA training. This can lead to more efficient LoRA training and better results for highly specific NSFW content generation.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Community & Support

Join our Discord server!

Connect with other users, share your creations, get help with prompting, discuss model updates, and contribute to the community:

https://discord.gg/mjnStFuCYh

We encourage active participation and feedback to help improve future iterations and resources!

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Limitations and Bias

  • NSFW Focus: The model's knowledge is heavily biased towards the content prevalent in the NSFW subreddits it was trained on. It will likely perform poorly on SFW (Safe For Work) prompts or concepts far removed from its training data.
  • Specificity & Artifacts: While trained for detail, the model may still produce visual artifacts, anatomical inaccuracies, or fail to perfectly capture highly complex or nuanced prompts. Video generation is an evolving field.
  • Bias: The training data reflects the content, biases, preferences, and potentially problematic depictions present in the source NSFW communities. The model may generate content that perpetuates these biases.
  • Safety: This model does not have built-in safety filters to prevent the generation of potentially harmful or offensive interpretations of NSFW content, beyond the scope of its training data. Users are responsible for the ethical application of the model.
  • Temporal Coherence: While a T2V model, very long or complex actions might still exhibit some temporal inconsistencies.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Ethical Considerations & Responsible AI

This model is intended for adult users (18+/21+ depending on local regulations) only.

  • Consent and Harm: This model generates fictional, synthetic media. It must not be used to create non-consensual depictions of real individuals, to impersonate, defame, harass, or generate content that could cause harm.
  • Legal Use: Users are solely responsible for ensuring that their use of this model and the content they generate complies with all applicable local, national, and international laws and regulations.
  • Distribution: Exercise extreme caution and responsibility if distributing content generated by this model. Be mindful of platform terms of service and legal restrictions regarding adult content.
  • No Endorsement: The creators of this model do not endorse or condone the creation or distribution of illegal, unethical, or harmful content.

We strongly recommend users familiarize themselves with responsible AI practices and the potential societal impacts of generative NSFW media.

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

License

Steal this model!

<svg viewBox="0 0 256 256" preserveAspectRatio="xMidYMid meet" height="1em" width="1em" aria-hidden="true" xmlns="http://www.w3.org/2000/svg" class="text-gray-500 hover:text-black dark:hover:text-gray-200 w-4"></svg>

Disclaimer

The outputs of this model are entirely synthetic and computer-generated. They do not depict real people or events unless explicitly prompted to do so with user-provided data (which is not the intended use of this pre-trained model). The developers of this model are not responsible for the outputs created by users.


©️版权声明:若无特殊声明,本站所有文章版权均归AI工具集原创和所有,未经许可,任何个人、媒体、网站、团体不得转载、抄袭或以其他方式复制发表本站内容,或在非我站所属的服务器上建立镜像。否则,我站将依法保留追究相关法律责任的权利。

类似网站