Question 1

How long does model training take?

Accepted Answer

Full pre-training of a large foundation model can take weeks to months on clusters of hundreds of GPUs and costs millions of pounds. Fine-tuning a personal LoRA model on a consumer GPU, by contrast, can take anywhere from twenty minutes to a few hours depending on dataset size and hardware.

Question 2

What data is used to train AI image and video models?

Accepted Answer

Most large image generation models have been trained on billions of image-text pairs scraped from the internet. Video models add temporal data: sequences of frames with associated captions or metadata. The specific composition of training data varies by model and is often not fully disclosed by developers.

Question 3

What is overfitting and why does it matter for fine-tuning?

Accepted Answer

Overfitting occurs when a model memorises its training data too closely and loses the ability to generalise. In fine-tuning for creative use, an overfitted model might reproduce your reference images too literally, losing flexibility in response to varied prompts. Controlling training steps and data diversity helps avoid this.

Question 4

Can I train my own AI model without a research background?

Accepted Answer

Yes: parameter-efficient fine-tuning methods like LoRA have been made accessible through tools with graphical interfaces and detailed community guides. Full pre-training from scratch remains the domain of well-resourced teams, but meaningful customisation is within reach for technically curious creators.

Question 5

What is the difference between training and fine-tuning?

Accepted Answer

Training (or pre-training) builds a model's capabilities from the ground up on a massive dataset. Fine-tuning takes an already trained model and continues training on a smaller, more specific dataset to specialise its behaviour: it is far cheaper and faster than training from scratch.

Question 6

How does training data affect bias in AI outputs?

Accepted Answer

A model reflects the patterns present in its training data. If the data over-represents certain demographics, aesthetics, or cultural viewpoints, the model will reproduce those biases in its outputs. This is a significant and ongoing challenge in AI development, particularly for models used in public-facing creative production.

Model Training

What is Model Training?

Direct scenes, design characters, and ship full films

Types and variations

Ready to make your first scene in Morphic?

Common use cases

Direct scenes, design characters, and ship full films

FAQs