AI Video Production: The Chubby Cat's Annual Checkup

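Videos of a chubby orange cat have recently gone viral online. They track trending news topics, with the cat playing many roles, such as a cat performing surgery, a Shaolin cat, and a cat working a sewing machine. The character is cute and the plots follow current events, which has made the videos very popular with viewers. How are these videos made? The following explains how to make a chubby orange cat video.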

May 14, 2025

torch Tensors

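Tensors are a specialized data structure, very similar to arrays and matrices. In PyTorch, we use tensors to encode the inputs and outputs of a model, as well as the model's parameters.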

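Tensors are similar to NumPy's ndarrays, except that tensors can run on GPUs or other hardware accelerators. In fact, tensors and NumPy arrays can often share the same underlying memory, which removes the need to copy data (see Bridge with NumPy). Tensors are also optimized for automatic differentiation (covered later in the Autograd section). If you are familiar with ndarrays, you will be right at home with the Tensor API. If not, follow along!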

import torch
import numpy as np
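As a minimal sketch of the accelerator point above, the snippet below places a tensor on a GPU when one is available and falls back to the CPU otherwise. The names `device` and `t` are illustrative and not from the original post.

```python
import torch

# Pick an accelerator if one is present; otherwise stay on the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

t = torch.ones(2, 3)   # tensors are created on the CPU by default
t = t.to(device)       # .to() returns a tensor on the requested device
print(t.device)
```

The same code runs unchanged on machines with and without a GPU, which is why the availability check is usually done once at the top of a script.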

Initializing a Tensor

Tensors can be initialized in various ways. Take a look at the following examples:

Directly from data

Tensors can be created directly from data. The data type is automatically inferred.

data = [[1, 2],[3, 4]]
x_data = torch.tensor(data)
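To make the inference rule concrete, here is a small sketch. The variable names `x_int`, `x_float`, and `x_half` are illustrative. It assumes PyTorch's default floating-point dtype has not been changed with `torch.set_default_dtype`.

```python
import torch

# Python ints are inferred as 64-bit integers.
x_int = torch.tensor([[1, 2], [3, 4]])
print(x_int.dtype)    # torch.int64

# A single float in the data promotes the whole tensor to the default float dtype.
x_float = torch.tensor([[1.0, 2], [3, 4]])
print(x_float.dtype)  # torch.float32 under the default settings

# Inference can be overridden by passing dtype explicitly.
x_half = torch.tensor([[1, 2], [3, 4]], dtype=torch.float16)
print(x_half.dtype)   # torch.float16
```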

From a NumPy array

Tensors can be created from NumPy arrays (and vice versa; see Bridge with NumPy).

np_array = np.array(data)
x_np = torch.from_numpy(np_array)
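The memory sharing mentioned earlier can be checked directly. The sketch below is self-contained and repeats the array creation. `x_copy` and `back` are illustrative names. It contrasts `torch.from_numpy`, which shares the array's buffer, with `torch.tensor`, which copies the data.

```python
import numpy as np
import torch

np_array = np.array([[1, 2], [3, 4]])
x_np = torch.from_numpy(np_array)   # shares memory with np_array

np_array[0, 0] = 100                # an in-place change to the array...
print(x_np[0, 0].item())            # ...is visible through the tensor: 100

x_copy = torch.tensor(np_array)     # torch.tensor copies the data
np_array[0, 0] = -1
print(x_copy[0, 0].item())          # the copy keeps the old value: 100

back = x_np.numpy()                 # CPU tensor -> array, again without copying
print(np.shares_memory(back, np_array))
```

Sharing avoids a copy, but it also means a change on either side silently affects the other, so use `torch.tensor` or `.clone()` when an independent copy is needed.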


May 12, 2025

Quickstart

This section runs through the API for common tasks in machine learning. Refer to the links in each section to dive deeper.

Working with data

PyTorch has two primitives for working with data: torch.utils.data.DataLoader and torch.utils.data.Dataset. Dataset stores the samples and their corresponding labels, and DataLoader wraps an iterable around the Dataset.
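The division of labor between the two primitives can be sketched with a toy dataset. `ToyDataset` and its random contents are hypothetical and exist only for illustration. The tutorial itself uses a TorchVision dataset instead.

```python
import torch
from torch.utils.data import DataLoader, Dataset


class ToyDataset(Dataset):
    """Hypothetical dataset: random 4-feature samples with binary labels."""

    def __init__(self, n_samples=10):
        generator = torch.Generator().manual_seed(0)
        self.samples = torch.randn(n_samples, 4, generator=generator)
        self.labels = torch.randint(0, 2, (n_samples,), generator=generator)

    def __len__(self):
        # Number of samples in the dataset.
        return len(self.samples)

    def __getitem__(self, idx):
        # One (sample, label) pair.
        return self.samples[idx], self.labels[idx]


dataset = ToyDataset()
loader = DataLoader(dataset, batch_size=4, shuffle=False)

batch_shapes = []
for X, y in loader:
    # The DataLoader stacks individual samples into batches.
    batch_shapes.append((tuple(X.shape), tuple(y.shape)))
    print(X.shape, y.shape)
```

With 10 samples and a batch size of 4, the loader yields two full batches and a final batch of 2. Passing `drop_last=True` would discard that partial batch.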

import torch
from torch import nn
from torch.utils.data import DataLoader
from torchvision import datasets
from torchvision.transforms import ToTensor

PyTorch offers domain-specific libraries such as TorchText, TorchVision, and TorchAudio, all of which include datasets. For this tutorial, we will be using a TorchVision dataset.

The torchvision.datasets module contains Dataset objects for many real-world vision datasets such as CIFAR and COCO (the TorchVision documentation lists them all). In this tutorial, we use the FashionMNIST dataset. Every TorchVision Dataset includes two arguments: transform and target_transform, which modify the samples and labels respectively.
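The roles of the two arguments can be illustrated without downloading anything. In the sketch below, `TinyImageDataset` is a hypothetical stand-in that only mimics the transform/target_transform convention. It is not a TorchVision class, and the scaling and one-hot functions are example choices.

```python
import torch
import torch.nn.functional as F
from torch.utils.data import Dataset


class TinyImageDataset(Dataset):
    """Hypothetical dataset mimicking the transform/target_transform convention."""

    def __init__(self, transform=None, target_transform=None):
        # Four fake 2x2 grayscale images with pixel values in 0..255, labels 0..3.
        self.images = torch.arange(16, dtype=torch.uint8).reshape(4, 2, 2) * 17
        self.targets = [0, 1, 2, 3]
        self.transform = transform
        self.target_transform = target_transform

    def __len__(self):
        return len(self.targets)

    def __getitem__(self, idx):
        img, target = self.images[idx], self.targets[idx]
        if self.transform is not None:
            img = self.transform(img)               # applied to the sample
        if self.target_transform is not None:
            target = self.target_transform(target)  # applied to the label
        return img, target


ds = TinyImageDataset(
    transform=lambda img: img.float() / 255.0,  # scale pixels to [0, 1]
    target_transform=lambda y: F.one_hot(torch.tensor(y), num_classes=4).float(),
)

img, label = ds[3]
print(img.dtype, label)
```

Keeping both hooks as constructor arguments lets the same stored data be reused with different preprocessing, for example with augmentation during training and without it during evaluation.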

# Download training data from open datasets.
training_data = datasets.FashionMNIST(
    root="data",
    train=True,
    download=True,
    transform=ToTensor(),
)

# Download test data from open datasets.
test_data = datasets.FashionMNIST(
    root="data",
    train=False,
    download=True,
    transform=ToTensor(),
)
print(len(training_data))