How to Evaluate Model Complexity in PyTorch

那棵树看起来生气了

2024-03-15 11:40:50


Preface

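FLOPS (Floating-Point Operations per Second)

The number of floating-point operations executed per second. It measures the computing speed of a device and is mainly used to gauge hardware performance.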

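GFLOPS (Giga-FLOPS)

One billion (10^9) floating-point operations per second. Like FLOPS, it mainly measures hardware performance. Written with a lowercase "s", GFLOPs instead denotes 10^9 floating-point operations, which is a count rather than a rate.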

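FLOPs (Floating-Point Operations)

The total number of floating-point operations executed by a task or algorithm. It measures the amount of computation and is mainly used to gauge the complexity of a model or algorithm.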
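
As a rough illustration of how such a count is obtained, the sketch below works out the multiply-accumulate operations (MACs) and FLOPs of a single 2D convolution by hand. The function name `conv2d_macs` and the layer sizes are hypothetical, bias and activation costs are ignored, and it assumes the common convention that one MAC equals two FLOPs.

```python
def conv2d_macs(c_in, c_out, kernel_h, kernel_w, out_h, out_w, groups=1):
    """MACs for one 2D convolution, ignoring bias (hypothetical helper)."""
    # Each output element needs (c_in / groups) * kernel_h * kernel_w multiply-adds.
    macs_per_output = (c_in // groups) * kernel_h * kernel_w
    return macs_per_output * c_out * out_h * out_w


# Illustrative layer: 3 -> 16 channels, 3x3 kernel, 224x224 output map.
macs = conv2d_macs(c_in=3, c_out=16, kernel_h=3, kernel_w=3, out_h=224, out_w=224)
flops = 2 * macs  # convention: 1 MAC = 1 multiply + 1 add = 2 FLOPs

print(f"MACs:  {macs:,}")
print(f"FLOPs: {flops:,}")
```

Summing this quantity over every layer gives the kind of whole-model figure that profiling tools report.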

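OPS

OPS (operations per second) is the number of operations a device can execute per second, and it indicates the device's processing capacity for workloads such as image and audio processing. TOPS (tera operations per second) is 10^12 operations per second. These figures usually refer to integer throughput, typically INT8.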
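
To connect the two kinds of metric: dividing a model's FLOPs (an amount of work) by a device's peak FLOPS (a rate) gives a theoretical lower bound on the time per forward pass. The sketch below is a minimal illustration; `min_latency_seconds` is a hypothetical helper and both numbers are made up for the example. Real latency is higher because of memory bandwidth limits and imperfect hardware utilization.

```python
def min_latency_seconds(model_flops, peak_flops_per_second):
    """Lower bound on time per forward pass: work divided by peak rate."""
    if peak_flops_per_second <= 0:
        raise ValueError("peak_flops_per_second must be positive")
    return model_flops / peak_flops_per_second


GIGA = 10**9
TERA = 10**12

# Illustrative numbers only: a 4 GFLOPs model on a device with 10 TFLOPS peak.
model_flops = 4 * GIGA
peak = 10 * TERA

latency = min_latency_seconds(model_flops, peak)
max_fps = 1.0 / latency

print(f"lower-bound latency: {latency * 1e3:.3f} ms")
print(f"upper-bound throughput: {max_fps:.0f} frames/s")
```

A measured frame rate far above this bound usually points to a problem in the measurement rather than to fast hardware.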

Evaluating model complexity

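import time

import torch
from thop import profile, clever_format

from net.net import net  # replace with the import for your own model

# Input resolution (4K UHD)
width = 3840
height = 2160

"""
Sample output on an NVIDIA RTX 3090:
Number of parameters: 341.767K
Size of model: 1.30 MB
Computational complexity: 2.828T MACs
device: cuda - fps: 1304.787
"""


def compute_FLOPs_and_model_size(model):
    # PyTorch image tensors are laid out as (N, C, H, W).
    device = next(model.parameters()).device
    input = torch.randn(1, 3, height, width, device=device)
    # thop.profile returns multiply-accumulate operations (MACs) and the
    # parameter count; one MAC is commonly counted as two FLOPs.
    macs, params = profile(model, inputs=(input,), verbose=False)
    return macs, params


@torch.no_grad()
def compute_fps(model, shape, epoch=100, device=None, warmup=10):
    if device is None:
        device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
    device = torch.device(device)
    model = model.to(device).eval()
    use_cuda = device.type == "cuda"

    # Create the input once, on the target device, so it is not timed.
    data = torch.randn(shape, device=device)

    # Warm-up iterations keep one-time setup costs out of the measurement.
    for _ in range(warmup):
        model(data)

    total_time = 0.0
    for _ in range(epoch):
        # CUDA kernels run asynchronously, so synchronize before reading the clock.
        if use_cuda:
            torch.cuda.synchronize()
        start = time.perf_counter()
        model(data)
        if use_cuda:
            torch.cuda.synchronize()
        end = time.perf_counter()

        total_time += end - start

    return epoch / total_time


def test_model_flops():
    model = net()  # use your own model here
    device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')
    model.to(device)

    macs, params = compute_FLOPs_and_model_size(model)

    # Assumes float32 weights: 4 bytes per parameter.
    model_size = params * 4.0 / 1024 / 1024
    macs, params = clever_format([macs, params], "%.3f")

    print('Number of parameters: {}'.format(params))
    print('Size of model: {:.2f} MB'.format(model_size))
    print('Computational complexity: {} MACs'.format(macs))


def test_fps():
    model = net()  # use your own model here

    device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')
    fps = compute_fps(model, (1, 3, height, width), device=device)
    print('device: {} - fps: {:.3f}'.format(device.type, fps))


if __name__ == '__main__':
    test_model_flops()
    test_fps()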
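
The `params * 4.0` factor in the script assumes 32-bit floating-point weights. The sketch below repeats the same size calculation for other storage precisions. `model_size_mb` and the `BYTES_PER_PARAM` table are hypothetical names, and the parameter count of 341,767 is read off the "341.767K" figure in the sample output, so its last digits are approximate.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def model_size_mb(num_params, dtype="float32"):
    """Approximate weight storage, using 1 MB = 1024 * 1024 bytes as in the script."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024 / 1024


num_params = 341_767  # approximate, from the "341.767K" figure in the sample output

for dtype in BYTES_PER_PARAM:
    print(f"{dtype:>8}: {model_size_mb(num_params, dtype):.2f} MB")
```

This counts parameters only; buffers and any file-format overhead of a saved checkpoint are not included.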


Copyright: 那棵树看起来生气了

Article link: https://dengyb.com/archives/64.html (when reposting, please credit the source and include this link)

License: Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
