第一章数组 (Array)#

一句话理解：数组，就是一段连续内存上排列的同类型元素。它是计算机硬件最偏爱的数据结构——没有之一。

1.1 概念直觉 —— What & Why#

什么是数组？#

数组是最古老、最基础的数据结构。它的定义极简：

连续的内存空间
相同类型的元素
通过下标（index）随机访问

就这么三条。但正因为简单，它和 CPU 的硬件特性配合得天衣无缝。

为什么数组快？—— Cache Line 的秘密#

现代 CPU 不会一次只读一个字节。当你访问内存地址 0x1000 时，CPU 会把从 0x1000 开始的一整个 Cache Line（通常 64 字节）一次性拉进 L1 缓存。

1
┌─────────────────────────────────────────────────────────┐
2
│                  CPU Cache Line (64 bytes)              │
3
│  arr[0]  arr[1]  arr[2]  arr[3]  ...  arr[15]           │
4
│   4B      4B      4B      4B           4B               │
5
└─────────────────────────────────────────────────────────┘

当你遍历一个 int 数组时，访问 arr[0] 的同时，arr[1] 到 arr[15] 已经被预取到缓存了。接下来 15 次访问全部命中 L1 缓存——这就是 Spatial Locality（空间局部性）。

链表就享受不到这个待遇。链表的每个节点可能散布在内存的任意位置，每次访问下一个节点都可能引发 cache miss。这就是为什么：

💡 在数据量较小、遍历频繁的场景下，数组几乎总是比链表快——即使理论复杂度相同。

数组 vs 其他结构的定位#

需求	数组	链表	哈希表
随机访问	✅ O(1)	❌ O(n)	✅ O(1) 均摊
尾部插入	✅ O(1) 均摊	✅ O(1)	✅ O(1) 均摊
中间插入	❌ O(n)	✅ O(1)	N/A
内存连续	✅	❌	❌
缓存友好	✅✅✅	❌	❌

1.2 结构图解#

静态数组的内存布局#

block-beta columns 8 block:header:8 columns 1 h["std::array<int, 8> — 栈上连续内存"] end a0["arr[0]\n10"] a1["arr[1]\n20"] a2["arr[2]\n30"] a3["arr[3]\n40"] a4["arr[4]\n50"] a5["arr[5]\n60"] a6["arr[6]\n70"] a7["arr[7]\n80"]

地址计算公式：addr(arr[i]) = base_addr + i * sizeof(element)
这就是 O(1) 随机访问的本质——一次乘法 + 一次加法。

`std::vector` 的内部结构#

std::vector 是 C++ 中最常用的动态数组。它在堆上管理一段连续内存，用三个指针维护状态：

graph LR subgraph "std::vector 对象 (栈上, 24 bytes)" begin["_begin\n(指向首元素)"] end_ptr["_end\n(指向最后一个元素的下一位)"] cap["_capacity\n(指向分配空间的末尾)"] end subgraph "堆上连续内存" e0["10"] --- e1["20"] --- e2["30"] --- e3["40"] --- e4["?"] --- e5["?"] end begin --> e0 end_ptr --> e4 cap --> e5 style e4 fill:#555,stroke:#888,color:#aaa style e5 fill:#555,stroke:#888,color:#aaa

1
size()     = _end - _begin      // 当前元素个数 = 4
2
capacity() = _capacity - _begin // 已分配容量 = 6

多维数组的内存布局#

C/C++ 的多维数组采用 Row-Major（行优先） 布局：

block-beta columns 6 block:title:6 columns 1 t["int matrix[2][3] — Row-Major Layout"] end m00["[0][0]"] m01["[0][1]"] m02["[0][2]"] m10["[1][0]"] m11["[1][1]"] m12["[1][2]"] block:row0:3 columns 1 r0["← Row 0 →"] end block:row1:3 columns 1 r1["← Row 1 →"] end

💡 面试考点：遍历二维数组时，按行遍历（for i → for j）远比按列遍历快，因为行内元素在内存中连续，缓存命中率高。这是一道经典的性能优化面试题。

1.3 C++ 底层实现#

1.3.1 `std::array` —— 编译期固定大小#

std::array 是 C++11 引入的对原始 C 数组的封装。它在栈上分配，大小编译期确定：

1
#include <array>
2

3
// 底层本质上就是一个结构体包了一个 C 数组
4
// 简化版实现：
5
template <typename T, std::size_t N>
6
struct array {
7
    T _data[N];  // 就这么简单
8

9
    // 接口包装
10
    constexpr T& operator[](std::size_t i) { return _data[i]; }
11
    constexpr std::size_t size() const noexcept { return N; }
12
    constexpr T* data() noexcept { return _data; }
13

14
    // 迭代器
15
    constexpr T* begin() noexcept { return _data; }
16
    constexpr T* end() noexcept { return _data + N; }
17
};

为什么用 std::array 而不用 C 数组？

1
// C 数组的问题
2
int arr[5] = {1, 2, 3, 4, 5};
3
// 传递给函数时退化成指针，丢失大小信息：
4
void foo(int arr[]) {  // 实际上是 int*，sizeof(arr) == 8 ❌
5
    // 无法知道数组长度！
6
}
7

8
// std::array 不退化
9
std::array<int, 5> arr = {1, 2, 3, 4, 5};
10
void foo(const std::array<int, 5>& arr) {
11
    arr.size();  // ✅ 始终知道长度是 5
12
}

何时用 std::array？

大小编译期已知且不会变
需要栈分配（高性能、无堆开销）
需要 STL 接口（begin/end、size）

1.3.2 `std::vector` —— 动态数组，C++ 的瑞士军刀#

std::vector 是 C++ 最常用的容器，没有之一。面试中 90% 的数组题都用它。

核心源码解析（精简版）#

1
template <typename T, typename Allocator = std::allocator<T>>
2
class vector {
3
    T* _begin;      // 指向第一个元素
4
    T* _end;        // 指向最后一个元素的下一位（past-the-end）
5
    T* _capacity;   // 指向分配内存块的末尾
6

7
public:
8
    // ========== 容量查询 ==========
9
    size_t size() const     { return _end - _begin; }
10
    size_t capacity() const { return _capacity - _begin; }
11
    bool empty() const      { return _begin == _end; }
12

13
    // ========== 元素访问 ==========
14
    T& operator[](size_t i)       { return _begin[i]; }        // 不做越界检查！
15
    T& at(size_t i) {                                           // 做越界检查
16
        if (i >= size()) throw std::out_of_range("vector::at");
17
        return _begin[i];
18
    }
19
    T& front() { return *_begin; }
20
    T& back()  { return *(_end - 1); }
21
    T* data()  { return _begin; }       // 获取底层裸指针
22

23
    // ========== 尾部操作 ==========
24
    void push_back(const T& value) {
25
        if (_end == _capacity) {
26
            // 容量不足，需要扩容！
27
            _grow();
28
        }
29
        *_end = value;      // 在 _end 位置构造元素
30
        ++_end;             // 移动 _end 指针
31
    }
32

33
    void pop_back() {
34
        --_end;
35
        _end->~T();          // 析构最后一个元素
36
    }
37

38
    // ========== 扩容核心 ==========
39
    void _grow() {
40
        size_t old_cap = capacity();
41
        size_t new_cap = old_cap == 0 ? 1 : old_cap * 2;  // GCC: 2倍扩容
42
        // MSVC: old_cap + old_cap / 2 (1.5倍扩容)
43

44
        T* new_buf = allocator.allocate(new_cap);          // 1. 分配新内存
45
        std::uninitialized_move(_begin, _end, new_buf);    // 2. 移动旧元素
46
        allocator.deallocate(_begin, old_cap);              // 3. 释放旧内存
47

48
        _end = new_buf + size();
49
        _begin = new_buf;
50
        _capacity = new_buf + new_cap;
51
    }
52
};

扩容策略深入#

编译器	扩容因子	示例 (初始 cap=1)
GCC (libstdc++)	2x	1 → 2 → 4 → 8 → 16 → 32
MSVC (STL)	1.5x	1 → 2 → 3 → 4 → 6 → 9 → 13
Clang (libc++)	2x	同 GCC

为什么 MSVC 用 1.5x？

2x 扩容有个问题：每次扩容后释放的旧内存块之和 (1 + 2 + 4 + … + n/2 = n - 1) 永远小于新分配的块 (n)，所以旧内存永远不够重用。

1.5x 扩容在几轮之后，旧内存块的总和会超过新块大小，内存分配器有机会重用旧内存，减少内存碎片。

💡 面试中的表述：“vector 扩容时会分配新的更大的内存块，将元素搬过去，释放旧空间。时间复杂度均摊 O(1)，但单次扩容是 O(n)。如果能预估大小，应该提前 reserve 避免多次扩容。“

`reserve` vs `resize` vs `shrink_to_fit`#

1
std::vector<int> v;
2

3
// reserve：只分配内存，不创建元素
4
v.reserve(1000);      // capacity = 1000, size = 0
5
                       // 适合：你知道大概要放多少元素
6

7
// resize：改变 size，可能创建/销毁元素
8
v.resize(100);         // capacity = 1000, size = 100
9
                       // 新元素被值初始化（int → 0）
10

11
v.resize(50);          // capacity = 1000, size = 50
12
                       // 后 50 个元素被析构
13

14
// shrink_to_fit：尝试释放多余内存（非强制）
15
v.shrink_to_fit();     // capacity ≈ 50, size = 50 (实现可以忽略这个请求)

迭代器失效 —— 面试重灾区#

什么操作会导致迭代器失效？

操作	是否失效	原因
`push_back` (未扩容)	只有 `end()` 失效	end 后移
`push_back` (触发扩容)	全部失效	内存地址变了！
`insert`	全部失效	可能扩容 + 元素搬移
`erase`	被删元素及之后的失效	后面的元素前移
`clear`	全部失效	所有元素被销毁
`reserve` (扩容时)	全部失效	内存地址变了
`operator[]` / `at`	不失效	只读/写元素

1
// ❌ 经典错误：遍历中删除元素
2
std::vector<int> v = {1, 2, 3, 4, 5};
3
for (auto it = v.begin(); it != v.end(); ++it) {
4
    if (*it % 2 == 0) {
5
        v.erase(it);  // it 失效了！下一次 ++it 是未定义行为
6
    }
7
}
8

9
// ✅ 正确做法 1：erase 返回下一个有效迭代器
10
for (auto it = v.begin(); it != v.end(); ) {
11
    if (*it % 2 == 0) {
12
        it = v.erase(it);  // erase 返回下一个有效迭代器
13
    } else {
14
        ++it;
15
    }
16
}
17

18
// ✅ 正确做法 2：erase-remove idiom (C++17 更推荐 std::erase_if)
19
v.erase(std::remove_if(v.begin(), v.end(),
20
    [](int x) { return x % 2 == 0; }), v.end());
21

22
// ✅ 正确做法 3：C++20 std::erase_if
23
std::erase_if(v, [](int x) { return x % 2 == 0; });

1.3.3 Ring Buffer（环形缓冲区）#

环形缓冲区是一种逻辑上首尾相连的数组。它用两个指针 head 和 tail 来标记读写位置，写满时覆盖最旧的数据（或阻塞）。

graph TD subgraph "Ring Buffer (capacity = 8)" direction LR s0["[0] "] --- s1["[1] "] --- s2["[2]\nhead→"] --- s3["[3]\ndata"] --- s4["[4]\ndata"] --- s5["[5]\n←tail"] --- s6["[6] "] --- s7["[7] "] end style s2 fill:#2d6a4f,stroke:#40916c,color:white style s3 fill:#1b4332,stroke:#2d6a4f,color:white style s4 fill:#1b4332,stroke:#2d6a4f,color:white style s5 fill:#d00000,stroke:#e85d04,color:white

手写实现#

1
#include <array>
2
#include <optional>
3
#include <stdexcept>
4

5
template <typename T, std::size_t Capacity>
6
class RingBuffer {
7
    static_assert(Capacity > 0, "Capacity must be > 0");
8

9
    std::array<T, Capacity> _buf{};
10
    std::size_t _head = 0;     // 下一个读取的位置
11
    std::size_t _tail = 0;     // 下一个写入的位置
12
    std::size_t _size = 0;     // 当前元素数量
13

14
    std::size_t _next(std::size_t idx) const {
15
        return (idx + 1) % Capacity;
16
    }
17

18
public:
19
    // 写入：如果满了，覆盖最旧的元素（适用于游戏帧缓冲场景）
20
    void push(const T& value) {
21
        _buf[_tail] = value;
22
        _tail = _next(_tail);
23

24
        if (_size == Capacity) {
25
            _head = _next(_head);  // 满了，头部前移（丢弃最旧元素）
26
        } else {
27
            ++_size;
28
        }
29
    }
30

31
    // 读取并移除最旧的元素
32
    std::optional<T> pop() {
33
        if (_size == 0) return std::nullopt;
34

35
        T value = _buf[_head];
36
        _head = _next(_head);
37
        --_size;
38
        return value;
39
    }
40

41
    // 查看最旧的元素（不移除）
42
    std::optional<T> peek() const {
43
        if (_size == 0) return std::nullopt;
44
        return _buf[_head];
45
    }
46

47
    std::size_t size() const { return _size; }
48
    bool empty() const { return _size == 0; }
49
    bool full() const { return _size == Capacity; }
50

51
    // 获取倒数第 n 个元素（0 = 最新的）
52
    // 游戏中常用：获取"前 N 帧"的数据
53
    const T& back(std::size_t n = 0) const {
54
        if (n >= _size) throw std::out_of_range("RingBuffer::back");
55
        std::size_t idx = (_tail + Capacity - 1 - n) % Capacity;
56
        return _buf[idx];
57
    }
58
};

使用示例：

1
RingBuffer<float, 120> frame_times;  // 存储最近 120 帧的帧时间
2

3
// 每帧写入
4
frame_times.push(delta_time);
5

6
// 计算最近 60 帧的平均帧时间
7
float sum = 0.0f;
8
int count = std::min<int>(60, frame_times.size());
9
for (int i = 0; i < count; ++i) {
10
    sum += frame_times.back(i);
11
}
12
float avg_frame_time = sum / count;
13
float fps = 1.0f / avg_frame_time;

1.4 复杂度速查表#

`std::array` 复杂度#

操作	时间复杂度	说明
`operator[]` / `at`	O(1)	直接地址计算
`front` / `back`	O(1)
`size`	O(1)	编译期常量
`fill`	O(n)	遍历赋值
`swap`	O(n)	逐元素交换（不是指针交换！）

`std::vector` 复杂度#

操作	时间复杂度	备注
`operator[]` / `at`	O(1)	随机访问核心优势
`front` / `back`	O(1)
`push_back`	O(1) 均摊	单次扩容 O(n)，但均摊 O(1)
`pop_back`	O(1)
`insert` (中间)	O(n)	需要搬移后续元素
`erase` (中间)	O(n)	需要搬移后续元素
`insert` / `erase` (尾部)	O(1)
`find` (线性查找)	O(n)	无序数组
`binary_search` (已排序)	O(log n)	需要先排序
`sort`	O(n log n)	`std::sort` 用 introsort
`reserve`	O(n) 或 O(1)	扩容时 O(n)，不扩容时 O(1)
`resize`	O(n)	可能构造/析构元素
`clear`	O(n)	析构所有元素 (POD 类型优化为 O(1))
`swap`	O(1)	只交换三个指针！

横向对比#

操作	`array`	`vector`	`deque`	`list`
随机访问	O(1)	O(1)	O(1)	❌ O(n)
头部插入	❌	O(n)	O(1)	O(1)
尾部插入	❌	O(1)*	O(1)*	O(1)
中间插入	❌	O(n)	O(n)	O(1)**
缓存友好	✅✅✅	✅✅✅	✅✅	❌
内存连续	✅	✅	分段连续	❌

* 均摊 O(1)
** 指已有迭代器的情况

1.5 面试高频题#

1.5.1 双指针 —— 数组面试的万金油#

双指针是数组题中出现频率最高的技巧。根据指针移动方向可分为：

同向双指针（快慢指针）：一个走得快，一个走得慢
对向双指针：从两端向中间移动
背向双指针：从中间向两端扩散（常用于回文）

经典题：移除元素 (LeetCode 27)#

给定数组 nums 和值 val，原地移除所有值等于 val 的元素，返回新长度。

1
// 快慢指针：slow 维护结果数组的边界
2
int removeElement(std::vector<int>& nums, int val) {
3
    int slow = 0;
4
    for (int fast = 0; fast < nums.size(); ++fast) {
5
        if (nums[fast] != val) {
6
            nums[slow] = nums[fast];
7
            ++slow;
8
        }
9
    }
10
    return slow;  // slow 就是新长度
11
}
12
// 时间 O(n), 空间 O(1)

经典题：两数之和 II — 有序数组 (LeetCode 167)#

给定已排序数组，找到两个数使其和等于 target。

1
// 对向双指针：利用有序性收缩范围
2
std::vector<int> twoSum(std::vector<int>& nums, int target) {
3
    int left = 0, right = nums.size() - 1;
4
    while (left < right) {
5
        int sum = nums[left] + nums[right];
6
        if (sum == target) {
7
            return {left + 1, right + 1};  // 题目要求 1-indexed
8
        } else if (sum < target) {
9
            ++left;   // 和太小，左指针右移
10
        } else {
11
            --right;  // 和太大，右指针左移
12
        }
13
    }
14
    return {};  // 题目保证有解
15
}
16
// 时间 O(n), 空间 O(1)

经典题：三数之和 (LeetCode 15)#

找出数组中所有和为 0 的三元组，不能重复。

1
std::vector<std::vector<int>> threeSum(std::vector<int>& nums) {
2
    std::vector<std::vector<int>> result;
3
    std::sort(nums.begin(), nums.end());  // 必须先排序
4

5
    for (int i = 0; i < static_cast<int>(nums.size()) - 2; ++i) {
6
        if (i > 0 && nums[i] == nums[i - 1]) continue;  // 跳过重复
7

8
        int left = i + 1, right = static_cast<int>(nums.size()) - 1;
9
        while (left < right) {
10
            int sum = nums[i] + nums[left] + nums[right];
11
            if (sum == 0) {
12
                result.push_back({nums[i], nums[left], nums[right]});
13
                // 跳过重复值
14
                while (left < right && nums[left] == nums[left + 1]) ++left;
15
                while (left < right && nums[right] == nums[right - 1]) --right;
16
                ++left; --right;
17
            } else if (sum < 0) {
18
                ++left;
19
            } else {
20
                --right;
21
            }
22
        }
23
    }
24
    return result;
25
}
26
// 时间 O(n²), 空间 O(1)（不算结果）

1.5.2 滑动窗口 —— 子数组/子串问题的利器#

滑动窗口的核心思想：维护一个窗口 [left, right]，right 扩大窗口探索新元素，left 收缩窗口满足约束。

模板#

1
int slidingWindow(std::vector<int>& nums) {
2
    int left = 0;
3
    int result = 0;
4
    // 窗口状态（根据题目定义，可能是 sum、map、set 等）
5
    int window_state = 0;
6

7
    for (int right = 0; right < nums.size(); ++right) {
8
        // 1. 扩大窗口：加入 nums[right]
9
        window_state += nums[right];
10

11
        // 2. 收缩窗口：当窗口不满足条件时
12
        while (/* 窗口不满足条件 */) {
13
            window_state -= nums[left];
14
            ++left;
15
        }
16

17
        // 3. 更新结果
18
        result = std::max(result, right - left + 1);
19
    }
20
    return result;
21
}

经典题：长度最小的子数组 (LeetCode 209)#

找到元素和 ≥ target 的最短连续子数组长度。

1
int minSubArrayLen(int target, std::vector<int>& nums) {
2
    int left = 0;
3
    int sum = 0;
4
    int min_len = INT_MAX;
5

6
    for (int right = 0; right < nums.size(); ++right) {
7
        sum += nums[right];
8

9
        while (sum >= target) {
10
            min_len = std::min(min_len, right - left + 1);
11
            sum -= nums[left];
12
            ++left;
13
        }
14
    }
15
    return min_len == INT_MAX ? 0 : min_len;
16
}
17
// 时间 O(n), 空间 O(1)

经典题：最大连续 1 的个数 III (LeetCode 1004)#

最多翻转 k 个 0 为 1，求最长连续 1 的长度。

1
int longestOnes(std::vector<int>& nums, int k) {
2
    int left = 0, zeros = 0, max_len = 0;
3

4
    for (int right = 0; right < nums.size(); ++right) {
5
        if (nums[right] == 0) ++zeros;
6

7
        while (zeros > k) {
8
            if (nums[left] == 0) --zeros;
9
            ++left;
10
        }
11

12
        max_len = std::max(max_len, right - left + 1);
13
    }
14
    return max_len;
15
}
16
// 时间 O(n), 空间 O(1)

1.5.3 前缀和 —— 区间查询的预处理神器#

核心思想：预处理一个前缀和数组 prefix[i] = nums[0] + nums[1] + ... + nums[i-1]，然后任意区间 [l, r] 的和 = prefix[r+1] - prefix[l]，O(1) 查询。

graph LR subgraph "原数组 nums" n0["2"] --- n1["3"] --- n2["1"] --- n3["4"] --- n4["5"] end subgraph "前缀和 prefix" p0["0"] --- p1["2"] --- p2["5"] --- p3["6"] --- p4["10"] --- p5["15"] end n0 -.-> p1 n1 -.-> p2 n4 -.-> p5

1
区间 [1, 3] 的和 = prefix[4] - prefix[1] = 10 - 2 = 8
2
验证：nums[1] + nums[2] + nums[3] = 3 + 1 + 4 = 8 ✅

经典题：和为 K 的子数组 (LeetCode 560)#

统计数组中和为 k 的连续子数组个数。

1
int subarraySum(std::vector<int>& nums, int k) {
2
    // 前缀和 + 哈希表
3
    // 核心：如果 prefix[j] - prefix[i] == k，说明 [i, j) 的和为 k
4
    //       即寻找之前出现过的 prefix[i] == prefix[j] - k
5
    std::unordered_map<int, int> prefix_count;
6
    prefix_count[0] = 1;  // 前缀和为 0 出现 1 次（空子数组）
7

8
    int sum = 0, count = 0;
9
    for (int num : nums) {
10
        sum += num;
11
        if (prefix_count.count(sum - k)) {
12
            count += prefix_count[sum - k];
13
        }
14
        ++prefix_count[sum];
15
    }
16
    return count;
17
}
18
// 时间 O(n), 空间 O(n)

1.5.4 原地操作 —— 空间 O(1) 的艺术#

面试官常见追问：“能不能不用额外空间？“

经典题：旋转数组 (LeetCode 189)#

将数组右移 k 位。[1,2,3,4,5,6,7]，k=3 → [5,6,7,1,2,3,4]

1
// 三次翻转法：优雅且 O(1) 空间
2
void rotate(std::vector<int>& nums, int k) {
3
    int n = nums.size();
4
    k %= n;  // 处理 k > n 的情况
5

6
    // [1,2,3,4,5,6,7]
7
    std::reverse(nums.begin(), nums.end());            // [7,6,5,4,3,2,1]
8
    std::reverse(nums.begin(), nums.begin() + k);      // [5,6,7,4,3,2,1]
9
    std::reverse(nums.begin() + k, nums.end());        // [5,6,7,1,2,3,4]
10
}
11
// 时间 O(n), 空间 O(1)

💡 面试技巧：三次翻转法适用于所有”循环移位”类问题。记住这个模式：全翻 → 前段翻 → 后段翻。

经典题：螺旋矩阵 (LeetCode 54)#

按螺旋顺序输出矩阵中的所有元素。

1
std::vector<int> spiralOrder(std::vector<std::vector<int>>& matrix) {
2
    std::vector<int> result;
3
    if (matrix.empty()) return result;
4

5
    int top = 0, bottom = matrix.size() - 1;
6
    int left = 0, right = matrix[0].size() - 1;
7

8
    while (top <= bottom && left <= right) {
9
        // → 从左到右（上边界）
10
        for (int j = left; j <= right; ++j)
11
            result.push_back(matrix[top][j]);
12
        ++top;
13

14
        // ↓ 从上到下（右边界）
15
        for (int i = top; i <= bottom; ++i)
16
            result.push_back(matrix[i][right]);
17
        --right;
18

19
        // ← 从右到左（下边界）
20
        if (top <= bottom) {
21
            for (int j = right; j >= left; --j)
22
                result.push_back(matrix[bottom][j]);
23
            --bottom;
24
        }
25

26
        // ↑ 从下到上（左边界）
27
        if (left <= right) {
28
            for (int i = bottom; i >= top; --i)
29
                result.push_back(matrix[i][left]);
30
            ++left;
31
        }
32
    }
33
    return result;
34
}
35
// 时间 O(m*n), 空间 O(1)（不算结果）

1.5.5 二分查找 —— 有序数组的终极武器#

二分查找本质上就是利用数组的随机访问 + 有序性。

标准模板（左闭右闭）#

1
// 找 target，返回下标，不存在返回 -1
2
int binarySearch(const std::vector<int>& nums, int target) {
3
    int left = 0, right = nums.size() - 1;  // [left, right] 左闭右闭
4

5
    while (left <= right) {  // 注意：<= 因为 right 是闭区间
6
        int mid = left + (right - left) / 2;  // 防溢出写法
7
        if (nums[mid] == target) {
8
            return mid;
9
        } else if (nums[mid] < target) {
10
            left = mid + 1;
11
        } else {
12
            right = mid - 1;
13
        }
14
    }
15
    return -1;
16
}

`lower_bound` / `upper_bound` —— 工程中更常用#

1
// lower_bound：找第一个 >= target 的位置
2
// upper_bound：找第一个 >  target 的位置
3
// 二者的差就是 target 出现的次数
4

5
auto it_lo = std::lower_bound(v.begin(), v.end(), target);
6
auto it_hi = std::upper_bound(v.begin(), v.end(), target);
7
int count = it_hi - it_lo;  // target 在数组中出现了多少次

经典题：搜索旋转排序数组 (LeetCode 33)#

在被旋转过的有序数组中搜索。[4,5,6,7,0,1,2]，target=0 → 返回 4。

1
int search(std::vector<int>& nums, int target) {
2
    int left = 0, right = nums.size() - 1;
3

4
    while (left <= right) {
5
        int mid = left + (right - left) / 2;
6
        if (nums[mid] == target) return mid;
7

8
        // 判断哪一半是有序的
9
        if (nums[left] <= nums[mid]) {
10
            // 左半段有序
11
            if (nums[left] <= target && target < nums[mid]) {
12
                right = mid - 1;  // target 在左半段
13
            } else {
14
                left = mid + 1;   // target 在右半段
15
            }
16
        } else {
17
            // 右半段有序
18
            if (nums[mid] < target && target <= nums[right]) {
19
                left = mid + 1;   // target 在右半段
20
            } else {
21
                right = mid - 1;  // target 在左半段
22
            }
23
        }
24
    }
25
    return -1;
26
}
27
// 时间 O(log n), 空间 O(1)

1.5.6 Kadane 算法 —— 最大子数组和 (LeetCode 53)#

给定整数数组，找到具有最大和的连续子数组，返回其最大和。

这道题是 DP / 贪心的经典入门，也是数组类面试的高频题之一。

1
int maxSubArray(std::vector<int>& nums) {
2
    int max_sum = nums[0];
3
    int current_sum = nums[0];
4

5
    for (int i = 1; i < static_cast<int>(nums.size()); ++i) {
6
        // 核心决策：接上前面的子数组，还是从"我"重新开始？
7
        current_sum = std::max(nums[i], current_sum + nums[i]);
8
        max_sum = std::max(max_sum, current_sum);
9
    }
10
    return max_sum;
11
}
12
// 时间 O(n), 空间 O(1)

图解推演（nums = [-2, 1, -3, 4, -1, 2, 1, -5, 4]）：

1
i=0: current=-2, max=-2
2
i=1: max(1, -2+1=−1) → current=1,  max=1
3
i=2: max(-3, 1-3=−2)  → current=−2, max=1
4
i=3: max(4, -2+4=2)   → current=4,  max=4    ← 从 4 重新开始
5
i=4: max(-1, 4-1=3)   → current=3,  max=4
6
i=5: max(2, 3+2=5)    → current=5,  max=5
7
i=6: max(1, 5+1=6)    → current=6,  max=6    ★ 最终答案
8
i=7: max(-5, 6-5=1)   → current=1,  max=6
9
i=8: max(4, 1+4=5)    → current=5,  max=6
10

11
答案：6（子数组 [4, -1, 2, 1]）

💡 Kadane 算法的本质：在每个位置做一个贪心决策——如果前面累积的 current_sum 是负数，它对后续元素只有”拖后腿”的作用，不如直接抛弃，从当前元素重新开始。

面试追问：

“如果要返回子数组本身而不只是和？” → 记录 start 和 end 下标
“如果要求最小子数组和？” → 把 max 换成 min
“如果数组是环形的？(LC 918)” → max(普通 Kadane, total_sum - 最小子数组和)

1.5.7 合并区间 (LeetCode 56)#

给出若干个区间，合并所有重叠的区间。

1
std::vector<std::vector<int>> merge(std::vector<std::vector<int>>& intervals) {
2
    if (intervals.empty()) return {};
3

4
    // 1. 按起点排序
5
    std::sort(intervals.begin(), intervals.end(),
6
        [](const auto& a, const auto& b) { return a[0] < b[0]; });
7

8
    std::vector<std::vector<int>> merged;
9
    merged.push_back(intervals[0]);
10

11
    for (int i = 1; i < static_cast<int>(intervals.size()); ++i) {
12
        auto& last = merged.back();
13

14
        if (intervals[i][0] <= last[1]) {
15
            // 有重叠 → 合并（取更大的右端点）
16
            last[1] = std::max(last[1], intervals[i][1]);
17
        } else {
18
            // 无重叠 → 直接加入
19
            merged.push_back(intervals[i]);
20
        }
21
    }
22
    return merged;
23
}
24
// 时间 O(n log n) 排序主导, 空间 O(n) 结果

图解：

1
输入: [[1,3],[2,6],[8,10],[15,18]]
2
排序后: [[1,3],[2,6],[8,10],[15,18]]
3

4
[1,3] → merged = [[1,3]]
5
[2,6] → 2 <= 3 重叠！ merged = [[1,6]]    ← max(3,6)=6
6
[8,10] → 8 > 6 不重叠  merged = [[1,6],[8,10]]
7
[15,18] → 15 > 10 不重叠 merged = [[1,6],[8,10],[15,18]]

💡 扩展思考：合并区间是”扫描线思想”的简化版。扫描线算法在游戏开发中用于碰撞检测（Sweep Line / Sweep and Prune）——按 X 轴排序后线性扫描，快速剔除不可能碰撞的对象对。

1.5.8 除自身以外数组的乘积 (LeetCode 238)#

给定数组 nums，返回数组 answer，其中 answer[i] 等于 nums 中除 nums[i] 之外其余各元素的乘积。不能用除法，要求 O(n)。

1
std::vector<int> productExceptSelf(std::vector<int>& nums) {
2
    int n = nums.size();
3
    std::vector<int> answer(n, 1);
4

5
    // 第一遍：从左到右，计算前缀积
6
    int prefix = 1;
7
    for (int i = 0; i < n; ++i) {
8
        answer[i] = prefix;
9
        prefix *= nums[i];
10
    }
11

12
    // 第二遍：从右到左，乘上后缀积
13
    int suffix = 1;
14
    for (int i = n - 1; i >= 0; --i) {
15
        answer[i] *= suffix;
16
        suffix *= nums[i];
17
    }
18

19
    return answer;
20
}
21
// 时间 O(n), 空间 O(1)（answer 不算额外空间）

图解（nums = [1, 2, 3, 4]）：

1
第一遍（前缀积）:
2
  answer = [1, 1, 2, 6]
3
  含义: answer[i] = nums[0] × ... × nums[i-1]
4

5
第二遍（后缀积）:
6
  i=3: answer[3] = 6 × 1 = 6,   suffix = 4
7
  i=2: answer[2] = 2 × 4 = 8,   suffix = 12
8
  i=1: answer[1] = 1 × 12 = 12, suffix = 24
9
  i=0: answer[0] = 1 × 24 = 24, suffix = 24
10

11
  answer = [24, 12, 8, 6] ✅
12
  验证: 2×3×4=24, 1×3×4=12, 1×2×4=8, 1×2×3=6 ✅

💡 前缀和/前缀积的一般化：前缀和解决”区间求和”，前缀积解决”排除某个元素的乘积”。本质上都是利用可逆运算的预处理。加法可逆所以前缀和更常见，乘法遇到 0 就不可逆，所以 LC 238 要求不用除法。

1.5.9 面试题速查表#

题号	题目	核心技巧	难度
LC 1	两数之和	哈希表 (无序) / 双指针 (有序)	Easy
LC 15	三数之和	排序 + 对向双指针	Medium
LC 26	删除有序数组重复项	快慢指针	Easy
LC 27	移除元素	快慢指针	Easy
LC 33	搜索旋转排序数组	二分查找变种	Medium
LC 53	最大子数组和	Kadane 算法 / DP	Medium
LC 54	螺旋矩阵	边界收缩	Medium
LC 56	合并区间	排序 + 线性扫描	Medium
LC 128	最长连续序列	哈希集合	Medium
LC 189	旋转数组	三次翻转	Medium
LC 209	长度最小的子数组	滑动窗口	Medium
LC 238	除自身以外数组的乘积	前缀积 + 后缀积	Medium
LC 560	和为 K 的子数组	前缀和 + 哈希	Medium
LC 1004	最大连续 1 的个数 III	滑动窗口	Medium

1.6 🎮 实战场景#

1.6.1 Object Pool —— 对象池（游戏开发核心模式）#

问题：射击游戏中，子弹频繁创建和销毁。每次 new/delete 都有堆分配开销，且频繁分配会产生内存碎片，导致 cache 性能下降。

解决方案：用数组预分配一批对象，复用而非销毁。

1
#include <array>
2
#include <bitset>
3
#include <optional>
4
#include <cassert>
5

6
template <typename T, std::size_t PoolSize>
7
class ObjectPool {
8
    std::array<T, PoolSize> _pool;           // 预分配的对象数组
9
    std::bitset<PoolSize>   _active;         // 标记哪些位置正在使用
10
    std::size_t             _next_free = 0;  // 下一个可能空闲的位置（hint）
11

12
public:
13
    struct Handle {
14
        std::size_t index;
15
        bool valid() const { return index < PoolSize; }
16
    };
17

18
    static constexpr Handle INVALID = Handle{PoolSize};
19

20
    // 获取一个空闲对象
21
    Handle acquire() {
22
        // 从 hint 位置开始找空闲槽
23
        for (std::size_t i = 0; i < PoolSize; ++i) {
24
            std::size_t idx = (_next_free + i) % PoolSize;
25
            if (!_active.test(idx)) {
26
                _active.set(idx);
27
                _pool[idx] = T{};  // 重置对象状态
28
                _next_free = (idx + 1) % PoolSize;
29
                return Handle{idx};
30
            }
31
        }
32
        return INVALID;  // 池满
33
    }
34

35
    // 归还对象
36
    void release(Handle h) {
37
        assert(h.valid() && _active.test(h.index));
38
        _active.reset(h.index);
39
        _next_free = h.index;  // 更新 hint
40
    }
41

42
    // 访问对象
43
    T& get(Handle h) {
44
        assert(h.valid() && _active.test(h.index));
45
        return _pool[h.index];
46
    }
47

48
    const T& get(Handle h) const {
49
        assert(h.valid() && _active.test(h.index));
50
        return _pool[h.index];
51
    }
52

53
    // 遍历所有活跃对象（批量 update 用）
54
    template <typename Func>
55
    void for_each_active(Func&& func) {
56
        for (std::size_t i = 0; i < PoolSize; ++i) {
57
            if (_active.test(i)) {
58
                func(_pool[i]);
59
            }
60
        }
61
    }
62

63
    std::size_t active_count() const { return _active.count(); }
64
    std::size_t capacity() const { return PoolSize; }
65
};

使用场景：

1
struct Bullet {
2
    float x = 0, y = 0;         // 位置
3
    float vx = 0, vy = 0;       // 速度
4
    float damage = 10.0f;
5
    float lifetime = 2.0f;      // 剩余存活时间
6
};
7

8
ObjectPool<Bullet, 1024> bullet_pool;  // 最多 1024 颗子弹
9

10
// 发射子弹
11
auto h = bullet_pool.acquire();
12
if (h.valid()) {
13
    auto& b = bullet_pool.get(h);
14
    b.x = player.x;
15
    b.y = player.y;
16
    b.vx = dir_x * speed;
17
    b.vy = dir_y * speed;
18
}
19

20
// 更新所有子弹
21
bullet_pool.for_each_active([&](Bullet& b) {
22
    b.x += b.vx * dt;
23
    b.y += b.vy * dt;
24
    b.lifetime -= dt;
25
});

为什么用数组而不是链表？

✅ 数组内存连续 → 遍历 update 时 cache 命中率极高
✅ 预分配 → 运行时零堆分配
✅ bitset 标记 → 分配/释放 O(1)（平均）
❌ 链表每个节点散布在堆中 → 遍历时频繁 cache miss

1.6.2 SoA vs AoS —— 数据布局优化#

这是游戏引擎和 ECS（Entity Component System）架构中的核心概念。

AoS（Array of Structures）—— 传统面向对象：

1
// 每个粒子的所有属性放在一起
2
struct Particle {
3
    float x, y, z;       // 位置
4
    float vx, vy, vz;    // 速度
5
    float r, g, b, a;    // 颜色
6
    float life;           // 剩余寿命
7
};
8

9
std::vector<Particle> particles(10000);  // 1 万个粒子
10

11
// 更新位置时：
12
for (auto& p : particles) {
13
    p.x += p.vx * dt;  // 访问 x 时，速度和颜色数据也被加载到 cache
14
    p.y += p.vy * dt;  // 浪费了大量 cache 空间
15
    p.z += p.vz * dt;
16
}

1
内存布局：[x y z vx vy vz r g b a life] [x y z vx vy vz r g b a life] ...
2
                ^^^^^^^^^^^^^^^^^^^^^^^^
3
                     一个 Particle (44 bytes)
4
每个 cache line (64B) 只能装 ~1.4 个粒子 → cache 利用率低

SoA（Structure of Arrays）—— 数据驱动设计：

1
// 把同类属性打包到一起
2
struct ParticleSystem {
3
    std::vector<float> x, y, z;       // 位置数组
4
    std::vector<float> vx, vy, vz;    // 速度数组
5
    std::vector<float> r, g, b, a;    // 颜色数组
6
    std::vector<float> life;           // 寿命数组
7
    std::size_t count = 0;
8

9
    void resize(std::size_t n) {
10
        x.resize(n); y.resize(n); z.resize(n);
11
        vx.resize(n); vy.resize(n); vz.resize(n);
12
        r.resize(n); g.resize(n); b.resize(n); a.resize(n);
13
        life.resize(n);
14
        count = n;
15
    }
16
};
17

18
ParticleSystem ps;
19
ps.resize(10000);
20

21
// 更新位置时：
22
for (std::size_t i = 0; i < ps.count; ++i) {
23
    ps.x[i] += ps.vx[i] * dt;  // x 数组连续 → 完美 cache line
24
    ps.y[i] += ps.vy[i] * dt;  // y 数组连续 → 完美 cache line
25
    ps.z[i] += ps.vz[i] * dt;  // z 数组连续 → 完美 cache line
26
}

1
内存布局：
2
x 数组：[x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 x11 x12 x13 x14 x15]
3
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
4
                              一个 cache line 装 16 个 float!
5
vx数组：[vx0 vx1 vx2 vx3 ...]   ← 同样连续

性能对比：

指标	AoS	SoA
只更新位置时的 cache 利用率	~27% (12B/44B)	100%
SIMD 向量化	❌ 困难	✅ 天然适合
代码可读性	✅ 直觉	❌ 需要用 index
添加新属性	✅ 加字段	❌ 加数组

💡 总结：AoS 适合属性之间强关联的场景（一次处理一个完整对象），SoA 适合批量处理单个属性的场景（如物理引擎、粒子系统、ECS）。

1.6.3 帧输入环形缓冲区#

格斗游戏、FPS 和网络同步中，常需要存储最近 N 帧的玩家输入，用于：

回放系统：录像功能
输入预测 & 回滚：GGPO 风格的 rollback netcode
连招检测：判断玩家是否在时间窗口内输入了特定序列

1
struct InputFrame {
2
    uint32_t frame_number;
3
    uint8_t  buttons;       // 按键位掩码
4
    float    stick_x;       // 摇杆 X
5
    float    stick_y;       // 摇杆 Y
6

7
    // 按键位定义
8
    static constexpr uint8_t BTN_ATTACK  = 1 << 0;
9
    static constexpr uint8_t BTN_JUMP    = 1 << 1;
10
    static constexpr uint8_t BTN_BLOCK   = 1 << 2;
11
    static constexpr uint8_t BTN_SPECIAL = 1 << 3;
12

13
    bool pressed(uint8_t btn) const { return buttons & btn; }
14
};
15

16
// 存储最近 300 帧的输入（60fps 下约 5 秒）
17
RingBuffer<InputFrame, 300> input_history;
18

19
// 每帧记录输入
20
void record_input(uint32_t frame, const InputFrame& input) {
21
    input_history.push(input);
22
}
23

24
// 连招检测：检查最近 N 帧是否匹配特定输入序列
25
bool detect_combo(const std::vector<uint8_t>& combo_sequence, int window_frames) {
26
    if (input_history.size() < combo_sequence.size()) return false;
27

28
    int combo_idx = combo_sequence.size() - 1;
29
    int frames_checked = 0;
30

31
    for (int i = 0; i < input_history.size() && frames_checked < window_frames; ++i) {
32
        const auto& frame = input_history.back(i);
33
        if (frame.pressed(combo_sequence[combo_idx])) {
34
            --combo_idx;
35
            if (combo_idx < 0) return true;  // 完整匹配！
36
        }
37
        ++frames_checked;
38
    }
39
    return false;
40
}
41

42
// 示例：检测"↓ → 攻击"（波动拳）
43
// detect_combo({BTN_DOWN, BTN_RIGHT, InputFrame::BTN_ATTACK}, 30);

1.6.4 Vertex Buffer —— GPU 数据传输#

GPU 渲染管线要求顶点数据以连续数组的形式传入。这是数组在图形编程中最直接的应用：

1
struct Vertex {
2
    float position[3];   // x, y, z
3
    float normal[3];     // 法线
4
    float texcoord[2];   // UV 坐标
5
};
6

7
// 顶点数据必须是连续数组 —— GPU 要求 DMA 读取连续内存
8
std::vector<Vertex> vertices;
9
std::vector<uint32_t> indices;  // 索引数组（EBO）
10

11
// OpenGL 上传到 GPU
12
GLuint vbo, ebo;
13
glGenBuffers(1, &vbo);
14
glBindBuffer(GL_ARRAY_BUFFER, vbo);
15
// data() 返回连续内存的裸指针 → 直接 DMA 传输给 GPU
16
glBufferData(GL_ARRAY_BUFFER,
17
    vertices.size() * sizeof(Vertex),
18
    vertices.data(),     // ← std::vector::data() 的价值所在
19
    GL_STATIC_DRAW);
20

21
glGenBuffers(1, &ebo);
22
glBindBuffer(GL_ELEMENT_ARRAY_BUFFER, ebo);
23
glBufferData(GL_ELEMENT_ARRAY_BUFFER,
24
    indices.size() * sizeof(uint32_t),
25
    indices.data(),
26
    GL_STATIC_DRAW);

💡 这也是为什么 std::vector::data() 方法如此重要——它提供了与 C API 和 GPU API 的桥梁。std::list 就无法做到这一点。

1.6.5 配置表 —— 策划数据的内存映射#

游戏中大量配置数据（道具表、技能表、怪物属性表等）通常由策划在 Excel 中编辑，导出为 CSV/JSON/Protobuf，运行时加载到内存数组中。

1
struct ItemConfig {
2
    int32_t     id;
3
    std::string name;
4
    int32_t     type;       // 0=消耗品, 1=装备, 2=材料
5
    int32_t     rarity;     // 1~5 星
6
    int32_t     price;
7
    float       effect_value;
8
};
9

10
class ItemTable {
11
    std::vector<ItemConfig> _items;  // 按 ID 排序的数组
12

13
public:
14
    // 加载后按 ID 排序，之后用二分查找
15
    void load(const std::string& path) {
16
        // ... 从 CSV/JSON 加载 ...
17
        std::sort(_items.begin(), _items.end(),
18
            [](const auto& a, const auto& b) { return a.id < b.id; });
19
    }
20

21
    // 二分查找 O(log n)
22
    const ItemConfig* find(int32_t id) const {
23
        auto it = std::lower_bound(_items.begin(), _items.end(), id,
24
            [](const ItemConfig& item, int32_t target) {
25
                return item.id < target;
26
            });
27
        if (it != _items.end() && it->id == id) {
28
            return &(*it);
29
        }
30
        return nullptr;
31
    }
32

33
    // 按类型筛选
34
    std::vector<const ItemConfig*> filter_by_type(int32_t type) const {
35
        std::vector<const ItemConfig*> result;
36
        for (const auto& item : _items) {
37
            if (item.type == type) {
38
                result.push_back(&item);
39
            }
40
        }
41
        return result;
42
    }
43

44
    std::size_t count() const { return _items.size(); }
45
};

为什么用 vector 而不是 unordered_map？

配置表加载后只读，不需要动态增删
排序后的数组可用二分查找 O(log n)，而 unordered_map 的常数开销在小数据量下可能更大
遍历筛选时，数组的 cache 友好性远优于哈希表
内存更紧凑，没有哈希桶的额外开销

1.7 本章小结#

核心要点#

概念	要点
内存模型	连续内存 → 地址计算 O(1) → cache 友好
`std::array`	栈上分配、编译期大小、零额外开销、不退化
`std::vector`	三指针 (`begin/end/capacity`)、均摊 O(1) 尾插、扩容搬迁
扩容策略	GCC 2x / MSVC 1.5x、知道原因 (内存碎片)
迭代器失效	扩容 → 全失效、erase → 当前及之后失效
Ring Buffer	逻辑环形、覆写最旧数据、帧缓冲/输入历史
SoA vs AoS	批量处理 → SoA (ECS)、个体操作 → AoS

面试 30 秒速答#

Q：数组和链表的区别？
A：数组是连续内存，支持 O(1) 随机访问，缓存友好，但中间插删需要 O(n) 搬移。链表是离散节点通过指针串联，已知位置的插删是 O(1)，但不支持随机访问，缓存不友好。在数据量不大、遍历为主的场景下，数组通常更快，因为 CPU cache line 的空间局部性。

Q：std::vector 的扩容机制？
A：当 push_back 导致 size == capacity 时，vector 会分配一块更大的新内存（通常 1.5x 或 2x），把旧元素移动过去，释放旧内存。单次扩容 O(n)，但均摊 O(1)。注意扩容后所有迭代器和指针全部失效。如果能预估大小，应该提前调用 reserve() 避免多次扩容。

Q：什么场景下数组比链表差？
A：频繁在中间或头部插入/删除时，数组每次需要 O(n) 搬移元素，链表只需 O(1) 改指针。但要注意，链表的 O(1) 插删前提是已经有目标位置的指针，查找目标位置本身是 O(n)。

📖 下一章：第二章链表：指针的艺术 —— 单链表、双链表、侵入式链表，以及为什么游戏引擎偏爱侵入式设计。

音乐

音乐

第一章 数组：最朴素的力量

第一章 数组 (Array)#