Why is the size V+1 in the embedding layer (Embedding(V+1, D)(i)), where V is the vocabulary size?

Question:

Suppose

from tensorflow.keras.preprocessing.text import Tokenizer
tokenizer = Tokenizer()
...
V = len(tokenizer.word_index)

where V is the vocabulary size.

I have been told that the embedding layer is

x = Embedding(V+1,D)(i)

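where D is the dimension of the output vector. But I am not sure why the size of the embedding layer has to be (V+1, D) rather than (V, D), especially since the indices in tokenizer.word_index start at 1 rather than 0, i.e.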

tokenizer.word_index
{'UNK': 1,
 'the': 2,
 ',': 3,
 '.': 4,
 'of': 5,
 'and': 6, 
...}

So the maximum index of tokenizer.word_index (a dictionary), if converted to a list, is actually V-1.

Why is the size V+1 in the embedding layer (Embedding(V+1, D)(i)) when the vocabulary size is V?
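
One way to check the index range being asked about is a minimal sketch in plain Python. It assumes a hypothetical six-word word_index shaped like the output shown above, and it uses a list of rows as a stand-in for the embedding matrix. The helpers make_table and lookup are illustrative names, not part of Keras, and this is not how the Embedding layer is implemented.

```python
# Hypothetical word_index, shaped like the Tokenizer output shown above
# (indices start at 1; index 0 is never assigned to a word).
word_index = {'UNK': 1, 'the': 2, ',': 3, '.': 4, 'of': 5, 'and': 6}

V = len(word_index)                    # number of words in the vocabulary
max_index = max(word_index.values())   # largest index actually used
D = 4                                  # embedding dimension, arbitrary here


def make_table(rows, dim):
    """Stand-in for an embedding matrix: one row of zeros per index."""
    return [[0.0] * dim for _ in range(rows)]


def lookup(table, idx):
    """Row lookup by integer index, as an embedding layer does conceptually."""
    return table[idx]


table_v = make_table(V, D)        # valid row indices: 0 .. V-1
table_v1 = make_table(V + 1, D)   # valid row indices: 0 .. V

# Try to look up the largest word index in the table with only V rows.
try:
    lookup(table_v, max_index)
    fits_in_v = True
except IndexError:
    fits_in_v = False

# The same lookup in the table with V+1 rows.
fits_in_v1 = len(lookup(table_v1, max_index)) == D

print(V, max_index, fits_in_v, fits_in_v1)
```

Under these assumptions the largest index equals V, so a table with V rows (indices 0 to V-1) has no row for it, while a table with V+1 rows covers indices 0 through V. The Keras Tokenizer reserves index 0 and never assigns it to a word, and by convention that index is used for padding.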

python tensorflow shape word-embedding
