## 英汉术语对照 鞍点,saddle point 变换,transform 编码器,encoder 标签,label 步幅,stride 参数,parameter 长短期记忆网络,long short-term memory (LSTM) 超参数,hyperparameter 层序softmax,hierarchical softmax 查准率,precision 成本,cost 词表,vocabulary 词嵌入,word embedding 词向量,word vector 词元,token 词元分析器,tokenizer 词元化,tokenize 汇聚层,pooling layer 稠密,dense 大小,size 导入,import 轮,epoch 暂退法,dropout 动量法,momentum (method) 独立同分布,independent and identically distributed (i.i.d.) 端到端,end-to-end 多层感知机,multilayer perceptron 多头注意力,multi-head attention 二元分类,binary classification 二元,bigram 子采样,subsample 发散,diverge 泛化,generalization 泛化误差,generalization error 方差,variance 分类,classification 分类器,classifier 负采样,negative sampling 感受野,receptive field 格拉姆矩阵,Gram matrix 共现,co-occurrence 广播,broadcast 规范化,normalization 过拟合,overfitting 核回归,kernel regression 恒等映射,identity mapping 假设,hypothesis 基准,baseline 激活函数,activation function 解码器,decoder 近似法,approximate method 经验风险最小化,empirical risk minimization 局部最小值,local minimum 卷积核,convolutional kernel 卷积神经网络,convolutional neural network 决策边界,decision boundary 均值,mean 均方误差,mean squared error 均匀采样,uniform sampling 块,block 困惑度,perplexity 拉普拉斯平滑,Laplace smoothing 连结,concatenate 类,class 交叉熵,cross-entropy 连续词袋,continous bag-of-words (CBOW) 零张量,zero tensor 流水线,pipeline 滤波器,filter 门控循环单元,gated recurrent units (GRU) 目标检测,object detection 偏置,bias 偏导数,partial derivative 偏移量,offset 批量,batch 齐普夫定律,Zipf's law 欠拟合,underfitting 情感分析,sentiment analysis 全连接层,fully-connected layer 权重,weight 三元,trigram 上采样,upsample 上下文变量,context variable 上下文窗口,context window 上下文词,context word 上下文向量,context vector 实例/示例,instance 收敛,converge 属性,property 数值方法,numerical method 数据集,dataset 数据示例,data instance 数据样例,data example 顺序分区,sequential partitioning softmax回归,softmax regression 随机采样,random sampling 损失函数,loss function 双向循环神经网络,bidirectional recurrent neural network 特征,feature 特征图,feature map 特征值,eigenvalue 梯度,gradient 梯度裁剪,gradient clipping 梯度消失,vanishing gradients 填充,padding 跳元模型,skip-gram model 调参,tune hyperparameter 停用词,stop words 通道,channel 凸优化,convex optimization 图像,image 未知词元,unknown token 无偏估计,unbiased estimate 误差,error 小批量,minibatch 小批量梯度,minibatch gradient 线性模型,linear model 线性回归,linear regression 协同过滤,collaborative filtering 学习率,learning rate 训练误差,training error 循环神经网络,recurrent neural network (RNN) 样例,example 一维梯度下降,gradient descent in one-dimensional space 一元,unigram 隐藏变量,hidden variable 隐藏层,hidden layer 优化器,optimizer 语料库,corpus 运算符,operator 自注意力,self-attention 真实值,ground truth 指标,metric 支持向量机,support vector machine 注意力机制,attention mechanism 注意力模型,attention model 注意力提示,attention cue 准确率/精度,accuracy