

news 2026/4/20 1:58:08

**Main components**

1. Define the attention layer: an `Attention_Layer` class that takes two parameters, `hidden_dim` (the hidden dimension) and `is_bi_rnn` (whether the RNN is bidirectional).
2. Define the forward pass: compute the attention weights and the output.
3. Prepare the data: generate a random dataset of 3 sentences, each with 10 words, each word with 128 features.
4. Instantiate the attention layer with `hidden_dim` and `is_bi_rnn`.
5. Run the forward pass: pass the data to the layer's `forward` method.
6. Analyze the result: take the attention weights of the first sentence.
7. Visualize the attention weights with matplotlib.

**Main functions and classes**

- `Attention_Layer` class: defines the structure of the attention layer and its forward pass.
- `forward` method: defines the forward pass of the attention layer.
- `torch.from_numpy` function: converts a NumPy array into a PyTorch tensor.
- matplotlib: used to visualize the attention weights.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import numpy as np
import matplotlib.pyplot as plt


# Define the attention layer
class Attention_Layer(nn.Module):
    def __init__(self, hidden_dim, is_bi_rnn):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.is_bi_rnn = is_bi_rnn
        # A bidirectional RNN concatenates both directions, doubling the feature size
        dim = hidden_dim * 2 if is_bi_rnn else hidden_dim
        self.Q_linear = nn.Linear(dim, dim, bias=False)
        self.K_linear = nn.Linear(dim, dim, bias=False)
        self.V_linear = nn.Linear(dim, dim, bias=False)

    def forward(self, inputs, lens):
        # inputs: (batch, max_len, dim); lens: true length of each sentence
        size = inputs.size()
        Q = self.Q_linear(inputs)
        K = self.K_linear(inputs).permute(0, 2, 1)  # (batch, dim, max_len)
        V = self.V_linear(inputs)

        max_len = max(lens)
        sentence_lengths = torch.tensor(lens)
        # True for real tokens, False for padding: (batch, max_len)
        mask = torch.arange(max_len)[None, :] < sentence_lengths[:, None]
        mask = mask.unsqueeze(dim=1)
        mask = mask.expand(size[0], max_len, max_len)

        alpha = torch.matmul(Q, K)
        # Padded key positions get a very negative score, so softmax sends them to 0
        padding_num = torch.full_like(alpha, -2.0 ** 31)
        alpha = torch.where(mask, alpha, padding_num)
        alpha = F.softmax(alpha, dim=2)
        out = torch.matmul(alpha, V)
        # out: (batch, max_len, dim); alpha: (batch, max_len, max_len)
        return out, alpha


# Prepare the data
data = np.random.rand(3, 10, 128)  # 3 sentences, 10 words each, 128 features per word
lens = [7, 10, 4]  # length of each sentence

# Instantiate the attention layer
hidden_dim = 64
is_bi_rnn = True
att_L = Attention_Layer(hidden_dim, is_bi_rnn)

# Forward pass
att_out, att_alpha = att_L(torch.from_numpy(data).float(), lens)

# Analyze the result: attention weights of the first sentence, shape (10, 10)
attention_weights = att_alpha[0, :, :].detach().numpy()

# Visualize the attention weights
plt.imshow(attention_weights, cmap='hot', interpolation='nearest')
plt.colorbar()
plt.show()
```
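
The padding mask is the least obvious step of the forward pass, so here is a minimal sketch of it on its own. It assumes toy lengths `[2, 3]` and all-zero scores so the effect is easy to read off. The helper name `length_mask` is hypothetical and not part of the original code.

```python
import torch
import torch.nn.functional as F


def length_mask(lens, max_len):
    """Hypothetical helper: True where a key position holds a real token."""
    lengths = torch.tensor(lens)
    # (1, max_len) < (batch, 1) broadcasts to (batch, max_len)
    valid = torch.arange(max_len)[None, :] < lengths[:, None]
    # Repeat the key mask for every query position: (batch, max_len, max_len)
    return valid[:, None, :].expand(len(lens), max_len, max_len)


lens = [2, 3]   # assumed toy sentence lengths
max_len = 3
mask = length_mask(lens, max_len)

# Uniform (all-zero) scores, so any difference in the weights comes from the mask
scores = torch.zeros(len(lens), max_len, max_len)
masked = torch.where(mask, scores, torch.full_like(scores, -2.0 ** 31))
weights = F.softmax(masked, dim=2)

print(weights[0])  # every row is 0.5, 0.5, 0.0: the third key is padding
print(weights[1])  # every row is 1/3, 1/3, 1/3: no padding in this sentence
```

Only key positions are masked here. Rows that belong to padded query positions still yield a valid distribution, so downstream code would need to ignore those rows if they matter.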


http://www.hkea.cn/news/14335282/