Skip to main content

Table 2 Parameter settings

From: Few-shot relation classification by context attention-based prototypical networks with BERT

Max length of a sentence64
Batch size1
Training classes for one batch8
Learning rate2e-5
Train iterations10000
Convolutional window size3
Hidden layer dimension dh768
Number of multihead12