
Question about the natural language understanding tasks #34

Open
JaheimLee opened this issue Jan 11, 2022 · 2 comments

Comments


JaheimLee commented Jan 11, 2022

Hi, I'd like to confirm something with you. When HuggingFace's `BertForSequenceClassification` class is used for text classification, it takes BERT's `pooled_output` and then feeds it into a final classifier layer. But your paper says: "We build the downstream models for the natural language understanding tasks by adding a linear classifier on top of the "[CLS]" token to predict label probabilities." Does that mean you use only BERT's [CLS] token, fed directly into the final classifier? Since your pretraining includes an NSP task, I'd like to confirm which approach you actually use for text classification. Thanks!
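For reference, the pooling step the question refers to applies a dense layer with a tanh activation to the [CLS] hidden state before the classifier. A minimal sketch in plain PyTorch (shapes and the 3-class head are made up for illustration, not taken from the paper):

```python
import torch
import torch.nn as nn

class PoolerSketch(nn.Module):
    """Sketch of the pooling step in HuggingFace's BertModel:
    take the hidden state at position 0 ([CLS]), then dense + tanh."""
    def __init__(self, hidden_size: int):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.activation = nn.Tanh()

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        cls_state = hidden_states[:, 0]  # (batch, hidden): the [CLS] position
        return self.activation(self.dense(cls_state))

# Dummy encoder output: batch of 2, seq_len 64, hidden size 768 (BERT-base).
hidden_states = torch.randn(2, 64, 768)
pooled = PoolerSketch(768)(hidden_states)  # what BertForSequenceClassification classifies
logits = nn.Linear(768, 3)(pooled)         # hypothetical 3-class task head
print(logits.shape)                        # torch.Size([2, 3])
```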

Ag2S1 (Contributor) commented Jan 11, 2022

pooled_output is just [CLS]; see the code.

JaheimLee (Author) commented:

> pooled_output is just [CLS]; see the code.

Its pooling operation is [CLS], then a dense layer, and then the task-specific classifier. I want to confirm whether you go from [CLS] directly to the task-specific classifier, without that dense layer in between. From your paper's description it sounds like there isn't one.
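The distinction being asked about can be sketched in a few lines of PyTorch (dummy shapes and a hypothetical 2-class head; which variant the authors used is exactly what the issue is asking, so both are shown as sketches, not as the authors' confirmed setup):

```python
import torch
import torch.nn as nn

hidden_states = torch.randn(2, 64, 768)  # dummy (batch, seq_len, hidden) encoder output
cls_state = hidden_states[:, 0]          # the [CLS] hidden state

# Variant A, as in HuggingFace's BertForSequenceClassification:
# [CLS] -> dense + tanh (the pooler) -> task classifier.
dense = nn.Linear(768, 768)
classifier_a = nn.Linear(768, 2)
logits_a = classifier_a(torch.tanh(dense(cls_state)))

# Variant B, a literal reading of the paper ("a linear classifier on top of
# the [CLS] token"): [CLS] -> task classifier, with no intermediate dense layer.
classifier_b = nn.Linear(768, 2)
logits_b = classifier_b(cls_state)

print(logits_a.shape, logits_b.shape)    # both torch.Size([2, 2])
```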
