Skip to content

关于预处理的问题 #10

Description

@mushan09

你好,我简单看了下BERT对中文是用字做embedding,我想请问下使用bert之后,是否还需要自己做一些文本清洗,去停词什么的。我现在是中文和英文混着的情感分析任务,跑通了你的代码后验证集大概90左右精度,请问对于精度提升还有什么建议吗?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions