2021.07.24
▲ Figure 1: Error types
Decoding strategy:
Paper title:
A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check
Paper link:
Code link:
Spelling Error Correction with Soft-Masked BERT
4.1 Query Correction
4.1.1 Baidu Chinese Error Correction
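1. Error detection: locate the error positions in the sentence and identify the type of each error.
2. Candidate recall: for each detected error position, recall candidate correct fragments.
3. Correction ranking: a deep & wide model. The deep part incorporates the contextual representation around the current error position. The wide part uses glyph and pronunciation, lexical, semantic, and user-behavior features to learn a multi-dimensional distance representation between the original word and each candidate. The candidates are then ranked with GBDT & LR.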
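The three stages above can be sketched as a small pipeline. This is only an illustrative toy under stated assumptions, not Baidu's implementation: the vocabulary `VOCAB`, the confusion set `CONFUSION`, and all function names are hypothetical, and the ranking step uses a simple vocabulary-coverage score as a stand-in for the deep & wide features and the GBDT & LR ranker.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy resources (assumptions, not from the original system).
VOCAB: Set[str] = {"今天", "天气", "很好"}
CONFUSION: Dict[str, List[str]] = {"汽": ["气", "器"]}


def covered_positions(text: str) -> Set[int]:
    """Return the character positions covered by some vocabulary word."""
    covered: Set[int] = set()
    for word in VOCAB:
        start = text.find(word)
        while start != -1:
            covered.update(range(start, start + len(word)))
            start = text.find(word, start + 1)
    return covered


def detect(text: str) -> List[int]:
    """Stage 1: flag positions that no vocabulary word covers."""
    covered = covered_positions(text)
    return [i for i in range(len(text)) if i not in covered]


def recall(text: str, pos: int) -> List[str]:
    """Stage 2: recall replacement candidates from the confusion set."""
    return CONFUSION.get(text[pos], [])


def rank(text: str, pos: int, candidates: List[str]) -> List[Tuple[str, int]]:
    """Stage 3: score each candidate by vocabulary coverage after substitution.

    A real ranker would combine context and distance features instead.
    """
    scored = []
    for cand in candidates:
        fixed = text[:pos] + cand + text[pos + 1:]
        scored.append((cand, len(covered_positions(fixed))))
    return sorted(scored, key=lambda item: item[1], reverse=True)


def correct_query(text: str) -> str:
    """Run detection, recall, and ranking; apply a candidate only if it helps."""
    for pos in detect(text):
        ranked = rank(text, pos, recall(text, pos))
        if ranked and ranked[0][1] > len(covered_positions(text)):
            # Substitutions keep the length, so detected positions stay valid.
            text = text[:pos] + ranked[0][0] + text[pos + 1:]
    return text


if __name__ == "__main__":
    print(correct_query("今天天汽很好"))
```

Keeping the stages as separate functions mirrors the description: each stage can be swapped for a learned model without changing the others.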
Data link:
1. First, remove surface errors (misspelled characters and punctuation errors).
2. Then detect and correct grammatical errors.
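The two-step order above can be sketched as a pair of chained stages. This is a minimal sketch under assumptions: the typo table `SURFACE_FIXES`, the punctuation rule, and the single redundant-particle grammar rule are hypothetical toy stand-ins for whatever models the original method uses.

```python
import re

# Hypothetical typo table (an assumption for illustration only).
SURFACE_FIXES = {"天汽": "天气"}


def fix_surface_errors(text: str) -> str:
    """Step 1: fix misspelled characters and punctuation errors."""
    for wrong, right in SURFACE_FIXES.items():
        text = text.replace(wrong, right)
    # Collapse a run of the same punctuation mark into a single mark.
    return re.sub(r"([。!?,])\1+", r"\1", text)


def fix_grammar_errors(text: str) -> str:
    """Step 2: toy grammar rule that drops a redundantly repeated particle."""
    return re.sub(r"的{2,}", "的", text)


def two_stage_correct(text: str) -> str:
    """Apply the surface stage first, then the grammar stage."""
    for stage in (fix_surface_errors, fix_grammar_errors):
        text = stage(text)
    return text


if __name__ == "__main__":
    print(two_stage_correct("今天的的天汽很好。。"))
```

Running the surface stage first means the grammar stage sees text that is already free of typos and stray punctuation.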