Table of Contents
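Title: Research on Deep-Learning-Based Automatic Grammatical Error Correction
Major: | |
Foreign-language title: | Deep learning based automatic grammar error correction |
Keywords: | |
Foreign-language keywords: | Natural language processing; Automatic grammar correction; Parallel corpus; Deep learning; Pre-training |
Abstract: | Automatic grammatical error correction (GEC) is one of the more difficult tasks in syntactic analysis within natural language processing. In everyday conversation, grammatical nuances are what non-native speakers find hardest to grasp and understand. Grammatical error correction for natural language today covers not only grammatical errors but also spelling and collocation errors. |
Foreign-language abstract: | Automatic grammatical error correction (GEC) is one of the most difficult tasks in syntactic analysis within natural language processing. In daily conversation, grammatical nuances are the hardest thing for a non-native speaker to grasp and understand. Current grammatical error correction covers not only grammatical errors but also spelling and collocation errors. |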
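Title: Research on Extracting English Error-Detection Rules for Student Writing Based on Natural Language Processing
Foreign-language title: | Research on the Extraction of English Error Detection Rules based on Natural Language Processing |
Keywords: | |
Foreign-language keywords: | Composition correction; Rules extraction; Rules matching; Natural language processing |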
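Abstract: | Research on second language acquisition has shown that effective corrective feedback helps second-language learners improve their proficiency. Several error-correction tools for English writing are already on the market, such as LanguageTool and Grammarly abroad and Pigai in China. Most of these tools are limited to spelling and grammatical errors in English writing; for more subjective errors such as Chinglish, collocation errors, sentence-pattern errors, and vague meaning, they rely mainly on manually written rules. In addition, although open-source tools such as LanguageTool can detect errors, they cannot be flexibly adapted and modified to suit the characteristics of the rules, and their detection speed is slow. To address these problems, this study proposes to extract error-detection rules semi-automatically from annotated learner corpora and then to develop a lightweight rule matcher to verify and apply the rules. First, English error-correction rules are extracted. A detailed analysis of the annotated learner corpora CLEC and NUCLE determines which error categories can be extracted automatically by a program; an algorithm implemented in Java performs the initial extraction and writes the results to a MySQL database. The extracted rules are then tested and verified, and screened manually. Finally, resources such as the Oxford Collocations Dictionary and Google Books are used to extend the existing rules, so that learners are helped to learn English through corrected errors. Second, the author designs and implements a lightweight rule matcher. The matcher is designed around the extracted rules; it can verify the semi-automatically extracted rule table and also demonstrates the feasibility of extracting rules semi-automatically from learner corpora. The outcome of this study is a set of English error-correction rules extracted from learner corpora by a systematic method, with a detection accuracy above 90%; the rules are also preprocessed, which gives a reliable basis for subsequent expert correction and reduces time cost. In addition, the lightweight rule matcher, designed and implemented as a complement to LanguageTool, increases speed by more than 30% and handles custom rules efficiently. The study shows that this semi-automatic way of generating and applying rules improves efficiency, saves labor, and can help English learners. The project is also general and easily extensible, so it can be extended to other learner corpora or language resources for further research. |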
Foreign-language abstract: | Studies of second language acquisition have shown that effective corrective feedback helps learners develop their second-language ability. Several error-correction tools for English writing are on the market, such as LanguageTool and Grammarly abroad and Pigai.org in China. However, most of these tools are limited to spelling and grammatical errors, and they rely mainly on manually written rules for subjective errors such as Chinglish, collocation errors, sentence-pattern errors, and ambiguous meaning. Although open-source tools such as LanguageTool can identify errors, their rules cannot be adapted and changed flexibly, and their recognition speed is slow. To address these issues, this study uses annotated learner corpora to extract error-detection rules semi-automatically and then develops a lightweight rule matcher to verify and apply the extracted rules. The first part is the extraction of English correction rules. First, the error categories that a program can extract automatically are determined through a detailed, systematic analysis of the annotated learner corpora CLEC and NUCLE. Second, an algorithm implemented in Java performs the initial extraction and writes the results to a MySQL database. The extracted rules are then tested, verified, and filtered manually. Finally, resources such as the Oxford Collocations Dictionary and Google Books are used to extend the existing rules, so as to help learners learn English by correcting errors. The second part is the design and implementation of a lightweight rule matcher. The matcher is designed around the extracted rules and the existing rule base.
On the one hand, it allows the semi-automatically extracted rule table to be verified conveniently. On the other hand, it demonstrates the feasibility of extracting rules semi-automatically from learner corpora. The result of this research is a set of English error-correction rules extracted from learner corpora by a systematic method, with an accuracy of over 90%. Moreover, the rules are preprocessed, which provides a reliable basis for subsequent expert correction and reduces time cost. Furthermore, the lightweight rule matcher, designed and implemented as a complement to LanguageTool, increases speed by more than 30% and handles customized rules efficiently. The study shows that generating and applying rules semi-automatically improves efficiency, saves manpower, and helps English learners. The project is also general and extensible, so it can be extended to other learner corpora or resources for further research. |
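The lightweight rule matcher described in this record can be pictured with a minimal sketch. Everything here is an assumption made for illustration: the `Rule` structure, the two sample rules, and the first-token index are hypothetical and are not taken from the thesis, which stores its rules in MySQL and implements its extraction in Java.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One hypothetical error-detection rule: a token pattern and a suggested fix."""
    pattern: tuple      # lowercase tokens that signal the error
    suggestion: str     # replacement text offered to the learner


def tokenize(sentence):
    # Lowercased word tokens; punctuation is dropped for simplicity.
    return re.findall(r"[a-z']+", sentence.lower())


def build_index(rules):
    # Index rules by their first token so each position checks few candidates.
    index = {}
    for rule in rules:
        index.setdefault(rule.pattern[0], []).append(rule)
    return index


def match(sentence, index):
    # Return (token position, matched pattern, suggestion) for every rule hit.
    tokens = tokenize(sentence)
    hits = []
    for i, tok in enumerate(tokens):
        for rule in index.get(tok, []):
            n = len(rule.pattern)
            if tuple(tokens[i:i + n]) == rule.pattern:
                hits.append((i, " ".join(rule.pattern), rule.suggestion))
    return hits


# Illustrative rules only; real rules would come from the extracted rule table.
RULES = [
    Rule(("discuss", "about"), "discuss"),
    Rule(("learn", "knowledge"), "acquire knowledge"),
]
INDEX = build_index(RULES)
```

Indexing by first token is one simple way to keep matching fast as the rule table grows; whether the thesis uses a similar index is not stated.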
Title: Research on Video Action Recognition Based on Deep Learning
Author: | |
Keywords (Chinese): | |
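Abstract: | In recent years Internet technology has matured, and the spread of smartphones and other digital devices has filled the network with video. As the volume of video grows sharply, some violent and pornographic content is spread unchecked, which harms the physical and mental health of young people and puts great pressure on network supervision. With the amount of surveillance video and online video growing continuously, the demand for understanding video content and analyzing human behavior in video keeps rising. Using computers not only allows video content to be understood better but also spares people from spending large amounts of time analyzing video. Deep learning has contributed greatly to computer vision. Training deep neural networks on large-scale datasets has produced good results in object detection, image classification, and human action recognition in video. Because deep learning models image data well at an abstract level and extracts image features automatically, and because a video can be regarded as a stack of image frames, this thesis uses deep learning to explore the recognition of human actions in video. The main work is as follows. Most convolutional networks used in existing two-stream action recognition methods are BN-Inception or VGG structures, whose large parameter counts make training difficult, so this thesis uses a DenseNet structure to extract the spatial and temporal information of video separately. The original DenseNet uses global dense connectivity, in which each layer in a dense block is connected to every other layer; this tends to cause feature redundancy and a large parameter count. Moreover, because the input of each layer is the concatenation of the feature maps output by all preceding layers, these intermediate feature maps must be stored during both forward and backward propagation, so the original DenseNet occupies a large amount of memory during training. To address these problems, this thesis improves the original DenseNet. First, the full interconnection between layers is changed to local connection, that is, each layer connects only to some of the preceding layers, which greatly reduces the number of parameters to be trained; shared memory is also used to reduce the memory the model occupies. Second, existing two-stream convolutional networks make the final action prediction by taking a weighted average of the two networks' results, which does not make good use of the spatiotemporal information in video, so this thesis merges the extracted video information along the spatial and temporal dimensions. |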
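The effect of replacing global dense connectivity with local connectivity can be illustrated by counting the input channels each layer sees. This is a sketch under assumptions: the abstract does not say which preceding layers are kept, so the sliding `window` of the most recent feature maps, and the function name itself, are hypothetical choices.

```python
def input_channels(num_layers, growth_rate, init_channels, window=None):
    """Input channel count seen by each layer of one dense block.

    window=None reproduces the original DenseNet rule (concatenate the block
    input and every earlier layer's output). window=k keeps only the k most
    recent feature maps, a hypothetical form of the local connection.
    """
    sources = [init_channels]           # widths of the feature maps available so far
    counts = []
    for _ in range(num_layers):
        visible = sources if window is None else sources[-window:]
        counts.append(sum(visible))     # width of the concatenated input
        sources.append(growth_rate)     # each layer emits growth_rate feature maps
    return counts
```

With 4 layers, a growth rate of 32, and 64 input channels, the global rule gives inputs of 64, 96, 128, and 160 channels, while a window of 2 caps them at 64, 96, 64, and 64. Since each layer's parameter count scales with its input width, narrower inputs mean fewer weights to train and fewer feature maps to keep in memory.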
Title: Design and Implementation of a Corpus Query System for Writing Assistance
Foreign-language title: | Design and Implementation of a Corpus Query System for Writing Assistance |
Keywords: | |
Foreign-language keywords: | Corpus; Assisting Reading; Assisting Writing; Corpus Query System; Design and Implementation |
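Abstract: | English writing is a weak point in the English ability of students in China. At present, the public English writing courses offered by domestic schools have relatively limited effect. Although learning websites and books for English writing are available, most of them merely collect and display resources such as template sentence patterns, and the guidance they give learners is not targeted. English writing aids mainly provide machine review and automatic scoring; they only help users find common errors and cannot offer guidance on improving content that is already correct. For writing learners, consulting existing professional or well-written expressions is an effective way to learn, and a corpus, as a knowledge base of real language resources, can provide trustworthy guidance in learning and teaching writing. However, most corpora today are designed for academic research and are inconvenient for ordinary teachers and students: (1) they have many functions and complicated query parameters, so the cost of learning to use them is high; (2) the first few search results are often long, difficult sentences, which makes for a poor reading experience; (3) the query results inevitably contain complex vocabulary and sentences, which are a reading burden for ordinary learners. From the perspective of assisting English writing, this study designs and implements a corpus query system for ordinary students and teachers, so that users can obtain corpus information conveniently and effectively and use corpus queries to find and correct errors and weaknesses in their compositions. The thesis first aims to solve the usability problems that current corpus query systems pose for ordinary English learners, namely inconvenient functions and difficulty reading the corpus text. To this end, the system implements a basic corpus retrieval module, a retrieval result reconstruction module, and a sentence reading assistance module. The basic retrieval module provides common corpus query functions and a simple query mode, which makes the corpus more convenient to use. The result reconstruction module aims to improve the reading experience: it sorts example sentences from easy to difficult and specially marks words in the results that are unfamiliar to the user. The sentence reading assistance module aims to help users absorb the corpus text by providing machine translations, syntactic decomposition, and simplified sentences. The thesis then studies the application of corpus queries in writing scenarios. Focusing on specific problems common in Chinese students' English writing, such as rigid word collocation and overly colloquial collocations, it implements a corpus-query-based writing and correction assistance module. The module provides three functions: collocation extraction from compositions, collocation richness analysis, and collocation verification, using corpus data to help users review and improve collocation use in their compositions. In testing, result reconstruction effectively raised users' interest in reading the corpus text; the help that sentence reading assistance gave in understanding long, difficult sentences was unanimously recognized by the participants; and corpus-assisted writing measurably reduced collocation errors and repeated collocations. After 10 compositions were revised, their scores on Pigai rose by 1.9 points on average (out of 100), with a maximum gain of 5 points. |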
Foreign-language abstract: | English writing is a major weakness for Chinese students. The English writing classes offered by domestic schools have relatively limited effect. Although there are websites and books that aim to help students write in English, most of them simply display resources such as sentence patterns and do not give users targeted instruction. Writing aids mostly cannot help learners reach a higher level, because they offer no guidance on sentences that are already correct; they only review articles, pick out common mistakes, and assign grades. Referring to idiomatic writing is an effective way for learners to improve. A corpus, as a data bank of real language, can provide reliable guidance for learning and teaching writing. However, most corpora today are designed for academic purposes, which do not match the goals of ordinary teachers and students, who may find them inconvenient because: 1) they have too many functions and complicated query parameter settings, so they are hard to learn; 2) the first few search results are often long and difficult, which makes reading hard; 3) complex words and sentences in the query results can be a burden for average learners. Motivated by the idea of assisting English writing, this study designs and implements a corpus query system for ordinary students and teachers. Using the system, users can easily obtain corpus information and use corpus queries to identify and correct deficiencies in their compositions. The thesis first tackles usability problems, namely inconvenient functions and difficulty reading corpus data. To overcome these obstacles, the system comprises a basic query module, a query result reconstruction module, and a sentence reading assistance module.
The basic query module includes a simple query mode that makes the corpus more convenient to use. The result reconstruction module aims to improve the reading experience: it sorts the example sentences in order of difficulty and highlights unfamiliar words. The sentence reading assistance module provides syntactic splitting results (including clause recognition and collocation), machine translation results, and simplified sentences. This study also explores the application of corpus queries in English writing. To address the rigid collocation use and colloquialism common among Chinese students, the author develops a corpus-assisted writing and correction module that helps them write by querying the corpus. The module extracts collocations, analyzes their richness, and verifies them, helping users examine and improve their use of collocation. Testing shows that result reconstruction effectively boosts users' interest in reading corpus query results. The sentence reading assistance module was widely welcomed by participants when tested on long, difficult sentences. The assisted writing module improves collocation use in articles: after 10 articles were revised with the system, their scores on Pigai (a website that provides automated grading) rose by 1.9 points on average (out of 100), with a maximum gain of 5 points. |
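The result reconstruction step, sorting example sentences from easy to difficult and marking unfamiliar words, might look roughly like the sketch below. The difficulty score (word count plus a fixed penalty per unfamiliar word), the asterisk markup, and the `known_words` set are all assumptions for illustration; the abstract does not state how the thesis measures difficulty or models the user's vocabulary.

```python
import re

WORD = r"[A-Za-z']+"


def difficulty(sentence, known_words):
    # Hypothetical score: sentence length plus a penalty per unfamiliar word.
    words = re.findall(WORD, sentence)
    unknown = sum(1 for w in words if w.lower() not in known_words)
    return len(words) + 5 * unknown


def highlight(sentence, known_words):
    # Wrap each unfamiliar word in asterisks so a front end could style it.
    def mark(m):
        w = m.group(0)
        return w if w.lower() in known_words else f"*{w}*"
    return re.sub(WORD, mark, sentence)


def reconstruct(results, known_words):
    # Easy sentences first, with unfamiliar words marked.
    ranked = sorted(results, key=lambda s: difficulty(s, known_words))
    return [highlight(s, known_words) for s in ranked]
```

For example, with a known vocabulary of `{"the", "cat", "sat", "on", "mat", "a", "is", "it"}`, the results `["The ubiquitous cat sat on the mat.", "It is a cat."]` come back with the shorter sentence first and `ubiquitous` marked.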
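Title: Research on Key Technologies for Literature-Based Target Prediction of Classical Traditional Chinese Medicine Formulas
Classification number: | TP3 |
Title: Research on Key Technologies for Automatic Generation of Science and Technology Briefings Based on Network Representation Learning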
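Keywords (Chinese): | |
Abstract: | Since the reform and opening-up, new science and technology policies have emerged continuously in China. These policies diffuse across levels of government and across regions and strongly influence the policy behavior of governments at every level and the improvement of science and technology governance, yet few scholars have studied this diffusion in depth. Existing research on science and technology policy diffusion is mostly qualitative, focusing on theoretical frameworks, constraining factors, and case studies of diffusion, and lacks systematic verification of the proposed theories and models. The few quantitative studies mainly apply basic statistical analysis with heavy manual involvement and no automatic mining of policy content and attributes, so it is hard to extract diffusion relations precisely or to mine changes in content. Building on earlier research and the characteristics of science and technology policy diffusion, this thesis constructs a diffusion model at the structural and semantic levels and introduces text analysis and computation methods from natural language processing to extract diffusion features and mine policy diffusion relations automatically. (1) Meaningful-string discovery and policy structure extraction in the policy domain. Science and technology policies contain many new and long terms, so conventional word segmentation falls short of what the analysis needs. The thesis proposes an optimization method based on rules and information entropy, and experiments show that it effectively identifies the vast majority of meaningful strings in policy texts. For policy structure, the thesis proposes methods for extracting and extending the organizational structure. It first uses the writing conventions of policy documents, combined with term frequency and the TextRank algorithm, to extract the organizational structure of a policy. It then builds a structural vocabulary for the science and technology policy domain and uses it to extend the organizational structure, finally extracting the basic aspects of the policy. (2) Representation of policy diffusion features and determination of diffusion relations. The thesis studies diffusion features from both structural and semantic perspectives, extracting organizational-structure correlation features, basic-aspect identity features, feature-word inheritance features, and Doc2vec-based text similarity features. On this basis it uses a decision tree classifier, recasting relation determination as a classification problem so that multiple features are handled together; experiments show that the multi-feature classification model determines policy diffusion relations effectively. (3) Policy diffusion identification. To meet the need to analyze diffusion among policies on the same topic, the thesis builds a diffusion identification framework and introduces the Ranking SVM model, adapting it by fusing policy diffusion features with text diversity features. It then proposes a ranking-score-based method for computing the ranking distance between policies and searches for the maximum ranking distance at which a diffusion relation still holds, which serves as an empirical threshold for diffusion identification. This threshold is used to optimize the identification model, so that diffusion pairs and diffusion sets are computed and output automatically during retrieval. Experiments show that the framework extracts diffusion sets effectively and meets users' needs for mining diffusion relations among policies on a given topic. |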
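One common way to use information entropy for meaningful-string discovery is boundary (branching) entropy: a candidate string whose left and right neighbours vary widely is more likely to be a free-standing term. The sketch below shows that idea only; it is an assumption that the thesis uses entropy in this way, and the function name and interface are hypothetical.

```python
import math
from collections import Counter


def boundary_entropy(text, candidate, side="right"):
    """Shannon entropy (in bits) of the characters adjacent to `candidate`.

    side="right" looks at the character following each occurrence,
    side="left" at the character preceding it.
    """
    neighbours = Counter()
    start = text.find(candidate)
    while start != -1:
        pos = start + len(candidate) if side == "right" else start - 1
        if 0 <= pos < len(text):
            neighbours[text[pos]] += 1
        start = text.find(candidate, start + 1)   # allow overlapping matches
    total = sum(neighbours.values())
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in neighbours.values())
```

A string that is always followed by the same character scores 0 and is probably a fragment of a longer term; a string with high entropy on both sides is a candidate for a complete term. A rule layer, as the abstract mentions, would then filter the candidates.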
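Title: Research on Key Technologies of Dermatosis Diagnosis and Treatment Paths Based on Monte Carlo Algorithms
Title (foreign language): | Research on Key Technologies of Dermatosis Diagnosis and Treatment Path Based on Monte Carlo Algorithms |
Keywords (Chinese): | |
Keywords (foreign language): | Dermatology Diagnosis and Treatment; Medical Record Analysis; Monte Carlo Algorithms; The Shortest Path |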
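Abstract: | After more than 60 years of development, information technology has spread to every aspect of social life. As it is applied across the fields of medicine, large amounts of data are generated. Skin diseases are common and frequently occurring, with more than a thousand related conditions. Medical record data are of great value, and the semantic knowledge points they contain can be used for clinical decision support and health management. Dermatology clinics across the country currently face multiple problems, including long waiting times, difficulty obtaining treatment and medication, and inaccurate diagnosis, so dermatologists urgently need a computer-aided diagnosis and treatment tool that makes recommendations automatically to support decision-making and intelligent medical diagnosis. This thesis proposes a method, based on Monte Carlo algorithms, that quickly identifies the knowledge points underlying dermatological diagnosis and treatment and forms a shortest diagnosis and treatment path, which is used to give physicians the best recommendation for the next step. Based on a systematic analysis and summary of the structure and content of dermatology medical records, with the Routines of Dermatology Diagnosis and Treatment as the basis for judging diagnostic evidence, and drawing on the characteristics and strengths of Monte Carlo algorithms, the thesis proposes a scheme that takes medical records as the dataset and the extraction and structuring of diagnostic evidence as the object of study, generates dermatological diagnosis and treatment paths, and computes the shortest one with Monte Carlo algorithms; the feasibility of the scheme is verified experimentally. The research focuses on how to analyze and model traditional dermatological diagnosis and treatment so as to form a matrix suited to Monte Carlo computation, how to extract diagnostic evidence from the correspondence between the structure and content of medical records and of the clinical manual, and how to apply Monte Carlo simulation and parameter tuning to generate the shortest diagnosis and treatment path in support of clinical practice. The specific work is as follows. The textual features of dermatology medical records and the clinical manual are analyzed, and their semantic and structural information is mined in depth to extract the semantic set of diagnostic-evidence knowledge points. Using text analysis models and machine learning, a matrix suited to Monte Carlo computation is formed and a dermatological diagnosis and treatment model is built. Based on Monte Carlo algorithms, key techniques for representing and structuring the diagnosis and treatment process and for computing and shortening paths are explored and implemented, and the shortest path is computed. Finally, experiments demonstrate the effectiveness of these methods, which can be applied to recommending the best diagnostic evidence for the next step. |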
Abstract (foreign language): | After more than 60 years of development, information technology has spread throughout society. As information technology is applied in medicine, a large amount of data is generated. Dermatoses are common and frequently occurring, and more than 1,000 kinds are now known. Medical record data are of great value, and their semantic knowledge points can be used for clinical decision support and health management. At present, dermatology clinics nationwide face many problems, such as long waiting times, difficulty obtaining treatment and medication, and inaccurate diagnosis. Dermatologists therefore urgently need a computer-aided diagnosis tool that makes recommendations automatically to assist decision-making and intelligent medical diagnosis. This paper proposes a method, based on Monte Carlo algorithms, that quickly extracts diagnostic knowledge points and identifies the shortest path of dermatological diagnosis and treatment, which can be used to give doctors a recommendation for the next step. After a systematic analysis and summary of the structure and content of dermatology medical records, this paper proposes a scheme for calculating the shortest diagnosis and treatment path based on Monte Carlo algorithms. The scheme takes the Routines of Dermatology Diagnosis and Treatment as the basis for diagnosis and treatment, draws on the advantages of Monte Carlo algorithms, and uses dermatology medical records as the data set from which diagnostic knowledge points are extracted and structured. The feasibility of the scheme is verified experimentally.
The focus of this paper is how to model traditional diagnosis and treatment methods so as to form a large matrix suited to Monte Carlo algorithms, how to extract the knowledge points of dermatological diagnosis and treatment from the correspondence between the structure and content of medical records and of the Routines of Dermatology Diagnosis and Treatment, and how to tune the parameters and calculate the shortest diagnosis and treatment path with Monte Carlo algorithms. The research work is as follows. First, the text characteristics of dermatology medical records and the Routines of Dermatology Diagnosis and Treatment are analyzed. Based on a model of the text structure, the knowledge points of diagnosis and treatment are obtained through automatic rule extraction. Then, a large matrix suited to Monte Carlo algorithms is formed using text analysis models and machine learning, and a dermatological diagnosis and treatment model is constructed. After the diagnosis and treatment processes are represented and structured, Monte Carlo algorithms are applied to the diagnosis and treatment paths to calculate the shortest one. Finally, experiments demonstrate the effectiveness of these methods, and a recommendation system for optimal dermatological diagnosis and treatment is designed and implemented. |
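The general idea of estimating a shortest path by Monte Carlo sampling can be sketched as repeated random walks that keep the cheapest path found. This is a generic illustration, not the thesis's algorithm: the graph, its node names and edge costs, and the uniform choice among successors are all assumptions, and the thesis works on a matrix built from medical records rather than on this toy dictionary.

```python
import random


def monte_carlo_shortest_path(graph, start, goal, samples=200, max_steps=20, seed=0):
    """Estimate the cheapest start-to-goal path by sampling random walks.

    `graph` maps each node to a dict of {successor: edge cost}.
    Returns (best cost, best path); the path is None if no walk reached the goal.
    """
    rng = random.Random(seed)
    best_cost, best_path = float("inf"), None
    for _ in range(samples):
        node, path, cost = start, [start], 0
        # Walk until the goal, a dead end, or the step limit is reached.
        while node != goal and len(path) <= max_steps and graph.get(node):
            nxt, weight = rng.choice(sorted(graph[node].items()))
            path.append(nxt)
            cost += weight
            node = nxt
        if node == goal and cost < best_cost:
            best_cost, best_path = cost, path
    return best_cost, best_path


# Hypothetical diagnostic steps A..D with illustrative transition costs.
GRAPH = {"A": {"B": 1, "C": 4}, "B": {"C": 1, "D": 5}, "C": {"D": 1}, "D": {}}
```

On this small graph the walk A, B, C, D with total cost 3 is the cheapest, and a few hundred samples find it with near certainty. On a large graph the estimate is only as good as the sampling budget, which is presumably where the parameter tuning mentioned in the abstract comes in.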
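Title: Research on Key Technologies for Domain-Oriented Advanced Technology Detection
Title (foreign language): | Research on Domain-Oriented Advanced Technology Detection |
Keywords (Chinese): | |
Keywords (foreign language): | |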
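Abstract: | This thesis addresses the lack of a comprehensive model for advanced technology detection in existing technology detection research, and uses domain scientific literature to study the key techniques involved. A survey shows that mining technology points, and capturing how their advancement manifests in text, are essential tasks in building a sound and effective detection model. The thesis therefore first builds, following the ideas and methods of advanced technology detection, a detection model that covers mining potential technology points in a domain, mining potential advanced technologies in the domain, and discovering their features; it then builds a fusion model for the features of advanced domain technologies to complete the design of the detection model. In parallel, an index system for evaluating technological advancement is established. Finally, the initial model is tested in several technical domains, and the experimental results are compared against the evaluation indices to optimize the model. The specific research content is as follows. First, the thesis examines how different types of scientific literature behave as sources of technology points and formulates a corresponding strategy for acquiring technical terms for each; the characteristics of multi-source literature also inform the choice of text features for advanced technologies. A domain knowledge base is also built from scientific literature, and its conceptual structure helps later stages mine advancement-related text features. This part proposes a TFIDFC-value method for extracting technical strings, designed around the characteristics of technology points; experiments show that the method is effective to a degree. The domain technology points obtained this way serve as the basic technical terms for mining advanced domain technologies and as part of the objects of advancement evaluation. Second, the thesis selects four aspects, namely the technology life cycle, the evolution of domain technology topics, the terminology of domain science and technology texts, and domain patent layout, from which to extract text features of technological advancement. It assumes that advanced technologies are in the emergence and growth stages of the technology life cycle and appear in new topics of domain technology evolution, in new terms in domain project texts, and in the non-mainstream patent layouts of large companies in the domain. Based on these assumptions and the basic technical terms, it extracts candidate technical terms that may be advanced, which widens the range of technical terms acquired. The thesis then derives an index system for evaluating technological advancement from related research and fuses the previously extracted text feature information on the basis of theories such as technology maturity and technological knowledge diffusion. The index system is used for advanced technology detection in a domain and, together with the mining of advanced technology features, forms the initial detection model. Finally, the thesis selects the autonomous vehicle and Internet of Things domains for retrospective experiments on the detection model. The experiments show that the initial model is effective; based on their results, the model is improved with a view to making technology points more specialized and more unitary, and the results after improvement are compared with the original results. The comparison shows that the improved model raises, to some extent, the accuracy of advancement detection for the top-ranked candidate technology points. |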
Abstract (foreign language): | This paper studies the key technologies of advanced technology detection using the scientific literature of a domain. A survey finds that, in advanced technology detection, mining technology points and capturing how their advanced characteristics appear in text are important tasks in building a reasonable and effective detection model. Therefore, this paper first constructs a detection model that includes mining potential technology points in a domain, mining potential advanced technologies in the domain, and feature discovery, based on the theories and methods of advanced technology detection, and establishes a fusion model for the features of advanced domain technologies to improve the initial model design. At the same time, an index system for evaluating technological advancement is established. Finally, the initial model is tested in several technical fields, and the experimental results are compared with the evaluation indices to optimize the detection model. The specific research contents are as follows. First, this paper discusses the characteristics of different types of scientific literature as sources of technology points and formulates corresponding strategies for acquiring technical terms; the characteristics of multi-source literature also provide a basis for selecting text features of advanced technologies. A domain knowledge base is also established from scientific literature, and its conceptual structure helps subsequent research mine advancement-related text features. In this part, a TFIDFC-value string extraction method based on the characteristics of technology points is proposed, and experiments show that the method is effective to a degree.
The domain technology points obtained by this method serve as the basic technical terms for mining advanced domain technologies and as part of the objects of advancement evaluation. Second, this paper selects four aspects, the technology life cycle, the evolution of domain technology topics, the terminology of domain technology texts, and domain patent layout, from which to extract text features of technological advancement. It assumes that advanced technologies lie in the emergence and growth stages of the technology life cycle and appear in new topics of domain technology evolution, in new terms in domain project texts, and in the non-mainstream patent layouts of large companies in the domain. Based on these hypotheses and the basic technical terms, candidate technical terms that may be advanced are extracted, which enlarges the scope of technical term acquisition. Then, drawing on related research, this paper summarizes an index system for evaluating technological advancement and fuses the previously extracted feature information on the basis of theories such as technology maturity and technological knowledge diffusion. The index system is used for advanced technology detection in a domain and, together with the mining of advanced technology features, constitutes the initial detection model. Finally, this paper chooses autonomous vehicles and the Internet of Things as the domains for retrospective experiments on the detection model. Experiments show that the initial model is effective. Based on the retrospective results, the model is improved by making the technology points more specialized and more unitary, and the improved results are compared with the original results. To some extent, the improved model raises the accuracy of advancement detection for the top-ranked candidate technology points. |
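The C-value half of the TFIDFC-value method can be sketched with the widely used C-value termhood formula: a candidate's frequency is discounted by the average frequency of the longer candidates that contain it, then weighted by the log of its length. How the thesis combines this with TF-IDF is not stated in the abstract, so the sketch below covers plain C-value only, and the function name and sample terms are hypothetical.

```python
import math


def c_value(freq):
    """C-value termhood scores.

    `freq` maps each candidate term (a tuple of words) to its corpus frequency.
    A term nested in longer candidates has its frequency reduced by the mean
    frequency of those longer candidates. Single-word terms score 0 here
    because log2(1) = 0.
    """
    def contains(longer, shorter):
        n = len(shorter)
        return len(longer) > n and any(
            longer[i:i + n] == shorter for i in range(len(longer) - n + 1)
        )

    scores = {}
    for term, f in freq.items():
        nests = [f_b for b, f_b in freq.items() if contains(b, term)]
        adjusted = f - sum(nests) / len(nests) if nests else f
        scores[term] = math.log2(len(term)) * adjusted
    return scores


# Illustrative candidate terms and frequencies, not data from the thesis.
FREQ = {
    ("lidar", "sensor"): 10,
    ("lidar", "sensor", "fusion"): 4,
    ("solid", "state", "lidar", "sensor"): 2,
}
```

Here `("lidar", "sensor")` occurs 10 times but is nested in two longer candidates with mean frequency 3, so its score is 1 × (10 − 3) = 7, while the four-word term scores 2 × 2 = 4.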
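Title: Design and Implementation of an Automatic Government Document Generation System Based on a Hierarchical Conditional Variational Autoencoder
Keywords: | |
Foreign-language keywords: | LSTM; CVAE; Keyword extraction; Government document; Automatic text generation |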
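Abstract: | In recent years, text generation has been a highly challenging task in natural language processing (NLP). Good progress has been made on short text and poetry generation, but as text gets longer, problems such as information loss, error propagation, and error drift arise, so research on long text generation is still at an early stage, especially for Chinese. Government document generation is a special case of Chinese long text generation. Government documents are a distinctive cultural heritage through which China conveys political tasks, expresses political views, and records historical events, and they have their own line of reasoning and wording. The difficulties of generating them have much in common with long text generation in general: both aim for text with wording diversity and thematic consistency. Wording diversity means that a sentence expresses its meaning with varied words rather than repeating the same characters or words; thematic consistency means that the sentences of a text, and the words within each sentence, address the same theme. With the spread of deep learning, seq2seq has become a common framework for high-quality text generation. Introducing a VAE makes the seq2seq generation process more diverse, and researchers have found that adding generation conditions to the VAE to form a CVAE further improves thematic consistency within sentences and wording diversity. Recent work has also shown that keywords can serve as intermediate generation results to further improve thematic consistency between sentences. Although CVAEs have been shown to work for text generation, their generation is not sufficiently directed, and they cannot adequately guarantee thematic consistency or produce more varied wording. This thesis adds keywords, which act like a writing outline, to obtain Key-CVAE, so that in generating Chinese government documents the model considers thematic consistency between words and also improves thematic consistency between sentences. Experiments show that Key-CVAE achieves better-than-expected thematic consistency at both the document and sentence levels on the government document dataset built for this thesis. A series of comparative experiments confirms that combining keywords with CVAE strengthens the thematic consistency of CVAE while preserving wording diversity, and also confirms that the diversity of the training dataset affects the generated results. Although long text generation for Chinese is still at an early exploratory stage, the Key-CVAE model introduced here has good reference value and offers a new line of approach for future research on long text generation. |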
外文摘要: | Text generation is a challenging task in Natural Language Processing(NLP). Although text generation has achieved success in many fields such as Short-text generation and Poetry generation. But, when the text becomes longer, it will cause problems such as information loss, error transmission, error migration, etc. Therefore, the research on Long-text generation is still in its preliminary stage, especially in the Chinese Long-text generation, such as the Government document generation which is a special kind of Long-text generation. Government document is a unique cultural heritage with its special use and combination of words. Aiming to publish political tasks, express political views and note historical events. However, the challenges government documents face have much in common with traditional texts, like wording diversity and thematic consistency. Wording diversity highlights the type of words used and thematic consistency emphasized the consistency of theme between sentences and words. With the popularity of deep learning methods, Seq2Seq is a commonly used and a high-quality text generation framework in text generation. The use of VAE can make the generation process of seq2seq more diverse. Scholars have also found that the generation conditions of CVAE can further improve the VAE’s text generation in internal theme consistency and wording diversity. In recent work, keywords serve as intermediate generation results have also been shown can further improve the topical consistency between sentence and sentence. Although CVAEs have been proven to be useful for text generation, but their generation is not sufficiently directed. This paper attempts to propose the model: keyword-enhanced conditional variation autoencoder (Key-CVAE) to solve the problem of Chinese government document generation by adding the keywords as writing outline in the consistency of theme between sentences and words. 
Experiments show that Key-CVAE achieves higher-than-expected thematic consistency at both the document level and the sentence level on the government document dataset constructed in this paper. A series of comparative experiments further verifies that combining keywords with the CVAE strengthens its thematic consistency while maintaining its wording diversity, and that the diversity of the training dataset affects the generated results. Although long-text generation for Chinese is still at an early exploratory stage, the Key-CVAE model introduced in this paper has reference value and offers a new idea for future research on long-text generation. |
分类号: | TP3 |
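For readers unfamiliar with the CVAE mentioned in the abstract above, the standard conditional VAE training objective is sketched below. This is the textbook formulation, not necessarily the exact loss used in the thesis. The symbols are assumptions for illustration: x is the target text, c is the generation condition (for a keyword-enhanced model, one would presumably include the keywords in c), and z is the latent variable.

```latex
\mathcal{L}(\theta, \phi; x, c)
  = \mathbb{E}_{q_\phi(z \mid x, c)}\bigl[\log p_\theta(x \mid z, c)\bigr]
  - \mathrm{KL}\bigl(q_\phi(z \mid x, c) \,\|\, p_\theta(z \mid c)\bigr)
```

The first term rewards accurate reconstruction of the text given the condition; the second keeps the recognition network close to the condition-dependent prior, which is what allows varied samples of z at generation time.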
题名: 一种英语写作知识点推荐策略
论文摘要: | 无论在考试还是在日常生活中,英语写作都是中国学生不可回避的难题。目前,市面上虽然存在包括书籍与网站在内的各种英语写作教学资源,但这些资源的作用相对有限。相关书籍只是理论和语料资源的搜集整合;而写作辅助系统为学生的文章提供分数判定和错误批改,并没有结合学生的写作水平和写作目的针对性地在写后阶段帮助学生获得写作能力的提升。 英语写作教学理论繁多,在众多英语写作教学理论中,让学生通过不断改写文章来提升写作能力是广受认可的一种方案。本系统根据此教学理念设计。为了得到适合每个学生的文章改写方向,本系统充分考量每个学生的写作记录,设计并实现了一种英语写作知识点推荐策略。该策略在运行时可以修正自己的反馈以适应对象的变化。对于英语写作教学,该策略的目的是为学生推荐出其最需要学习的内容以提升其写作能力,本文将这些内容定义为学生使用次数较少而母语者使用次数较多的知识点。 对于中国学生在英语写作时遇到的无词可用、难以连词成句、表达不够地道等问题,本文提出了解决方案并对方案进行了验证。使用系统时,学生输入一篇自己的习作,系统对学生该篇习作和学生的写作历史进行单词、搭配、短语和句型四个维度英语知识点使用频率的计算,对比学生文章与范文中各个知识点的使用频率得出推荐的知识点。在单词推荐模块,为学生推荐使用频率小于同主题范文的单词集合。在搭配、语块和句型推荐模块,对于学生没有使用过的知识点,为其推荐同主题范文中使用频率最高的知识点;对于学生使用过的知识点,为其推荐同主题范文中使用频率/学生使用频率最高的知识点。当学生历史文章数量不足时,通过级别判定和主题约束模块来选取替代文章,使用中国英语学习者语料(Chinese Learner English Corpus,CLEC)作为学生历史文章的替代语料。 测试显示,本系统推荐的知识点可以有效提升写作者的写作表现,知识点的有用性得到了被试者的一致认可。在系统测评时,首先邀请五位英语水平较高的人员进行了英语作文写作。让五位受邀者使用系统改写自己的文章,对改写前后的五篇文章分别进行了专家评分和计算机自动评分,并邀请写作者对系统推荐的知识点做了打分。结果显示,在采纳系统推荐的知识点进行文章修改后,人工评分平均提高了6.3%(满分9分制,平均提高0.568分),机器评分平均提高了0.5%(满分100分制,平均提高0.5分)。在对推荐的知识点进行人工评分时,在满分5分制下,单词推荐结果的平均人工评分为3.65,搭配推荐结果的平均人工评分为2.3,语块推荐结果的平均人工评分为3.63,句型推荐结果的平均人工评分为2.8。为了验证系统对英语水平一般的英语写作者的作用,笔者从CLEC语料库中随机抽取5篇文章,并邀请五位学生对这五篇文章进行严格基于系统推荐的知识点的改写,在修改后,人工评分平均提高了7.4%(满分9分制,平均提高0.67分),机器评分平均提高了8.5%(满分100分制,平均提高8.5分)。 |
外文摘要: | English writing is a major obstacle for Chinese students, both in exams and in daily life. Although various books and websites on English writing exist, most of the books merely collect theory and corpus resources, while the writing-assistance websites do no more than grade and correct students' articles. These tools lack the individualized writing guidance that is key to improving students' writing ability. Among the many theories of teaching English writing, one of the most widely recognized is improvement through rewriting, and this system is designed and developed on that basis. To find a suitable rewriting direction for each student, the system takes the student's writing record into account and recommends knowledge points of four kinds: words, collocations, chunks, and sentence patterns. The strategy adjusts its feedback at run time to adapt to changes in the learner. Its main goal is to improve students' writing ability by recommending the content they most need to learn, defined here as knowledge points that students rarely use but native writers use often. The system aims to tackle problems that Chinese students commonly face, such as being at a loss for words, having difficulty combining words into sentences, and writing unidiomatically. This thesis proposes a solution to these problems and verifies its effect. To use the system, a student inputs an article; the system then calculates the usage frequency of words, collocations, chunks, and sentence patterns in that article and in the student's writing history, and compares it with the frequency in model essays to decide what to recommend. When recommending words, the system picks those that the student uses less often than the model essays on the same topic. For collocations, chunks, and sentence patterns, the strategy distinguishes two scenarios. For knowledge points that the student has never used, those with the highest usage frequency in model essays on the same topic are recommended.
For knowledge points that the student has used before, the system calculates the ratio of the usage frequency in model essays to the usage frequency in the student's articles and recommends the knowledge points with the largest ratio. When a student does not have enough past articles, the system uses its level determination and topic constraint modules to select substitute articles from the Chinese Learner English Corpus (CLEC). Testing shows that the system is effective in improving users' writing performance and that the users approve of the recommended knowledge points. In the evaluation, five writers with a relatively high level of English first wrote essays and then rewrote them with the system. The essays before and after rewriting were scored by human raters (two English experts) and by machine (Pigaiwang, a website with a grading module), and the writers rated the recommended knowledge points. After revision with the recommended knowledge points, human scores rose by 6.3% on average (0.568 points on a 9-point scale) and machine scores rose by 0.5% on average (0.5 points on a 100-point scale). On a 5-point scale, the writers gave average ratings of 3.65 for word recommendations, 2.3 for collocation recommendations, 3.63 for chunk recommendations, and 2.8 for sentence pattern recommendations. Because these five writers generally had a high level of English writing, the author also tested the system's effect on average writers: five articles were randomly drawn from CLEC, and five students were invited to rewrite them strictly on the basis of the knowledge points recommended by the system. After revision, human scores rose by 7.4% on average (0.67 points on a 9-point scale) and machine scores rose by 8.5% on average (8.5 points on a 100-point scale). |
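The two-scenario recommendation rule described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the thesis's implementation: the function name `recommend`, the use of raw counts as "usage frequency", the `top_n` cut-off, and the sample collocations are all assumptions made for the example.

```python
from collections import Counter


def recommend(student_counts, model_counts, top_n=3):
    """Rank knowledge points (e.g. collocations) for one learner.

    Hypothetical sketch: raw counts stand in for usage frequency.
    Returns two lists:
      - points the student never used, ordered by frequency in model essays
      - points the student has used, ordered by model/student frequency ratio
    """
    unused = []
    used = []
    for point, model_freq in model_counts.items():
        student_freq = student_counts.get(point, 0)
        if student_freq == 0:
            # Scenario 1: never used by the student, rank by model frequency.
            unused.append((point, model_freq))
        else:
            # Scenario 2: already used, rank by model / student ratio.
            used.append((point, model_freq / student_freq))
    # Highest score first; ties broken alphabetically for determinism.
    unused.sort(key=lambda item: (-item[1], item[0]))
    used.sort(key=lambda item: (-item[1], item[0]))
    return ([p for p, _ in unused[:top_n]], [p for p, _ in used[:top_n]])


# Illustrative data only, not taken from the thesis.
student = Counter({"play a role": 4, "in terms of": 1})
model = Counter({
    "play a role": 4,
    "in terms of": 6,
    "take into account": 5,
    "give rise to": 2,
})
never_used, underused = recommend(student, model)
print(never_used)
print(underused)
```

In practice one would probably normalize counts by text length before comparing a single learner with a larger set of model essays; the abstract does not say how frequency is normalized, so the sketch leaves that out.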
论文摘要: | 古籍是辛亥革命以前传抄或刻印的历史典籍等资源的统称,具有较高的文物价值和文化意义。但因年代久远,善本难存。为了恢复古籍的原本样貌,古籍工作者需要进行辑佚、校勘、注释、标点等整理工作,以探求古籍原本样貌,便于后人阅读研究。 为了实现古籍资源共享,数字化是必由之路。如何借助先进的信息技术,提升古籍整理效率,解决数字化过程中存在的问题是当务之急。通过分析古籍整理的研究现状,可将问题总结为如下四点:一、古籍整理缺少功能完整、流程完善的开放平台;二、缺乏统一规范的整理流程,整理工作欠缺指导;三、仅重视整理的结果,丢失整理的过程信息;四、专业性较强的整理工作缺少专家参与,整理质量参差不齐。 为解决上述问题,以提供便捷、完整、高效的古籍整理系统为目标,结合古籍整理的特点和原则,笔者创新性地提出了重视古籍整理过程的思路,并完成了多层次、可追溯的古籍整理平台产品设计,为整理者提供了高效的工作环境。平台的优势可总结如下:一、包含完整的工作流程,平台将版本选择、文本录入、内容整理等重点工作囊括在内。在系统的指引下,整理者可通过一个平台完成整理任务,减少不同工具之间的切换。二、重视整理过程,将系统分成不同的工作层次,借助富信息的设计,保存每层的校改信息,根据存储的数据追溯整理过程,出现问题便于定位,及时更改,也为研究提供了支持。三、区分专家和普通整理者角色,分别匹配不同的整理任务,确保参与者能胜任整理工作,产出符合要求的整理成果;同时,系统分为多个层次,可在每个层次审查整理结果,保证整理质量。 在北京大学儒藏古籍整理专家的指导下,笔者通过整理工作的典型场景应用范例,对本研究设计的整理平台进行了验证。工作成果得到了古籍专家的肯定,证明了富信息整理平台可以为古籍整理工作提供便利,提高整理结果的可信度。
外文摘要: | Ancient books are historical books written or printed before the 1911 Revolution, and they are carriers of Chinese culture. Because hundreds of years have passed since these books first appeared, they have inevitably suffered loss. To restore their original appearance for reading and research, specialists need to collate them and add punctuation, notes, and so on. Digitization is the only way to share ancient book resources widely. In the past, scholars carried out this work with outdated tools, but information technology now brings more possibilities to collation. Although computer-aided collation systems help improve efficiency, many problems remain in collation practice. First, there is no open, integrated platform for collation work. Second, a standard workflow and guidance are lacking. Third, high-value information is lost because the result of the work is emphasized over the process, so the workflow cannot be traced back. Finally, specialized collation work lacks expert participation, so its quality is uneven. To solve these problems, the author proposes the novel idea of valuing the collation process. Guided by this idea, the paper designs an ancient book collation system with the following advantages. First, the system guides users through the whole collation process. Second, the system emphasizes the collation process and records process data in XML files, which makes it possible to trace the workflow back. Third, differentiating the work assigned to different user roles and verifying results at multiple layers of the system help guarantee quality. The paper uses a specific collation scenario to validate the design of the Information-Rich Ancient Books Collation System.
Experts from the Ru Cang Compiling and Editing Center of Peking University recognized the results. The interviews showed that this study and design can effectively lighten the burden of collation work and improve working efficiency in this field. |
论文摘要: | 随着国际化进程的不断发展,人们越来越重视外语能力的培养,尤其是在真实情境中的语言运用能力,其中英语写作能力占据了非常重要的位置。但由于国内学生的外语基础较为薄弱,能够实际操练的机会少,且外语写作水平的提升并非一蹴而就,因此写作成为了国内外语学生的弱项,同时也成为了他们焦虑和畏惧的对象。 |
外文摘要: | With continuing internationalization, people pay more and more attention to foreign language competence, especially the ability to use the language in real life, in which English writing ability plays a crucial role. However, because writing skills cannot be improved overnight, most Chinese students, who have limited knowledge of English and lack opportunities to put it into practice, are frustrated when they write in English. |
文摘: | 文本特征的自动量化分析是通过计算机程序实现文本特征的定量评估。文本量化的一大核心是建立一组反映文本特征的指标体系。相较于人工分析,定量分析文本特征更加客观和高效。因此,在西方它已被应用在字母类语言的话语分析、语料库研究等领域。目前,文本特征量化指标的研究多以英文文本为对象,中文文本分析量化指标研究较少。从现有研究来看,中文文本分析主要存在三个问题。第一,现有的中文文本量化指标体系化不足,研究较为单一且不够全面;第二,缺少一个中文文本自动量化分析系统;第三,中文文本量化分析指标体系和量化分析系统的应用价值亟待研究和证明。 基于上述中文文本分析量化指标研究中存在的问题和不足,本研究围绕现代汉语,确立了文本分析量化指标研究、自动量化分析工具的实现、量化指标的应用三大研究主线。第一,在英文文本分析量化指标的研究基础上,以汉语特点和汉语语法为纲,建立了适用于中文文本分析的量化指标体系;第二,以中文文本量化指标体系为基础,依托于自然语言处理技术与中文语言资源,设计并实现了一款中文文本量化分析工具;第三,将中文文本量化分析工具应用于现代汉语文本特征研究和文本分级模型。 本研究建立的中文文本分析量化指标体系聚焦于文本的语言特征,总共包括五个层面:描述性特征层面、汉字层面、词汇层面、句子层面和语篇层面。描述性特征层面包括12个指标;汉字层面包括32个指标;词汇层面包括67个指标;句子层面包括60个指标;语篇层面包括1个指标。整个文本量化指标体系包含的指标共计170余项。本研究以两个应用为例,阐述了中文文本分析量化指标体系和量化分析系统的实用价值。第一,本研究以人教版小学语文教科书课文为例,从汉字、词汇和句子等五个层面,对语料进行了较为全面地统计分析。第二,本研究基于机器学习算法和量化指标体系,构建了文本分级模型,模型的预测准确度高达0.90左右。 研究数据结果显示,随着年级的上升,小学教科书课文的字词量、汉字复杂度、词汇难度和句法复杂度等特征值均呈现上升态势,基本遵循了从简到难的编排特点。然而,课文的用字用词仍存在改进之处。例如低年级课文中出现了较多的非常用字词;部首表收录内容与课文用字的关联度较弱等。这些研究结果已被应用于北京大学俞敬松老师研究小组的相关教学研究中。此外,本研究构建的文本分级模型能参照标准教科书,预测文本的阅读级别,从而被应用于不同阅读级别文本的自动分类和筛选。 |
文摘(外文): | Automated textual analysis evaluates text features quantitatively with computer programs. One of its core issues is building a system of indicators that reflects the characteristics of a text. Compared with manual analysis, quantitative analysis of text is more objective and efficient. It has therefore been applied in the West to discourse analysis and corpus research on alphabetic languages. Currently, most studies of quantitative text indicators focus on English texts, and studies of Chinese textual analysis are rare. Three major shortcomings can be found in the existing studies of Chinese textual analysis. First, the existing quantitative indicators for Chinese text are not systematic or comprehensive. Second, there is no system for the automated quantitative analysis of Chinese texts. Third, the practical value of a quantitative indices system and an automated tool has yet to be studied and validated.
To address these problems and fill the research gap, this research focuses on three lines of work related to modern Chinese texts. First, building on research into quantitative indices for English textual analysis, a quantitative indices system for Chinese textual analysis is established on the basis of the characteristics and grammar of Chinese. Second, a tool for the automated analysis of Chinese texts based on the indices system is designed and built with the support of natural language processing technology and Chinese language resources. Third, the tool is applied to the study of modern Chinese text features and to text leveling models.
The quantitative indices system for Chinese textual analysis focuses on the linguistic features of a text. It consists of five levels: descriptive features, Chinese characters, words, sentences, and discourse, with 12, 32, 67, 60, and 1 indicators, respectively. In total, the indices system has more than 170 indicators. Two applications are used as examples to demonstrate the practical value of the indices system and the automated tool. In the first application, the texts of primary school Chinese textbooks published by the People's Education Press are analyzed statistically and comprehensively across the five levels, including characters, words, and sentences. In the second application, the indices system is combined with machine learning algorithms to build text leveling models for Chinese texts. The prediction accuracy of the text leveling models is about 0.90.
According to the results, feature values of the textbook texts such as character and word counts, Chinese character complexity, vocabulary difficulty, and syntactic complexity show an upward trend as the grade increases, indicating that the texts are generally arranged from simple to difficult. However, there is still room for improvement in the use of characters and words. For example, many uncommon characters and words appear in the texts for lower grades, and the content of the radical table has little connection with the characters used in the texts. These findings have been used in related teaching research by Jingsong Yu's research group at Peking University. Furthermore, the text leveling models can predict the reading level of a Chinese text with reference to standard textbooks, and can therefore be used for the automated classification and selection of texts at different reading levels. |
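To give a concrete sense of what a quantitative text indicator looks like, the sketch below computes a handful of simple character-level and sentence-level measures for a Chinese text. It is only an illustration under stated assumptions: the function name `describe`, the five indicators chosen, and the sentence-splitting rule are hypothetical and far simpler than the 170-plus indicators described in the abstract, which also rely on word segmentation and language resources not shown here.

```python
import re


def describe(text):
    """Compute a few illustrative indicators for a Chinese text.

    Hypothetical sketch: sentences are split on common end punctuation,
    and only characters in the basic CJK Unified Ideographs block
    (U+4E00 to U+9FFF) are counted as Chinese characters.
    """
    sentences = [s for s in re.split(r"[。！？!?]", text) if s.strip()]
    chars = [c for c in text if "\u4e00" <= c <= "\u9fff"]
    distinct = set(chars)
    return {
        "char_count": len(chars),
        "distinct_chars": len(distinct),
        # Character type-token ratio: distinct characters / all characters.
        "char_ttr": len(distinct) / len(chars) if chars else 0.0,
        "sentence_count": len(sentences),
        # Mean number of Chinese characters per sentence.
        "mean_sentence_length": len(chars) / len(sentences) if sentences else 0.0,
    }


sample = "我爱北京。我爱中国！"
stats = describe(sample)
print(stats)
```

Feature dictionaries of this kind, computed per text, are the sort of input one could pass to an off-the-shelf classifier to build a text leveling model; the abstract does not specify which learning algorithm was used.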
论文摘要: | 人类社会进入21世纪以来,科技的飞速发展带动了各个领域的不断进步,与此同时,人们在各个专业化领域的需求也在不断探索和前进。因此,传统的专业领域工具已经无法满足人们的需要。对于医学领域人士(医生、医学生、医学爱好者等)来说,一款高效的医学英语电子词典是他们工作、学习必不可少的好帮手。 然而,纵观过往的医学英语电子词典产品,大多存在以下三个问题:第一,依附于通用型词典之上,对于医学领域的专业广度和深度拓展不够。词汇和相关内容的搜索展示仍停留在通用词水平,无法为专业人士提供符合他们专业程度的词汇查阅需求。第二,词典功能单一。当前市面上流行的许多医学电子词典都将功能局限在“查词”上,无法为专业医学人士日常所需的文献阅读和论文撰写提供较为便利的解决方案。第三,无法辨别医学用户的专业偏好。目前,多数常用电子词典对于医学用户的专业偏好并没有区分,从而无法从专业的科室角度为用户提供高效的查词体验,也无法为用户提供个性化的搜索和展示。 为了解决上面三个问题,本研究首先进行了医学英语词汇的构成分析,研究英语词汇的特点,提升词典设计的合理性。然后从医学英语词汇联结的角度入手,研究了UMLS和SNOMED CT两大医学界较为权威的英语词汇系统,从中提取医学英语词汇之间的相互联系。并且,为了解决现存医学数据库中词汇科室不明的问题,本研究采取机器算法与人工校对相结合的方式,利用不同科室的核心词汇表,加上提取的词汇联系网络,将获得的医学词汇数据进行科室分类。最后,本研究结合个性化产品设计的思路,辅之以针对医学用户日常所需的便利功能,如写作助手和阅读助手,为医学领域的专业用户设计了一款个性化的医学英语电子词典。 为了验证本研究设计的词典的实用性,本研究邀请了来自浙江大学医学院的专家与同学参与了有效性验证。实验表明,本研究设计的电子词典可以有效提高用户查词的效率。此外,词典系统中的辅助功能,如写作、阅读助手等,也帮助用户提高了学习与工作的效率。本研究设计的医学英语电子词典有效地解决了当前医学电子词典中存在的专业化和个性化问题,帮助用户提高了学习和工作效率,对医学英语电子词典的研究与发展有一定参考价值。 |
外文摘要: | Since the beginning of the 21st century, the rapid development of science and technology has driven progress in many fields. At the same time, people's needs in specialized fields keep growing, and traditional professional tools can no longer meet them. For people in the medical field (doctors, medical students, medical enthusiasts, etc.), an efficient electronic medical English dictionary is an essential aid for work and study. However, most existing electronic medical English dictionaries share three problems. First, they are built on top of general-purpose dictionaries and do not extend far enough in the breadth and depth of the medical field; the search and display of words and related content remain at the level of general vocabulary and cannot meet the lookup needs of professionals. Second, their functions are limited. Many popular electronic medical dictionaries are confined to word lookup and offer no convenient support for the literature reading and paper writing that medical professionals do every day. Third, they cannot identify the specialty preferences of medical users. Most commonly used electronic dictionaries do not distinguish between users' specialties, so they cannot provide an efficient lookup experience from the perspective of a clinical department, nor personalized search and display. To solve these three problems, this study first analyzes the composition and characteristics of medical English vocabulary to make the dictionary design more sound. Then, from the perspective of the connections between medical English words, it studies UMLS and SNOMED CT, two authoritative English vocabulary systems in medicine, and extracts the relationships between medical English words from them.
Moreover, to solve the problem that words in existing medical databases are not assigned to departments, this study combines machine algorithms with manual proofreading, using the core vocabulary lists of 18 departments together with the extracted network of word relationships to classify the collected medical vocabulary by department. Finally, drawing on the idea of personalized product design, this study designs a personalized electronic medical English dictionary for professional users in the medical field, supplemented by convenient functions for their daily needs, such as a writing assistant and a reading assistant. To verify the practicality of the dictionary, experts and students from the Zhejiang University School of Medicine were invited to take part in a validation. The experiments show that the dictionary can improve the efficiency of word lookup. In addition, auxiliary functions such as the writing and reading assistants help users study and work more efficiently. The dictionary designed in this study addresses the problems of specialization and personalization in current electronic medical dictionaries and helps users study and work efficiently, and it has some reference value for the further research and development of medical English dictionaries. |
论文摘要: | 词汇教学是英语教学的重要组成部分。在词汇教学中,教学材料直接影响到学生的学习效果。经过调研,发现虽然目前可使用的英语词汇学习资料丰富多样,但是内容分散,良莠不齐,无法完全满足教师和学生的资源需求。如果能够发挥计算机和互联网技术的优势,对词汇学习材料进行采集和整合,将对英语词汇教学质量的提升有所助益。 为整合英语词汇教学中的资源,本研究广泛地收集词汇学习材料,使用计算机技术对文本进行加工,根据一定逻辑对素材进行组织,建设英语词汇学习知识库。本文研究的关键问题有知识库建设的方法和思路、大规模资源汇总、资源加工存储的规范标准和自动处理程序的设计与实现。 本研究从互联网上采集了大量词汇学习资源,进行汇总,建设资源库。之后在文献分析的基础上构建词汇知识模型,并以此作为整合资源的逻辑依据。词汇知识模型从习得过程和知识内容两个角度出发组织词汇知识,响应了学习者在习得词汇过程中词汇知识需求的动态变化。模型以词汇习得过程为框架,词汇知识为内容,将词汇习得过程分为感知、理解、联想和输出四个阶段,让知识内容聚合到音位、形式、语境、语义、搭配、词源、产出和主题八个维度下。在建设知识库时,根据词汇知识模型设计知识库的结构和资源加工的规范标准,对例句和搭配的自动抽取等关键问题进行研究。最终,本研究整合了通识英语教学中的词汇学习资源,解决了知识库建设中的关键问题,开发了自动处理程序,实现了一定规模的词汇学习知识库。 对于整合后的词汇学习资源,本文进行了客观指标评估、准确率抽样检查和场景检查,证明了知识库能够满足教师和学生的资源需求,帮助改善词汇教学的效果。 本研究的创新之处包括以下三点:1)在文献研究的基础上构建词汇知识模型,根据知识模型从各类词汇学习资料中针对性地提取内容,然后进行整合,保证了内容的丰富性和资源间的关联性,去掉了重复性和低质量的学习材料,是一种语言学习资源建设的新思路;2)知识库将学习资源聚合到不同的维度下,实现了资源的模块化,在应用知识库时,可以根据需求灵活地调用和组织内容;3)本研究在加工资源时,对自动处理程序中的关键问题进行研究,使用自然语言处理等计算机技术提高资源建设的效率。 |
外文摘要: | Vocabulary teaching is an important part of English language teaching, and the teaching materials directly influence how well students learn. A survey of teachers and students found that although the English vocabulary learning resources currently available are rich and varied, their content is scattered and uneven in quality, so they cannot fully meet the resource needs of teachers and students. If vocabulary learning materials were collected and integrated with computer and internet technologies, the quality of English vocabulary teaching could be improved. To integrate resources for English vocabulary teaching, this study collected vocabulary learning materials widely, processed the texts with computer technology, organized the materials according to a defined logic, and built an English vocabulary learning knowledge base. The key issues in this study are the methods and ideas of knowledge base construction, the gathering of large-scale resources, the standards for resource processing and storage, and the design and implementation of automatic processing programs. This study collected a large amount of vocabulary learning resources from the internet and built a resource library from them. Then, on the basis of a literature analysis, the author constructed a vocabulary knowledge model as the logical basis for integrating resources. The model organizes vocabulary knowledge from the perspectives of acquisition process and knowledge content, responding to the dynamic changes in learners' vocabulary knowledge needs as they acquire vocabulary. The model takes the vocabulary acquisition process as its framework and vocabulary knowledge as its content. The vocabulary acquisition process is divided into four stages: perception, understanding, association, and output.
The knowledge content is aggregated into eight dimensions: phoneme, word form, context, semantics, collocation, etymology, production, and topic. When constructing the knowledge base, the structure of the knowledge base and the standards for resource processing were designed according to the vocabulary knowledge model, and key issues such as the automatic extraction of example sentences and collocations were studied. In the end, this study integrated the vocabulary learning resources used in general English teaching, solved the key problems in knowledge base construction, developed automatic processing programs, and built a vocabulary learning knowledge base of a certain scale. The integrated resources were assessed through objective index evaluation, sampled accuracy checks, and scenario checks, which showed that the knowledge base can meet the resource needs of teachers and students and help improve vocabulary teaching. The innovations of the study are threefold: 1) a vocabulary knowledge model is built on the basis of literature research, and content is extracted selectively from various vocabulary learning materials according to the model and then integrated, which ensures the richness and interconnection of the resources and removes repetitive and low-quality materials; this is a new approach to building language learning resources; 2) the knowledge base aggregates learning resources into different dimensions and so makes the resources modular, so that users or applications can flexibly call and organize content according to their needs; 3) in processing the resources, this study examined the key issues in the automatic processing programs and used computer technologies such as natural language processing to improve the efficiency of resource construction. |
论文摘要: | 随着我国涉外法务日益频繁,法律英语的重要性毋庸置疑。想学好法律英语需要先了解它的特殊性,根据前人研究,法律英语的特殊性首要体现在法律英语词汇的特殊性。本研究期望借助移动学习的优势,设计一个法律英语词汇学习系统。 经调研,目前的法律英语词汇教学在学习效率方面存在以下问题未能得到有效解决。第一,目标词汇不成体系,未能建立词汇的属性标注和关联关系。学习资源以纸质教材为主,存在学习内容有限等问题;相关词汇学习系统仅将纸质教材中的词表电子化,未能建立词汇的属性标注和关联关系。第二,词汇学习深度不足,没有突出重难点。一般英语学习的基本词汇信息未能满足法律英语词汇学习的深度需求;没有重点设计近义辨析这一教学难点。第三,词汇考查内容不足,复现形式单一。考查内容只包括词汇的基本信息,缺乏法律英语词汇的特殊信息;仅以测试实现词汇复现,形式单一。第四,一般英语的词汇推荐策略不完全适用于法律英语词汇。词汇优先级计算忽视了法律英语词汇高频使用特别术语的特征;近义词学习未能实现动态推荐。 为解决上述问题,本研究在相关词汇教学理论和二语习得理论的指导下,借助移动学习的优势,设计了法律英语词汇学习系统。第一,多维度整合学习资源确立目标词汇资源库,对词汇的词源和部门法等属性信息进行标注、建立近义词汇的关联关系。第二,根据法律英语学习的深度需求,增加构成要件等学习内容,重点设计近义辨析模块。第三,设计多种复习题型考查法律英语词汇的基本信息和特殊信息,多种法律语境复现法律英语词汇。第四,结合法律英语词汇特征和学习者的学习情况,融入初始熟悉度因子综合计算词汇优先级,优化近义词推荐策略。 对于资源建设和词汇推荐策略,本研究邀请专家通过访谈的形式验证了设计的合理性和有效性。对于功能设计部分,通过对学习者进行问卷调查和深度访谈的方式验证了本系统在认知负荷相似的情况下,在学习目标达成方面优于现有的词汇学习系统。 本研究设计的法律英语词汇学习系统抓住了法律英语词汇的特征和教学重难点,发挥移动学习的优势,克服了纸质资源的局限性,作为课堂教学的有益补充,帮助学习者高效学习法律英语词汇。 |
外文摘要: | With the increase in China's foreign-related legal affairs, legal English plays an increasingly important role. To learn legal English well, students first need to understand its special character, which is reflected mainly in its vocabulary. This study aims to design a legal English vocabulary learning system that takes advantage of mobile learning. With regard to learning efficiency, several problems in legal English vocabulary teaching remain unsolved. First, learning resources have not been integrated effectively. The vocabulary in paper textbooks is limited, and current systems neither label the attributes of words nor establish associations between synonyms. Second, current systems fail to meet the depth required for learning legal English vocabulary and do not treat the discrimination of synonyms as a key point. Third, the review content of current systems cannot meet the review needs of students with different learning objectives, and review relies only on tests, which students tend to find boring. Fourth, the vocabulary recommendation strategy of current systems is not suitable for legal English vocabulary. The difficulty grading ignores the influence of the learner's previous English level, and the recommendation of synonyms cannot be adjusted dynamically. This study tries to solve these problems in the following ways. First, it integrates learning resources along multiple dimensions and labels the attributes of legal English vocabulary to meet the needs of different students at the learning portal. Second, it selects learning content according to the characteristics of legal English vocabulary and highlights the difficult points of legal English vocabulary teaching. Third, it designs a variety of question types to meet learners' different review needs and has learned vocabulary recur in different legal contexts to stimulate students' interest.
Fourth, it takes students' prior English level into account to produce an effective learning sequence for legal English vocabulary and different study strategies for synonyms. Experts were invited to evaluate the resource integration and vocabulary recommendation parts through interviews, which verified the rationality and effectiveness of the design. In addition, a study of legal English vocabulary learners using questionnaires and in-depth interviews showed that, at a similar cognitive load, the system designed in this study is better than existing systems at helping learners achieve their learning goals. The legal English vocabulary learning system designed in this study captures the characteristics of legal English vocabulary and the key and difficult points of teaching it, overcomes the limitations of paper resources by drawing on the advantages of mobile learning, and serves as a useful supplement to classroom teaching, helping students learn legal English vocabulary efficiently. |
论文摘要: | 信息时代,合作探究教学模式能够给予学生更充分的思考空间和更丰富的思维训练,逐渐成为教育教学中的热门研究话题。在合作探究教学中,主要由学生小组合作展开讨论学习,教师辅助进行指导。众多教育研究者提出合作探究教学是促进学生思辨能力的有效方式。 然而,该模式在实际教学中面临了一些问题。笔者作为《翻译技术原理与实践》课程的助教,发现目前的合作探究教学存在三个问题:(1)如果缺少学生对彼此研究内容的评判或教师对学生发言观点的评价,难以培养学生思维的严谨性;(2)学生不能积极给予同伴建议或教师未能及时提供启发,将不利于训练学生思维灵活性;(3)若学生之间没有情绪表达沟通或教师未能提供积极的鼓励,容易导致学生的思维自主性难以被触动。 针对上述问题,笔者阐述了在现有合作探究教学模式中引入六顶思考帽理论的必要性,明确了教学研究思路和研究方法,创新性地提出了基于六顶思考帽理论的合作探究教学模式。新模式主要包括:学生借助思考帽讨论流程强化小组讨论中的同伴互助作用,教师利用思考帽进行评价反馈增加小组讨论中的教师辅助引导作用。 为验证新模式的有效性,笔者于2018年3月至6月对北京大学外国语学院的14名学生开展了教学实验,实验类型为单组前后测,前后测数据来源为学生的线上讨论记录和问卷调查,并通过定量分析和定性分析对教学实验前后测数据进行对比,验证基于思考帽理论的合作探究教学方法在实际课堂应用中的效果。 实验结果表明,学生的思辨能力在灵活性、严谨性和自主性方面有提高,在一定程度上证明基于思考帽理论的合作探究教学对学生思辨能力提升有积极效果。该研究对合作探究教学模式进行了探索,为其进一步优化提供了新思路。 |
论文摘要: | 随着全球化的深入,英语写作变得愈发重要。学习者对于提升写作能力的诉求也越来越强烈,由于传统写作教学对写作成果的重视程度远大于其写作过程,写作能力的提升也并非一蹴而就,因此写作成为了众多学生英语学习的弱项。近年来随着在线学习的普及以及过程写作理论、协作学习等理论的推广,越来越多的学者开始重视写作过程,提出了群组讨论和同伴互评等协作学习方式,并将其应用到各类在线英语写作教学和英语写作学习系统中。然而,现有的在线英语写作教学、学习系统并未从英语写作学习的实践出发,群组讨论阶段存在讨论跑题、积极性不足等问题;同伴反馈阶段存在互评质量差,参与意愿低等问题,学生的写作能力提升缓慢。 本文以二语写作理论、协作学习理论、激励理论及结构化研讨方法为依据,分析了现有写作教学模式和竞品在社交化模块上的优势与不足。针对其中存在的问题以及大学生群体的写作需求,结合社交化模块的评价标准,对自适应英语写作系统的社交化模块进行了设计。通过建立结构化的讨论社区,提高讨论的质量,培养学生的思维能力,降低学生在写作过程当中的无助感和焦虑感。通过建立结构化的同伴互评方式,帮助学生建立批改思路,获得多元化、高价值的反馈意见。通过奖励等机制的设计,提升学生在学习过程中的参与感和满足感,激发学生的参与意愿。 由于系统中涉及的社交化机制较多,本文在此不一一详述,拣选两个具有代表性的功能——创意广场和同伴互评进行了研究和探讨,并针对这两个功能提出了不同的设计方案。此外还对系统中其他模块的设计进行了简单的介绍。 基于以上设计,本研究选取了南京某高校20名非英语专业学生进行了教学实验。通过实验观察、数据分析、深度访谈、调查问卷等方式,论证了创意广场和同伴互评设计在提高群组讨论质量,降低学生无助感和焦虑感,培养学生思维能力,提升讨论积极性以及在提高同伴互评的质量,提升学生评判性思维能力、认知能力以及参与意愿等方面大有裨益,并筛选出了这两个功能的最优方案。 本研究中自适应英语写作系统的社交化设计弥补了在线英语写作协作化学习方面的不足。在群组讨论阶段,讨论质量及学生们的思维能力均有所提高,缓解了学生写作时的焦虑和畏难情绪。在同伴互评阶段,提升了互评的质量,满足了学生对于多元化,高价值反馈的需求,提升了学生的能力,激发其参与反馈的意愿,补足了现有英语写作协作化学习设计的短板,对英语写作移动教学和协作化学习有一定的参考价值。 |
外文摘要: | As globalization deepens, English writing becomes ever more important, and learners' desire to improve their writing skills grows stronger. Traditional writing instruction, however, emphasizes results over process, and writing skills cannot be cultivated overnight, so writing remains a weak point for many students. In recent years, as online learning, the process theory of composition, and collaborative learning theory have gained popularity, more scholars have paid attention to the writing process. They have proposed collaborative learning approaches such as group discussion and peer review and applied them in various online English writing systems. However, existing online English writing systems are not grounded in the practice of learning to write. Group discussions go off topic or lack motivation, and peer review suffers from low quality and low willingness to participate, so students' writing ability improves slowly. Drawing on L2 writing theory, collaborative learning theory, motivation theory, and structured discussion methods, this thesis analyzes the strengths and weaknesses of existing writing teaching models and competing products with respect to their social modules. Targeting the problems identified and the writing needs of college students, and applying evaluation criteria for social modules, the author designs the social modules of an adaptive English writing system. Structured discussion communities are intended to improve the quality of discussion, train students' thinking, and reduce their helplessness and anxiety in the writing process. Structured peer review and intelligent recommendation of reviewing peers help students develop an approach to correction and obtain diversified and valuable feedback. In addition, rewards and other mechanisms increase students' sense of involvement and satisfaction and motivate them to participate.
Because the system involves many social mechanisms, this thesis does not cover all of them and instead selects two representative functions for study, the creative square and peer review, proposing different design options for each. It also briefly introduces the design of the other modules in the system. Based on this design, 20 non-English majors from a university in Nanjing took part in a teaching experiment. Through experimental observation, data analysis, in-depth interviews, and questionnaires, the author shows that the creative square and peer review improve the quality of group discussion and reduce students' helplessness and anxiety. They also help cultivate students' critical thinking, motivate students to discuss, and raise the quality of peer review and students' willingness to participate. The best design option for each function was also identified. The social design of the adaptive English writing system improved the collaborative learning process: the quality of discussion improved, students' thinking developed, and anxiety in the writing process was relieved. In the peer review stage, the quality of mutual evaluation improved, meeting students' need for diversified and valuable feedback and increasing their willingness to give feedback. The design makes up for a weak point in existing collaborative learning designs for English writing and has some reference value for mobile teaching of English writing and collaborative learning.
文摘: | 与普通英语词汇学习相比,托福词汇学习的特征主要表现为学科性强、学习量大和准备周期短。部分学习者对托福词汇学习存在误解,认为只需了解词汇意思即可,但根据托福考试要求,其中一部分核心高频词需要被转换为积极词汇,即需要在听力、写作和口语中熟练运用。调研表明,已有的背托福词汇APP或微信小程序并不注重积极词汇的训练,使得学习者对词汇的理解仅停留在“阅读”层面。 |
文摘(外文): | Compared with general English vocabulary building, TOEFL vocabulary building is characterized by its coverage of a variety of disciplines, a heavy workload and a comparatively short preparation period. Some students misunderstand TOEFL vocabulary building, falsely believing that knowing the Chinese meaning of the words is sufficient. Nevertheless, according to the TOEFL test requirements, some core high-frequency words must become active vocabulary, that is, students must be able to use them proficiently in listening, writing and speaking. A survey shows that most existing TOEFL vocabulary apps and WeChat mini programs do not emphasize the training of active vocabulary, so learners' grasp of the words stays at the reading level. |
论文摘要: | 目前,国内翻译研究多集中于译者、译作和翻译策略等方面,审校相关研究相对较少,而对中文版图书的出版审校研究则更为罕见。笔者在翻译《培养小极客》(Bringing Up Geeks)一书的过程中,深入了解了出版社的审校流程,并发现目前部分出版社在出版中文版图书时,会在原有的内部专业审校基础上,增加目标读者审校的环节,从而弥补专业审校的不足,提高中文版图书编校质量,提升读者对中文版图书的阅读体验。 因此,本文基于对《培养小极客》的翻译和审校过程中出现的具体案例,以目标读者审校为研究对象,以明确目标读者审校的价值为研究目的,首先通过理论与现实结合的方法对审校者多样化与读者“反馈提前化”的可能性进行了论证,并从理论角度分析了目标读者在审校中可能起到的作用,从而明确了目标读者参与中文版图书审校的可行性;其次从审校专业能力、审校目的、审校标准或规范和审校方式四个维度对专业编校人员和目标读者进行对比分析,并通过对《培养小极客》中具体案例的分析,详细探讨了目标读者审校在句义不明、词义不明、文化差异和表达优化四类问题上所发挥的作用;最后结合对专业审校局限性的分析,明确了目标读者审校的作用,即(1)帮助解决专业审校忽视的语义不明问题,增强译本的可理解性;(2)润色文本表达,提升译本的可读性;(3)帮助解决专业审校忽视的文化差异问题,并提出对应的多样化解决方案,提升译本的丰富性。 本文证明,在中文版图书出版过程中,目标读者审校是对专业审校的一个有效补充,将其纳入出版社的审校流程,在弥补专业审校的局限性和盲点、提高图书编校质量、增强图书对读者的吸引力和说服力等方面均有一定意义。 |
外文摘要: | Currently, domestic translation studies mostly focus on translators, translated works, and translation strategies, while research on the review and publishing process is relatively rare. After finishing the translation of Bringing Up Geeks written by American writer Marybeth Hicks, the author of this paper paid closer attention to the review and publishing process and found that the publisher’s review contains not only the internal editor’s review but also an external target reader’s review. The latter is meant to make up for the limitations and shed light on the blind spots of the editor’s review, improve the quality of the translation, and enhance readers’ reading experience. Combining qualitative analysis with quantitative analysis, this paper analyzes the differences between the editor’s review and the target reader’s review and systematically clarifies the functions of the target reader’s review based on the specific problems occurring during the review and publishing process of Bringing Up Geeks. The analysis shows that the editor’s review and the target reader’s review differ in skills, purposes, standards and methods. The research also finds that the target reader’s review can complement the editor’s review on four kinds of problems occurring in the translation: ambiguity of sentences, ambiguity of words, cultural differences and expression optimization, even though the target reader may present wrong or unnecessary suggestions due to subjective factors. In conclusion, this paper advises that translation revision is a critical process in the publication of translated books. The target reader’s review, as a relatively new mode of review among domestic publishers, presents interesting interactions between the translator, editor and reader. It is an effective supplement to the editor’s review and therefore deserves more attention from both publishers and researchers. |
论文摘要: | 翻译不仅是语言的解码和编码过程,也是跨文化交际的过程。因而回译对原文的还原和回归,不仅是语言上的还原,也包含文化的还原。本文基于美国作者约书亚•葛以嘉(Joshua Goldstein)《伶界大王:1870-1937年京剧再造时期的演员与观众》(Drama Kings: Players and Publics in the Re-creation of Peking Opera,1870-1937)一书的翻译实践,探讨了原文中存在的文化英译问题,以及在回译时针对不同内容所采取的还原策略。 在对回译研究、文化还原和京剧翻译研究做了简要回顾后,笔者首先分析作者在写作过程中对京剧文化的英译特点,探讨其带来的翻译难点。笔者发现,作者在英译京剧文化术语和专有名词时大量地使用音译和拼音注释。笔者以为这一翻译方式较好地保留了中国文化的异域特色,且相应的解释说明有助于读者了解文化概念的内涵,但同时作者的英译也存在不够准确或不够恰当的情况。其次笔者指出了原文术语模糊翻译、一词多义、人名音译错误和引用来源多元等现象给回译实践造成的难点。 对于本书第二章已有的译文,笔者分析了其存在的问题,主要是词汇还原不准确、还原有误和引文未实现至译的情况。针对这些问题,笔者皆给出了自己的思考及认为更恰当的译法。接下来笔者总结归纳了在回译实践中针对不同情况所采取的还原策略,主要从对词汇和引文的还原两个角度出发。对词汇的还原方法有经过仔细详尽的查证后给出译法、省去不译原文中对于中文读者来说冗余的解释、通过添加文内注释或脚注的形式增添必要的解释说明或背景知识以增强译文的可读性。针对引文的还原,则主要依据是否找到引语原文和引语与原文的吻合程度来进行处理,主要分为按引文原文还原、添加注释说明作者错译现象、笔者自译等方法。 |
外文摘要: | Translation is not only a process of language decoding and encoding, but also a process of cross-cultural communication. In other words, translation is not only about bilingual transformation, but also about cultural exchange. Therefore, the restoration to the original text in the process of back-translation is not only of language but also of culture. Based on the translation project of Drama Kings: Players and Publics in the Re-creation of Peking Opera, 1870-1937 written by Joshua Goldstein, this paper probes into the translation problems of Peking Opera and the strategies adopted for different cultural contents in the process of restoration. This paper first summarizes the cultural content involved in the source text, which can be divided into two categories: vocabulary and citation. Then it analyzes the English translation of cultural content in the source text and points out the difficulties it poses to the back-translation, as they require the translator to handle carefully according to different circumstances. The paper then analyzes the problems in the Chinese translation of the second chapter of the source text, including inaccurate lexical restoration, incorrect restoration and imprecise back-translation of citations. In view of these problems, this paper puts forward appropriate translations. For the two categories of cultural content in the source text, the present paper proposes corresponding restoration strategies. Regarding vocabulary, there are three methods, namely, doing detailed research, omitting redundant information and annotating uncommon cultural concepts. As for citations, if sources can be found, copy the source; if not, translate them on one’s own. This may involve translating in the classical style of Chinese. In this case, Baidu’s classical-Chinese machine translation, still immature but being the only one of its kind, may be applied, with the result balanced by the translator. |
论文语种: | chi |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师1姓名: | |
导师1单位: | 软件与微电子学院 |
论文答辩日期: | 2019-05-27 |
外文题名: | Strategies for Prototype Shift in English-Chinese Translation: A Case Study of Word by Word |
外文关键词: | Translation methods English-Chinese translation Prototype Shift |
论文摘要: | 本次翻译实践基于美国作家柯丽·斯坦珀所著的《推和敲》(Word By Word)一书。这是一本描绘词典编撰幕后工作的著作,类似的题材在国内外都属小众。笔者在试译此书的过程中发现书中表达看似简单却并不易懂,如果按照传统的直译或是意译往往无法准确传达原文语义,需采用原型转移法进行翻译。迄今为止,原型转移法在翻译研究和实践中的运用不多,学界也还未大量开展这方面的研究,这使得笔者对原型转移法产生兴趣,并以此作为研究主题。 在展开翻译实践前,笔者利用语料分析工具对原文语言进行分析,探讨其中的翻译难点。笔者发现,首先,作者在行文中大量地采用低频词、生僻词,不少词汇不仅没有现成对应的中文翻译,更有的难寻英文背景信息。其次,作者还会在旧词的基础上创造新义,而且不忌讳粗俗语的使用。基于此,并结合大量现有的翻译实例,笔者提出在使用原型转移法时应遵循语境原则、从主原则、等效原则,采取增减达义法、虚实互换法、巧用流行语和谐音转义法的翻译策略,以保证从语义和文化层面准确传达原文的意思,保证译文质量。 本研究的意义主要体现在以下三个方面:(1)引入认知科学领域的原型理论,并在此基础上进一步探讨原型转移翻译法,为翻译方法提供一个新的视角和途径,克服传统的对翻译方法非直译便意译的认知模式;(2)分析探究原型转移翻译法适用的场景,有助于打破当前仅限于品牌名、电影名等翻译方向的困境;(3)总结整理三大翻译原则和四大翻译策略,能比较有效地解决各类翻译问题。 总之,原型理论可为翻译提供一个认识翻译的新角度,为一些翻译议题提出新的见解,对当前的翻译实践和理论研究都具有十分积极的意义。 |
外文摘要: | This paper discusses translation strategies for prototype shift in English-Chinese translation based on the translation practice of Kory Stamper’s book Word by Word. It is a nonfiction work, explicating the lexicographic details and dilemmas encountered by Stamper as an associate editor at Merriam-Webster. While analyzing the book, this paper finds that it contains many simple words with obscure meanings, which, if translated in traditional ways, for example, word for word or sense for sense, would not convey the correct meaning of the source text. For this reason, this paper proposes to adopt prototype shift, a concept that originates from cognitive science, to solve the problem. First of all, through a literature review, this paper finds that the prototype shift strategy has not been widely adopted in translation practice, nor has it been extensively explored by researchers. Secondly, analysis of existing translation cases leads to three translation principles, emphasizing the linguistic context, the cultural background and the translation effects, and four translation techniques. Finally, this paper compares the prototype shift strategy with some easily confused concepts, for instance, the conversion approach and free translation. The significance of this study is mainly reflected in the following three aspects: (1) introducing prototype theory to provide a new perspective and approach for translation; (2) analyzing and exploring the applicable situations for prototype shift; (3) proposing three principles and four techniques to realize prototype shift effectively. |
分类号: | H059 |
论文总页数: | 35 |
参考文献总数: | 29 |
公开日期: | 2019-06-04 |
题名: | 针对英语词汇石化问题的自适应词块系统研究与设计 |
姓名: | |
学号: | 1601210744 |
论文语种: | chi |
专业: | |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师1姓名: | |
导师1单位: | 软件与微电子学院 |
导师2姓名: | |
论文答辩日期: | 2019-05-27 |
外文题名: | Research and Design of An Adaptive Chunk Learning System for the Ease of English Vocabulary Fossilization |
外文关键词: | Vocabulary fossilization Lexical approach Adaptive learning Productive exercise System design |
论文摘要: | 语言石化(Fossilization)是中介语(Interlanguage)的特征之一,是指二语学习出现停滞不前甚至倒退的现象。其中,语音、词汇、语法等层面都可能出现石化,而防止或缓解词汇石化现象中的词汇能力石化问题是本研究的核心。大量研究表明,词块(Lexical Chunk)学习能够改善词汇能力石化问题,但传统的词块教学仍有以下三方面的局限:一、无序词块的学习内容忽视了词块间的纵聚合与横组合的语义关系,学习者难以构建词义网络;二、重视记忆过程,缺少产出练习;三、一致的学习内容和方法难以满足学习者的个性化需求。这些局限性容易给学习者造成已会运用词汇的假象,进一步导致词汇能力石化问题。 为了改善词汇能力石化问题,本文根据词汇石化、语义网络、词块教学法以及二语习得其他相关理论和自适应学习理念,结合对现有英语词汇学习工具和中国大学生词汇能力石化现状分析,设计了一款针对防止或缓解词汇能力石化问题的自适应词块学习系统,并完成了系统的原型设计。其核心思想如下:一、采用词块教学法培养学习者的词块意识和使用能力,增加词汇语境信息,避免词义直接对等,产生母语负迁移;二、通过学习概念和搭配掌握词块间的关联性,构建并激活学习者的词义网络,增加表达的多样性和准确性;三、设计不同任务复杂度的练习题型,实现对词块从识记到产出的闯关式进阶;四、设置不同学习阶段和教学反馈的自适应规则,让学习内容具有针对性并引导学习者走出词汇使用舒适区,以此来避免用词惰性,改善词汇能力石化问题。 本研究在54名本科一年级非英语专业的学生中进行了2周的教学实验。其中10人为先导小组,以确定学习材料及实验细节;其余44人随机分为实验组和对照组各22人,前测证明两组成绩不具有显著性。实验组使用自适应词块学习系统,对照组采用传统的无序词块词表学习,两组均保证学习总量和内容完全相同。实验结束后进行后测,并在10天后进行延时测试。另外,还通过问卷调查和访谈对学习效果和满意度进行了补充验证。研究结果表明:自适应词块学习系统能够提高学习者词汇表达的多样性和准确性,并在缓解词汇能力石化上的保持效果和满意度方面都要优于传统词块学习法。 本研究设计的自适应词块学习系统一方面能够有效缓解词汇能力石化问题,提高词汇表达的多样性和准确性;另一方面丰富了词块教学法的研究成果,对课堂教学和英语词汇学习相关工具的设计具有一定的借鉴意义。 |
外文摘要: | Fossilization is one of the main features of interlanguage; it refers to stagnation or even regression in the process of L2 learning. Fossilization may occur in pronunciation, vocabulary and grammar, and preventing and easing vocabulary fossilization is the core issue of this study. A large number of studies have shown that the lexical approach can ease the problem of vocabulary fossilization, but traditional lexical chunk teaching still has the following three limitations: First, unordered chunk learning materials ignore the paradigmatic and syntagmatic semantic relations among chunks, so that learners can hardly build their semantic networks. Second, it emphasizes the memorization process but lacks productive exercises. Third, uniform learning materials and methods can hardly meet the individual needs of learners. These problems can aggravate vocabulary fossilization because learners are likely to be unable to use appropriate and diversified words in an actual context. In order to make up for the above deficiencies, in light of the theories of language fossilization, semantic networks, the lexical approach, other related theories of second language acquisition and the idea of adaptive learning, this paper designs an adaptive lexical chunk learning system for easing vocabulary fossilization, based on an analysis of the status quo of learners’ vocabulary fossilization and of existing English vocabulary learning tools. This study completed the prototype design of the system. The core ideas of the system are as follows: First, cultivating learners' lexical chunk awareness and competence by adopting the lexical approach, so that lexical context information can be enriched and the direct equivalence of word meanings can be avoided. 
Second, grasping the interrelationships among lexical chunks by acquiring concepts and collocations, so that the semantic networks can be constructed and activated, and the diversity and accuracy of expressions can be enhanced. Third, moving learners from memorization of chunks to output through exercise types of differing task complexity. Fourth, setting adaptive rules for learning stages and feedback, so as to make learning more targeted and guide learners out of their comfort zone in vocabulary use. A two-week teaching experiment was conducted among 54 first-year undergraduates who were not English majors. Among the participants, 10 formed a pilot group used to determine the learning materials and experimental details. The remaining 44 were randomly divided into an experimental group and a control group of 22 participants each. The pre-test showed no statistically significant difference between the scores of the two groups. The experimental group acquired chunks by using the adaptive lexical chunk learning system, and the control group used traditional unordered lexical chunk lists. The total amount and the content of the learning material were the same for both groups. A post-test was performed at the end of the experiment, and a delayed test was conducted 10 days after the post-test. In addition, the study used questionnaires and interviews to examine the learning effect of, and satisfaction with, the different learning approaches. The research results show that the adaptive lexical chunk learning system is superior to the traditional lexical chunk learning method in easing vocabulary fossilization, especially in the diversity and accuracy of learners' expressions. It is also superior in retention and satisfaction. 
The adaptive lexical chunk learning system designed in this study, on the one hand, can effectively alleviate the problem of vocabulary fossilization, especially by improving the accuracy and diversity of expressions. On the other hand, it enriches research on the lexical approach, and offers a reference for classroom instruction and the design of vocabulary acquisition tools. |
分类号: | TP3 |
论文总页数: | 107 |
参考文献总数: | 77 |
参考文献列表: |
[1] 加斯 S,塞林克 L.第二语言习得[M].赵杨,译.北京:北京大学出版社. 2011.
[2] 蔡基刚.关于我国大学英语教学重新定位的思考[J].外语教学与研究, 2010, 42(4): 306-8. [3] 郑秋萍.心理语言学视角下的二语词汇石化现象分析与防治策略[J].外语研究,2014(06):59-62. [4] 吴旭东,陈晓庆.中国英语学生课堂环境下词汇能力的发展[J].现代外语,2000(04):349-360. [5] Tinkham T.The effect of semantic clustering on the learning of second language vocabulary[J]. System,1993,21(3):371-380. [6] Laufer B. ‘Sequence’and ‘Order’in the Development of L2 Lexis: Some Evidence from Lexical Confusions[J]. Applied Linguistics, 1990, 11(3): 281-296. [7] Laufer B. The development of passive and active vocabulary in a second language: Same or different?[J]. Applied linguistics, 1998, 19(2): 255-271. [8] Laufer B, Paribakht T S. The relationship between passive and active vocabularies: Effects of language learning context[J]. Language learning, 1998, 48(3): 365-391. [9] 崔艳嫣,王同顺.接受性词汇量、产出性词汇量与词汇深度知识的发展路径及其相关性研究[J].现代外语,2006(04):392-400+437-438. [10] Lewis M. The lexical approach[M]. Hove: Language Teaching Publications, 1993. [11] Nattinger J R, Decarrico J S. Lexical phrases and language teaching[M]. Oxford University Press, 1992. [12] 周红云.语言的僵化现象[J].外语界,2003(04):19-26. [13] Torabian A H, Maros M, Subakir M Y M. Lexical collocational knowledge of Iranian undergraduate learners: implications for receptive & productive performance[J]. Procedia-Social and Behavioral Sciences, 2014, 158: 343-350. [14] Selinker L. Interlanguage[J]. IRAL-International Review of Applied Linguistics in Language Teaching, 1972, 10(1-4): 209-232. [15] Richards J C, Schmidt R W. Longman dictionary of language teaching and applied linguistics [M]. Routledge, 2013. [16] Long M H. Stabilization and Fossilization in Interlanguage Development[A]. In Doughty, Catherine J, Michael H. Long, eds. The handbook of second language acquisition[C]. John Wiley & Sons, 2008, 27: 487-535. [17] 戴炜栋,牛强.过渡语的石化现象及其教学启示[J].外语研究,1999(02):11-16. [18] Krashen S. Principles and practice in second language acquisition[J]. 1982. [19] Han Z H. Fossilization: five central issues[J]. 
International Journal of Applied Linguistics, 2004, 14(2): 212-242. [20] Selinker L. Fossilization as simplification ? [J]. 1993: 197-216. [21] Selinker L, Han Z H. Fossilization: Moving the concept into empirical longitudinal study[A]. In Davis A. Studies in language testing: Experimenting with uncertainty[C].Cambridge University Press, 2001, 27: 276-291. [22]刘座雄.英语写作词汇能力石化现象探析[J].西南民族大学学报(人文社科版),2007(S1):155-158. [23] 石永新.大学生英语写作中的词汇石化现象研究[D].吉林大学, 2017. [24]陈文存.对外语和二语学习者石化现象研究问题的评述[J].外语教学理论与实践,2010(01):89-95+83. [25] 赵文静.母语汉语学生在汉英同传中的负迁移现象[D].北京外国语大学, 2018. [26] Meara P. A note on passive vocabulary[J]. Interlanguage studies bulletin (Utrecht), 1990, 6(2): 150-154. [27] 陈建生.英语词汇教学 “石化” 消解研究[D].西南大学, 2009. [28] 桂诗春.新编心理语言学[M].上海:上海外语教育出版社.2000. [29] Schwartz A I, Kroll J F. Bilingual lexical activation in sentence context[J]. Journal of memory and language, 2006, 55(2): 197-212. [30] 陈玫.从纵聚合和横组合关系看英语写作中的措辞缺陷[J].外语与外语教学,2005(06):32-35. [31] Singleton D M. Exploring the second language mental lexicon[M]. Cambridge: Cambridge University Press,1999. [32] 刘绍龙,傅蓓,胡爱梅.不同二语水平者心理词汇表征纵横网络的实证研究[J].解放军外国语学院学报,2012,35(02):57-60+70+128. [33] 李小撒,王文宇.WordNet与BNC介入下的第二语言心理词汇联系模式实证研究[J].语言科学,2016,15(01):74-84. [34] Jiang N. Lexical representation and development in a second language[J]. Applied linguistics, 2000, 21(1): 47-77. [35] Cowie A P. Phraseology: theory, analysis, and applications[M]. Oxford: Clarendon press, 1998. [36] Becker J D. The phrasal lexicon[A] In Nash-Webber B, Schank R. Proceedings of the 1975 workshop on Theoretical issues in natural language processing [C].Cambridge, Massachusetts, 1975, 60-63. [37] 杨惠中,卫乃兴.中国学习者英语口语语料库建设与研究[M].上海:上海外语教育出版社.2005. [38] Lewis M. Implementing the lexical approach: Putting theory into practice[M]. Hove: Language Teaching Publications, 1997. [39] 周正钟.语块教学法新探—理论, 实证与教学延伸[M].苏州大学出版社. 2014. [40] 贾知辉.词块概念下的高中英语词汇教学实证研究[D].哈尔滨师范大学, 2016. [41] 濮建忠.英语词汇教学中的类联接、搭配及词块[J].外语教学与研究,2003(06):438-445+481. 
[42] 卫乃兴.中国学习者英语口语语料库初始研究[J].现代外语,2004(02):140-149+216-217. [43] Bychkovska T, Lee J J. At the same time: Lexical bundles in L1 and L2 university student argumentative writing[J]. Journal of English for Academic Purposes, 2017, 30: 38-52. [44] Lu X, Deng J. With the rapid development: A contrastive analysis of lexical bundles in dissertation abstracts by Chinese and L1 English doctoral students[J]. Journal of English for Academic Purposes, 2019,(39)21-36. [45] 郭小宁.中国英语专业学生预制词块鉴别能力研究[D].东北师范大学, 2009. [46] 丁言仁,戚焱.词块运用与英语口语和写作水平的相关性研究[J].解放军外国语学院学报,2005(03):49-53. [47] Krashen S D. Principles and practice in second language acquisition[M]. New York, Oxford: Pergamon,1982. [48] Swain M. Communicative competence: Some roles of comprehensible input and comprehensible output in its development[J]. Input in second language acquisition, 1985, 15: 165-179. [49] 何花.非英语专业研究生英语输出中的“注意”培训研究[D].上海外国语大学,2014. [50] 冯纪元,黄姣.语言输出活动对语言形式习得的影响[J].现代外语,2004(02):195-200+220. [51] 戴运财,戴炜栋.从输入到输出的习得过程及其心理机制分析[J].外语界,2010(01):23-30+46. [52] 王初明.外语写长法[J].中国外语,2005(01):45-49. [53] Hulstijn J H, Laufer B. Some empirical evidence for the involvement load hypothesis in vocabulary acquisition[J]. Language learning, 2001, 51(3): 539-558. [54] 孔繁霞,王歆.任务模式与类型对词汇附带习得的影响研究[J].外语界,2014(06):21-29. [55] 魏梅,王立非.任务类型与频次因素对大学生英语惯用短语学习的影响——对投入量假设的再考察[J].现代外语,2011,34(04):372-380. [56] Vigil N A, Oller J W. Rule Fossilization: A Tentative Model[J]. Language learning, 1976, 26(2): 281-295. [57] Truscott J. The case against grammar correction in L2 writing classes[J]. Language learning, 1996, 46(2): 327-369. [58] Han Y, Hyland F. Academic emotions in written corrective feedback situations[J]. Journal of English for Academic Purposes, 2019, 38: 1-13. [59] Rassaei E. Corrective feedback, learners' perceptions, and second language development[J]. System, 2013, 41(2): 472-483. [60] Bitchener J. Evidence in support of written corrective feedback[J]. Journal of second language writing, 2008, 17(2): 102-118. 
[61] 蒋景阳.英语作为外语教学的课堂中非刻意负反馈作用的研究[D].上海外国语大学, 2010. [62] Brusilovsky P . Methods and techniques of adaptive hypermedia[J]. User Modeling and User-Adapted Interaction, 1996, 6(2-3):87-129. [63] Weber G, Brusilovsky P. ELM-ART: An adaptive versatile system for Web-based instruction[J]. International Journal of Artificial Intelligence in Education (IJAIED), 2001, 12: 351-384. [64] Alshammari M. Adaptation based on learning style and knowledge level in e-learning systems [D].University of Birmingham, 2016. [65] 廖轶.面向基础教育的自适应学习服务系统研究与应用[D].北京交通大学, 2017. [66] 陆宏,赵艳平.高中英语词汇自适应学习系统的研制[J].现代教育技术,2014,24(11):47-52. [67] Li M, Ogata H, Hou B, et al. Development of adaptive vocabulary learning via mobile phone e- mail[C]//2010 6th IEEE International Conference on Wireless, Mobile, and Ubiquitous Technologies in Education. IEEE, 2010: 34-41. [68] Jung J, Graf S. An approach for personalized web-based vocabulary learning through word association games[C]//2008 International Symposium on Applications and the Internet. IEEE, 2008: 325-328. [69] Lu M. Effectiveness of vocabulary learning via mobile phone[J]. Journal of computer assisted learning, 2008, 24(6): 515-525. [70] 吕京.基于自适应模式的英语阅读教学研究[D].北京大学, 2015. [71] 林毅君.基于自适应学习模式的英语从句语法教学研究[D].北京大学, 2015. [72] 徐亮.基于自适应学习模式的大学英语产出性词汇教学研究[D].北京大学, 2015. [73] 阙颖.面向自适应教学的英语口语资源加工方法的设计与实现[D].北京大学, 2017. [74] 宋凌云.基于自适应学习模式的高中英语听力教学研究[D].北京大学, 2016. [75] Huckin T, Bloch J. Strategies for inferring word-meanings in context: a cognitive model [A]. In Haynes M, Huckin T, Coady J. Second language reading and vocabulary learning[C]. Albex Publishing Corporation, 1993, 153-178 [76] 杨世登.英语学习者产出词汇的发展模式[J].外国语言文学,2007(04):254-259+288. [77] Gardner R C, Lalonde R N, MacPherson J. Social factors in second language attrition[J]. Language learning, 1985, 35(4): 519-540. |
公开日期: | 2019-06-19 |
题名: | 海外汉学著作精准回译策略研究——以《中国武术:从古代到21世纪》为例 |
姓名: | |
学号: | 1601210677 |
论文语种: | chi |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师1姓名: | |
导师1单位: | 软件与微电子学院 |
论文答辩日期: | 2019-05-27 |
外文题名: | Strategies of Accurate Back Translation of Overseas Sinological Works——Taking Chinese Martial Arts: From Antiquity to the Twenty-First Century as an Example |
外文关键词: | Overseas Sinological Works Accurate Back Translation Martial Arts Translation |
论文摘要: | 自上世纪八十年代以来,海外对中国的研究不断加强,出现了越来越多的汉学著作,中国学界也开始重视起这些著作的译介,这些译介在中国的海外汉学研究中扮演了重要角色。在海外汉学著作的翻译过程中,回译问题不可避免,由于海外汉学著作多为学术类著作,行文严谨,措辞谨慎,这也对译者的回译提出了更高的要求,需达到精准回译。本次翻译项目的源文本来自于《中国武术:从古代到21世纪》(Chinese Martial Arts: From Antiquity to the Twenty-First Century),该书介绍了中国武术的发展史,是一本典型的汉学著作,作者为美国著名历史学家龙佩(Peter Lorge)。 本文首先阐述了本次研究的背景与意义,介绍了本次翻译的书籍以及所用到的翻译工具,然后从“海外汉学”,“回译”和“中国武术的翻译”三个角度进行了文献综述。在第三章中,笔者基于《中国武术》选译章节的翻译,总结出该书中出现的三大回译现象,即引文回译、词汇回译、以及原文错误之回译。在引文回译中,笔者将引用类型细分成“直接引用”和“间接引用”,提出不同引用类型下,引文的回译处理方式,其中对于“间接引用”的引文进行回译时,还需留意“古今异义”现象的出现;在词汇回译中,笔者将词汇分为“人名”,“武器名”和“武术相关术语”三大类,分别就这三类词汇出现的回译问题进行了探讨;在原文错误之回译中,笔者将错误分为“名称错误”和“史实描述错误”两类,就错误性质进行了定性,并对这些错误的回译方式给出了建议。 最后笔者基于本次翻译实践的过程,提出“巧用四字格”、“合理减译”、“归化为主”以及“恪守读者视角原则”四大精准回译策略。本次研究是对武术历史类海外汉学著作精准回译策略研究的初试,以期为同类著作的翻译提供一些参考。 |
外文摘要: | During the process of translating overseas sinological works into Chinese, back translation is inevitable. Since overseas sinological works are mostly academic works written rigorously and cautiously, they place high demands on the translator’s skills and abilities. Accurate back translation is therefore needed. The present translation project is based on Chinese Martial Arts: From Antiquity to the Twenty-First Century, authored by Peter Lorge, which discusses the history of Chinese martial arts. This paper summarizes three major problems in the back translation of overseas sinological works, namely citations, special nouns, and errors in the source text. For the back translation of citations, the paper subdivides them into “direct citations” and “indirect citations”, and proposes a different method of back translation for each. For special nouns, this paper divides them into three categories: “names of people”, “names of weapons” and “martial-arts-related terms”, and discusses the back translation problems of each category. As for errors in the source text, the paper divides them into two types, “errors of names” and “errors of historical description”, and advises on how to deal with them in the back translation. Based on this translation practice, the paper then puts forward four strategies for accurate back translation, namely “using Chinese four-character structures”, “omitting information already known to the target audience”, “domesticating translation as the mainstay” and “observing the target-reader perspective”. In conclusion, this paper points out that for accurate back translation of works on Chinese martial arts, an extensive bibliographic search is a prerequisite and careful contextualization of the object of translation is necessary. |
分类号: | H059 |
论文总页数: | 191 |
参考文献总数: | 43 |
公开日期: | 2019-06-14 |
题名: | 基于语料库方法研究G.K.切斯特顿的反犹问题 |
姓名: | |
学号: | 1601210504 |
论文语种: | chi |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师1姓名: | |
导师1单位: | 北京大学软件与微电子学院 |
论文答辩日期: | 2019-05-27 |
外文题名: | Corpus-based Approaches to Anti-Semitism of G.K. Chesterton |
外文关键词: | G.K. Chesterton Corpus-based Approaches Cohen’s d Anti-Semitism |
论文摘要: | 吉尔伯特·基思·切斯特顿是20世纪初的英国作家和记者。他生前被指控为反犹主义,如今他是否反犹仍是有争议的问题。笔者使用语料库方法研究两个问题:1、切斯特顿的犹太观点有何显著的特点?2、他的犹太观点特点是否同特定的思维模式相关? 研究步骤如下:1、建立切斯特顿几乎全部作品语料库和同时代英国英语参考语料库,使用POS和USAS标注系统进行标注。2、收集一组犹太主题词汇,在切斯特顿语料库中研究它们的搭配,分析出切斯特顿犹太观点的特点。3、笔者认为切斯特顿在不同时期作品中广泛分布的语言特征有可能是他的思维模式的语言表征,因而笔者依据年份信息将切斯特顿语料库分组,同时也将参考语料库分组,使用Cohen’s d方法计算两组语料语言特征的效应量差别,并选择Cohen’s d值大于0.8的语言特征作为具有关键性的语言特征,将它们视作潜在的切斯特顿思维模式的语言表征。4、筛选出具有关键性的语言特征和搭配的重合部分,并分析它们在切斯特顿语料库中的用法,考察其用法是否与同犹太主题词汇共现时的用法相通,以此揭示切斯特顿思维模式与犹太观点的联系。 通过搭配分析,笔者发现:切斯特顿对犹太人在西方世界的存在、犹太人与金钱的关系、犹太人的人际关系都给予了负面的评价;他常常将犹太人分为不同的类型,并对这些类型进行两极化的评价;他多次将犹太教与基督教对立起来;他对犹太人的“身份”给予了一定关注。结合关键性分析,笔者发现切斯特顿的犹太观点存在背后思维模式的支撑:他将基督教作为理解其他宗教的参照,因而会将犹太教与基督教对立;他常常发现并展示事物的矛盾之处,而他对犹太人外在身份与实质的矛盾的关注吻合这一思维模式。 |
外文摘要: | Gilbert Keith Chesterton was a British writer and journalist of the early 20th century. He was accused of anti-Semitism during his lifetime, and whether he was an anti-Semite remains a controversial issue today. This study uses corpus linguistics methods to answer two questions: 1. What are the most prominent features of his views of Jewishness? 2. Are those features related to his ideological frame of mind as a whole? The research steps are as follows: 1. Build a corpus of almost all of Chesterton's works, as well as a reference corpus of British English from roughly the same era, and tag both corpora with the POS and USAS annotation systems. 2. Collect a set of Jewish-themed words and calculate their lemma and semantic-tag collocations in the Chesterton corpus, obtaining the prominent features of Chesterton's view of Jewishness through collocation analysis. 3. The author argues that linguistic features widely distributed across Chesterton's works from different periods may be linguistic representations of his general frame of mind. Therefore, this study divides both the Chesterton corpus and the reference corpus into groups by year and uses Cohen's d to calculate the effect size of the difference in each linguistic feature between the two corpora. Following the conventional benchmark, features with a Cohen's d larger than 0.8 are selected as key features and potential representations of his general frame of mind. 4. The author then selects the overlap between the key features and the collocations and analyzes whether their general usage in the Chesterton corpus relates to their usage in collocation with Jewish-themed words, in order to reveal the connection between Chesterton's view of Jewishness and his general frame of mind. Through collocation analysis, the author draws these conclusions: 1. Chesterton holds negative opinions about the Jewish people's presence in the Western world, their relationship with money, and their interpersonal relationships; 2. he often divides Jewish people into different types and evaluates those types in polarized terms; 3. he sets up Judaism as the opposite of Christianity; 4. he is concerned about Jewish identity. Combining these findings with the key-feature analysis, the author finds that Chesterton's views of Jewishness are supported by his general frame of mind: he uses Christianity as a reference for understanding other religions, and thus pits Judaism against Christianity; he often finds and displays contradictions in things, and his concern about the contradiction between the external identity and the substance of Jewish people is consistent with this mode of thinking. |
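The effect-size step described in this abstract (keeping linguistic features whose Cohen's d exceeds 0.8) can be sketched as follows. This is a minimal illustration, not the thesis's actual procedure: it assumes the common pooled-standard-deviation form of Cohen's d, assumes each feature is measured as one relative frequency per year group, and uses invented feature names (`tag_A`, `tag_B`) and invented numbers.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Cohen's d for two independent groups, using the pooled standard deviation.

    Each group needs at least two values, because the sample variance is used.
    """
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    if pooled_var == 0:
        # No spread in either group: d is undefined; this sketch returns 0.0.
        return 0.0
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


def key_features(target, reference, threshold=0.8):
    """Return the features whose d (target vs. reference) is above the threshold."""
    selected = {}
    for feature, target_values in target.items():
        if feature in reference:
            d = cohens_d(target_values, reference[feature])
            if d > threshold:
                selected[feature] = d
    return selected


# Hypothetical relative frequencies (per 1,000 words), one value per year group.
chesterton = {"tag_A": [2.0, 4.0, 6.0], "tag_B": [5.0, 6.0, 7.0]}
reference = {"tag_A": [1.0, 3.0, 5.0], "tag_B": [1.0, 2.0, 3.0]}

print(key_features(chesterton, reference))  # {'tag_B': 4.0}
```

Here `tag_A` has d = 0.5 and is dropped, while `tag_B` has d = 4.0 and is kept. The sketch only keeps features that are more frequent in the target corpus; comparing `abs(d)` with the threshold would also keep features that are markedly less frequent there.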
分类号: | I56 |
论文总页数: | 60 |
参考文献总数: | 50 |
公开日期: | 2019-06-25 |
英文汉学著作的汉译: 回译和变译.房一品
题名: | 英文汉学著作的汉译: 回译和变译 |
姓名: | |
学号: | 1701212749 |
公开时间: | 公开 |
学位: | |
院系: | |
导师1单位: | 外国语学院 |
论文答辩日期: | 2019-05-24 |
外文题名: | English to Chinese Translation of Sinology Publications: Back-translation and Translation Variation |
外文关键词: | Chinese Studies early Chinese philosophy back-translation translation variation |
论文摘要: | 本翻译项目源文本取自《早期中国哲学中的情感元素》一书的部分章节。该书是多伦多大学文理学院东亚研究系副教授居里·维拉格(Curie Virág)所著,于2017年由牛津大学出版社在美国首次出版。该书围绕“情感”在早期中国思想家的理论中的地位展开研究,追溯了早期中国哲学概念的谱系, 并考察了它们在古代中国伦理、政治和文化价值观形成中的关键作用。该书分为六个章节,本翻译项目选取了其中的前言、结论和前三章进行翻译,涉及内容包括:孔子《论语》中的情感元素和完整自我、《墨子》对人类社会的重新定义、《道德经》中宇宙欲望和人的能动性。居里·维拉格在哈佛大学东亚语言与文化系取得了博士学位;她的主要研究方向是前现代时期(战国至公元十二世纪)的中国哲学及思想史,已经出版三部学术著作并发表了三十多篇学术论文。作为一部以英语撰写的学术著作,本翻译项目选取的文本具备下列特点:语言风格正式、专业词汇多、名词化场景多、被动句和复合句多。此外,作为一部海外汉学著作,此书涉及大量用英文改译的汉学典故、文献名称和人名头衔名,给翻译造成了一定困难和挑战。 |
外文摘要: | This translation project is based on The Emotions in Early Chinese Philosophy, written by Curie Virág and published by Oxford University Press, New York, in 2017. The book focuses on the significance of emotions in the theories of early Chinese philosophers, traces the genealogy of these early Chinese philosophical conceptions, and examines their crucial role in the formation of ethical, political and cultural values in China. The book consists of six chapters; the introduction, the conclusion and the first three chapters are taken as the source text of this project. The selected text offers insights into emotions and the integrated self in the Analects of Confucius, redefinitions of the human community in the Mozi, and cosmic desire and human agency in the Daodejing. The author, Curie Virág, received her Ph.D. from the Department of East Asian Languages and Civilizations at Harvard University. She works in the fields of premodern Chinese philosophy and intellectual history (Warring States to the 12th century) and has published three academic books and more than thirty papers. As an academic work written in English, the selected text has the following characteristics: a formal style, a rich philosophical vocabulary, frequent nominalization, and many passive and compound sentences. In addition, as an overseas work on Chinese studies, the source text involves a large number of Chinese allusions, titles of works, and personal names and titles rendered in English, which poses considerable difficulties and challenges for translation. |
分类号: | H31 |
论文总页数: | 14 |
参考文献总数: | 17 |
公开日期: | 2019-06-13 |
题名: | 《译者的取与舍——简析英译汉的异化归化策略》 |
姓名: | |
学号: | 1701212752 |
论文语种: | chi |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师1姓名: | |
导师1单位: | 中国人民大学外国语学院 |
论文答辩日期: | 2019-05-24 |
论文摘要: | 《欧洲海外殖民帝国,1879–1999——一段短暂的历史》是一本历史题材类著作。作者探讨了 19 世纪末至 20 世纪末这一百年间欧洲海外殖民帝国的发展动力和历史轨迹,以及这段交织着欲望与血泪的殖民史对当今世界的种种影响。出于对世界历史的热爱和对历史的反思,笔者选择本书作为翻译实践的对象。在本篇报告中,笔者按照译前准备、译中处理和译后处理的顺序,先是简要回顾了国内外对于历史类文本英译汉的研究情况,再对作者选取的异化归化理论进行大致的介绍,并结合翻译实例,从词语、句式、修辞和思维逻辑四个方面分析得出结论——翻译实践中异化与归化并存,缺一不可,从而回答了译者对原文和译文如何取舍的问题。最后,笔者探讨了这两种翻译策略的研究意义,进一步思考了翻译理论对翻译实践的指导作用以及译者如何提升自身专业素质的问题。笔者希望借此番探讨引起广大翻译爱好者和从业者的共鸣。 |
分类号: | H059 |
论文总页数: | 277 |
参考文献总数: | 10 |
公开日期: | 2019-06-25 |
题名: | 汉语“V-的”结构中的“的”及其锚定功能 |
作者: | |
学号: | 1601213231 |
专业: | |
培养层次: | 硕士 |
培养单位: | 北京大学 |
导师姓名: | |
导师单位: | 外国语学院 |
答辩日期: | 2019-05-23 |
题目(外文): | The Anchoring Function of de in V-de Construction in Mandarin Chinese |
文摘: | 大量的文献探究了生成语言学视角下的时态表达,但是前人对具体语言中的时态系统仍然没有明确定论。中文通常被认为缺乏显性的形态变位,因此学术界对其时态的表达机制有众多的讨论。本文旨在研究中文“V-的”结构中“的” 的时态功能和锚定机制。前人文献里讨论了与“V-的”相似的结构,如分裂句、事态句、焦点结构等等。本文试图将中文“V-的”结构与其它类似形式区分开来,并表明“V-的”结构表现出特殊的句法特性。中文“V-的”结构并不应该被视为和前人讨论的“是…的”等句内部结构一致,也不应被笼统归为是同一结构的不同变体。众多的研究观察表明中文“V-的”句有两个主要的句法表现:其一、时态上,中文“V-的”结构倾向于得到过去时的解读,且这种解读是由功能词“的”带来的。其二、中文“V-的”句与表示将来的时态标记,中文体助词“了”、“着”、“过”,以及句末“了”在句法上并不兼容。在此基础上,本研究对结构的讨论需要回答两个与中文“V-的”结构的句法属性相关的研究问题:第一、这个结构中的“的”如何产生表达偏向过去时的、非未来的时态解读?第二、为什么这个结构中的“的” 在句法上不允许与上述提到的元素共现?本研究在生成句法的视野和最简方案的框架下提出了一个解释,将“的” 视为有指示性质的词项,含有[+指示性]的特征,其功能为锚定事态。锚定的功能在中文的Dº和Tº上同样实现为词素“的”。“的”在“V-的”结构的句法生成过程中其位置从AspPº移动到Tº,最终落脚到Cº。这种论证的原因在理论上有Marantz(2013)的语境异义性(contextual allosemy)概念的支持,并在实践层面可以解释上述提到的中文“V-的”结构的众多句法表现。 本文结构上首先简要介绍了中文“V-的”结构的一系列句法表现。文章第一章回顾了以往研究提出的关于时态机制的相关文献。第二章综述探究了在不同理论视角下前人研究对和中文“V-的”结构类似的不同形式的结构的分析,如焦点句、分裂句等。第三章讨论了中文“V-的”句的形式和句法表现,将其和其它的形式区分开来,明确定义了什么是本文讨论“V-的”结构,并进一步展开陈述本文要讨论的问题。文章第四章对词项“的” 在中文“V-的”结构中的句法结构和语义属性进行了解释。本研究旨在考察、描述、分析中文“V-的”结构的时态特性并从句法的层面提出解决方案。本研究的贡献在于帮助未来的研究区分与中文“V-的”结构相似的众多结构,并对后人有关中文“的”、焦点结构、信息结构等研究提供思路。本研究同时也为比较跨语言的时态表达和时态锚定的机制提供了一个视角,为后人讨论汉语时态的系统和时态锚定机制提供了话题。 |
文摘(外文): | Tense expression has been extensively researched but not adequately attested under the generative linguistic paradigm, and theories of tense derivation in specific languages abound. Mandarin Chinese (henceforth Chinese) is generally considered to lack overt morphological tense inflection and has thus been the subject of much scholarly debate on various tense-related issues. This paper sets out to investigate the tense interpretation of verbal de in the V-de construction in Chinese. The V-de construction has been given many labels in previous literature, such as cleft construction, state-of-affairs sentence and focus construction, to name but a few. The present study attempts to distinguish the V-de structure from other analogous forms and suggests that it demonstrates particular syntactic properties, unlike three other structure types previously thought to be variations of the same homogeneous construction as V-de. The current analysis examines two major unresolved puzzles of the V-de structure: a) it has been widely recognized to yield a preferred past reading, and its temporal information is proposed to be realized via the functional item de; and b) it is incompatible with future tense markers, the aspectual auxiliaries le/zhe/guo and sentential le. Such distinct properties lead to inquiries about the syntax of V-de and the functions of its constituents. This study intends to answer the following questions: a) how does verbal de yield a non-future tense reading? and b) why does the structure disallow co-occurrence with the above-mentioned elements? The present study proposes an explanation to account for these syntactic properties from a formal perspective, in the spirit of the Minimalist Program (henceforth MP). It regards verbal de as a lexical item bearing a deictic feature that can be realized when de is merged in either Dº or Tº. In both cases, its deicticity fulfills a general anchoring function, and its specificity varies with its particular representation on different functional heads. In the analysis of V-de, the temporal reading derived in the construction can be accounted for by the deicticity of de when merged in Tº. The syntactic process in V-de sentences is argued to be that de moves from AspPº to Tº and finally to Cº. This argument is theoretically supported by Marantz's (2013) concept of contextual allosemy and empirically attested by the syntactic behavior of Chinese verbal de constructions. This paper first provides a brief introduction to the syntactic behavior of the verbal de structure. Chapter one reviews relevant literature on tense mechanisms proposed in previous studies, which serves as the groundwork for tense research. Chapter two surveys past studies from different theoretical perspectives, both on verbal de and on variant forms of the V-de construction, whose idiosyncrasy is concealed under various labels such as focus/cleft construction. Chapter three discusses the particular form and characteristics of the V-de structure and what does not count as a V-de structure, examining their syntactic representations and pinning down the exact issues to be addressed in this paper. Chapter four offers an explanation of the item de and the basic syntactic and semantic features of the V-de structure. This paper not only provides a descriptive examination of some puzzling structures but also puts forward a syntactic explanation of the tense properties of the V-de construction, in the hope of shedding light on inquiry into V-de as well as on tense anchoring in Chinese. It also opens a window onto further cross-linguistic comparison in the expression of temporal and aspectual information, thus contributing to the large body of literature on the tense systems of analytic languages in general and of Chinese in particular. |
分类号: | H04 |
论文总页数: | 60 |
参考文献数: | 91 |
参考文献: |
Adger, D. 2007. Three domains of finiteness: A minimalist perspective. Finiteness: Theoretical and Empirical Foundations. In I. Nikolaeva (ed.). 23–58. Oxford: Oxford University Press.
Baker, M. & Travis,L. 1997. Mood as verbal definiteness in a “tenseless” language. Natural Language Semantics 5(3): 213–269. Chao, Y. R. 1968. A Grammar of Spoken Chinese. Berkeley: University of California Press. Chappell, H. & Thompson, S. A. 1992. The semantics and pragmatics of associative DE in Mandarin Chinese discourse. Cahiers de Linguistique—Asie Orientale 21(2): 199-229. Cheng, L. L-S. 2008. Deconstructing the shi de construction. The Linguistic Review 25, 3/4: 235–266. Chiu, B. H. 1993. The Inflectional Structure of Mandarin Chinese. Doctoral dissertation, UCLA. Chomsky, N. 1995. The Minimalist Program. Cambridge, MA: MIT Press. Comrie, B. 1985.Tense. Cambridge: University Press. Deng, S-H. 1979. Remarks on cleft sentences in Chinese. Journal of Chinese linguistics 7 (1): 101-114. Ehlich, K. 1982. Anaphora and deixis: same, similar, or different? In Jarvella & Klein (eds.): 315-338. Encyclopedia of Chinese Languages and Linguistics. 2015. In R. Sybesma, W. Behr, Z. Handel, C.-T. J. Huang& J. Myers (eds.). Leiden: Brill. Gärdenfors, P. & Brala-Vukanović, M. 2018. Semantic domains of demonstratives and articles: A view of deictic referentiality explored on the paradigm of Croatiandemonstratives. Lingua 201: 102-118. Gerner, M. 2009. Deictic features of demonstratives: Atypological survey with special reference to the Miao group. The Canadian Journal of Linguistics / La revue canadienne de linguistique, 54(1): 43-90. Gillon, C. 2009. Deictic features: evidence from Skwxwú7mesh. International Journal of American Linguistics 75(1): 1-27. Grano, T. 2017. Finiteness contrasts without Tense? A view from Mandarin Chinese. Journal of East Asian Linguistics 26(3): 259–299. Heine, B. T. K. 2002. World Lexicon of Grammaticalization. Cambridge: Cambridge University Press. Hinzen, W.& Sheehan, M. 2013. The Philosophy of Universal Grammar. Oxford: Oxford University Press. Huang, C-T. J. 2015. On syntactic analyticity and parametric theory. 
Chinese Syntax in a Cross-linguistic Perspective, In Audrey Li, Andrew Simpson & Dylan Tsai (eds.). 1-48. Oxford: Oxford University Press. Huang, C-T. J., Li, Y -H. A. & Li Y. F. 2008. The Syntax of Chinese. Cambridge: Cambridge University Press. Klein, W. 1994.Time in Language. London: Routledge. Klein, W., Li, P.&Hendriks, H. 2000. Aspect and assertion in Chinese. Natural Languageand Linguistic Theory 18:723–770. Levinson, S. C. 1983. Pragmatics. Cambridge: Cambridge University Press. Levinson, S. C. 2004. Deixis.The Handbook of Pragmatics.In L. Horn and G. Ward (eds.). 97–121. Oxford: Blackwell. Lin, J-W. 2000. On the temporal meaning of the verbal–le in Mandarin Chinese. Language and Linguistics 1(2):109-133. Lin, J-W. 2002. 论现代汉语的时制意义. Language and Linguistics 3(1): 1-25. Lin, J-W. 2003. Temporal reference in Mandarin Chinese. Journal of East Asian Linguistics 12:259–311. Lin, J-W. 2006. Time in a language without tense: The case of Chinese. Journal of Semantics 23: 1–56. Lin, J-W. 2010. A tenseless analysis of Mandarin Chinese revisited: A response to Sybesma 2007.Linguistic Inquiry 41:305–329. Lin, J-W. 2012. Tenselessness. The Oxford Handbook of Tense andAspect.In R. I. Binnick (ed.). 669–695. Oxford, UK: Oxford University Press. Lin, T-H. J. 2015. Tense in Mandarin Chinese sentences. Syntax, 18 (3): 320-342. Lyons, J. 1977. Semantics. Cambridge: Cambridge University Press. Marantz, A. 2013. Verbal argument structure: Events and participants. Lingua, 130:152–168. Modine, P. 1993. A theory of evolution of the Mandarin focus construction ‘shi…de’. Asian and African Studies (2): 154-168. Ning, C. Y. 1995. De as a functional head in Chinese. Paper presented at the Annual Forum of the Linguistic Society of Hong Kong. Paris. M-C. 1979. Nominalization in Mandarin Chinese: The morpheme de and the shi…de construction, DRL, Universite de Paris 7, Paris. Paul, W. 2005. Low IP area and left periphery in Mandarin Chinese. 
Recherches Linguistiques deVincennes 33: 111–133. Law, P. &Ndayiragije, J. 2017. Syntactic tense from a comparative syntax perspective. Linguistic Inquiry, 48(4): 679-696. Paul W. & WhitmanJ. 2008. Shi…de focus clefts in Mandarin Chinese. The Linguistic Review 25, 3/4: 413-451. Pollock, J.-Y. 1989. Verb movement, Universal Grammar, and the structure of IP. LinguisticInquiry, 20, 365-424. Pulleyblank, E. 1995. Outline of Classical Chinese Grammar. Vancouver: University of BritishColumbia Press. Reichenbach, H. 1947. Elements of Symbolic Logic. New York: The Macmillan Company. Ritter, E.& Wiltschko, M. 2005. Anchoring events to utterances without tense. In Proceedings ofthe 24th West Coast Conference on Formal Linguistics. In John Alderete et al. (ed.). 343-351. Somerville, MA:Cascadilla Proceedings Project. Ritter, E. &Wiltschko, M. 2009. Varieties of INFL: TENSE, LOCATION and PERSON. Alternatives to Cartography. In Jeroen van Cranenbroeck (ed.), 153–201. Berlin: Mouton de Gruyter. Ritter, E. &Wiltschko, M. 2014.The composition of INFL: An exploration of tense, tenseless languages, and tenseless constructions. Natural Language & Linguistic Theory 32(4): 1331–1386. Roberts, I. 1993. Verbs and Diachronic Syntax: a Comparative History of English and French.Dordrecht: Kluwer Academic Publishers. Simpon, A. Definiteness agreement and the Chinese DP.Language andLinguistics 2: 125–156. Simpson, A. 2002. On the status of ‘modifying’ DE and the structure of the Chinese DP. On the Formal Way to Chinese Languages. In S-W Tang & C-S Liu (eds.). 260-285. Stadford: CSLI Publications. Simpson, A.& Wu, Zoe X-Z. 2002. From D to T — determiner incorporation and the creation of tense. Journal of East Asian Linguistics 11: 169 - 209. Smith, C. S. & Erbaugh, M. S. 2005. Temporal interpretation in Mandarin Chinese. Linguistics 43 (4): 713–756. Soh, H. L., & Gao, M. 2008. Mandarin sentential -le, perfect and English already. Event Structure in Linguistic Form and Interpretation. In J. 
Dölling, T. Heyde-Zybatow, & M. Schäfer (eds.). 447-473. Berlin: Mouton de Gruyter. Stowell, T. A. 1982. The tense of infinitives. Linguistic Inquiry 13:561-70. Stowell, T. A. 1995. The phrase structure of tense. Phrase Structure and the Lexicon. In J. Rooryck & L. Zaring (eds.). 277-291. Dordrecht: Kluwer Academic Publishers. Sybesma, R. 2007. Whether we tense-agree overtly or not. Linguistic Inquiry 38: 580–587. Tang, T-C. 1983. Guoyu de jiaodian jiegou: fenlieju, fenlie bianju yu zhun fenlieju [Focusing constructions in Chinese: cleft sentences and pseudo-cleft sentences]. Universe and Scope. Presupposition and Quantification in Chinese. In T-C Tang, R. L. Cheng, & Y-C Li (eds.). 127 - 226. Taipei: Student book Co. Teng, H-H. 1979. Remarks on cleft sentences in Chinese. Journal of Chinese Linguistics 7:101–113. Tsai, W-T. D. 2008. Tense anchoring in Chinese. Lingua 118 : 675–686. Warglien, M.&Gärdenfors, P. 2013. Semantics, conceptual spaces, and the meeting of minds. Synthese, 190: 2165-2193 Wiltschko, M. 2003. On the interpretability of tense on D and its consequences for Case Theory. Lingua113:659-696. Wiltschko, M. 2004. Expletive categorical features: A case study of number in Halkomelem. InProceedings of NELS 35 (2).In Leah Bateman, & Cherlon Ussery (ed.). 631–646. Amherst, MA:GLSA Publications. Wiltschko, M. 2014. The Universal Structure of Categories: Towards a Formal Typology. Cambridge University Press. Wu, J-S. 2009. Tense as a discourse feature: Rethinking temporal location in Mandarin Chinese.Journal of East Asian Linguistics 18: 145–165. Xu, Y. 2014. A corpus-based functional study of shi...de constructions. Chinese Language and Discourse 5(2): 146–184. 邓思颖. 2006. 以“的”为中心语的一些问题. 当代语言学(3). 李讷, 安珊笛, 张伯江. 1998. 从话语角度论证语气词“的” 中国语文(2). 李铁根 2002. “了”、“着”、“过”与汉语时制的表达. 语言研究(3). 林若望. 2017. 再论词尾“了”的时体意义.中国语文(1). 刘勋宁. 1985.现代汉语词尾“了”的语法意义. 中国语文(5). 刘勋宁. 1990. 现代汉语句尾“了”的语法意义及其与词尾“了”的联系. 世界汉语教学(2). 吕叔湘主编. 1980. 现代汉语八百词. 商务印书馆. 郭锐. 2015. 
汉语谓词性成分的时间参照及其句法后果.世界汉语教学(4). 郭锐. 2016. 汉语叙述方式的改变和“了1”结句现象. 中国语文 (263). 黄正德. 1990. 說「是」和「有」.中央研究院歷史語言研究所集刊 (59). 马学良&史有为. 1982. 说“上哪儿的”及其“的”. 语言研究(1). 麦子茵. 2012. 终结性与“(是)…的”的焦点结构. 语言学论丛(44). 木村英树. 2003. “的”字句的句式语义及“的”字的功能拓展. 中国语文(4). 杉村博文. 1999. “的”字结构、承指与分类. 汉语现状与历史的研究(江蓝生、侯精一主编).中国社会科学出版社. 石毓智. 2000. 论“的”的语法功能的同一性. 世界汉语教学 (1). 石毓智. 2005. 论判断、焦点、强调与对比之关系—“是”的语法功能和使用条件. 语言研究 25 (4). 石定栩. 2008. “的”和“的”字结构. 当代语言学(4). 宋玉柱. 1981. 关于时间助词“的”和“来着”. 中国语文(4). 史有为. 1984. 表已然义的“的b”补议. 语言研究(1). 完权 2018. “的”和“的”字结构. 上海:学林出版社. 完权. 2013. 事态中的“的”. 中国语文(1). 王文颖. 2016. 现代汉语“是……的”句的焦点结构研究. 博士论文: 北京大学中国语言文学系. 袁毓林. 1995. 谓词隐含极其句法后果—“的”字结构的代称规则和“的”的语法、语义功能。中国语文(4). 袁毓林. 2003a.从焦点理论看句尾“的”的句法语义功能.中国语文(1). 袁毓林. 2003b.句子的焦点结构及其对语义解释的影响. 当代语言学 (4). 朱德熙. 1961. 说“的”. 中国语文(12). 朱德熙. 1978. “的”字结构和判断句. 中国语文(1-2). 朱德熙. 1982. 语法讲义. 北京: 商务印书馆. 朱庆祥. 2017. 也论“应该∅的”句式违实性及其相关问题.手稿. |
公开日期: | 2022-06-04 |
供应链金融下中小企业信用评级研究 -以工程机械行业为例.孙浩
题名: | 供应链金融下中小企业信用评级研究 -以工程机械行业为例 |
姓名: | |
学号: | 1701211051 |
论文语种: | chi |
公开时间: | 公开 |
学位: | |
院系: | |
导师1单位: | 软件与微电子学院 |
论文答辩日期: | 2019-05-20 |
论文摘要: | 中小企业在优化经济结构和缓解就业压力等方面呈现出重要的价值,但是受到生产经营规模较小、管理模式落后等因素的制约,中小企业的融资渠道极为狭窄,融资成功率也较低,极大地限制了中小企业进一步发展壮大的步伐。与此同时,国内供应链金融随之应运而生,商业银行等金融机构帮助中小企业周转流动资金,实现多方互利共赢。然而,供应链金融存在信息不对称风险,不同的供应链金融模式所潜在的风险也具有显著差异。随着我国供应链金融行业呈现出迅猛的发展态势,商业银行在经营过程中开始面临在供应链的特殊环境下对中小企业的信用进行风险评估的问题。 本文以供应链金融的发展状况作为宏观研究背景,通过对工程机械行业供应链金融融资模式及相应模式下的风险特征的研究,筛选并优化工程机械行业供应链金融信用评价指标,量化工程机械行业供应链金融环境下中小企业潜在的信用风险。 首先,本文阐述了研究命题所涉及的相关理论内容,即供应链金融概念、融资模式类型以及相关信用评价体系等;其次,详细阐述了当前工程机械行业供应链金融下不同的融资模式的具体流程及各自的风险特征,从而为构建工程机械行业基于供应链金融环境下的信用指标体系形成良好的前提条件;最后,选取了财务数据完善的工程机械行业中新三板企业作为样本,运用因子分析法对初选的信用指标体系进行降维处理,并利用Logistic回归模型来完成基于供应链金融环境下工程机械行业中小企业信用风险评价体系的构建。 本研究构建了工程机械行业基于供应链金融的信用评价指标体系,并检验了指标体系的可行性。本指标体系的设计和实现对工程机械行业中小融资企业具有理论价值和现实意义。 |
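The final modelling step named in this abstract (a Logistic regression fitted on factor scores to estimate SME credit risk) can be sketched as below. This is a hedged illustration only: the factor-analysis dimension reduction is omitted, the inputs are assumed to already be factor scores, the two factors and all sample values are invented, and plain batch gradient descent stands in for whatever fitting procedure the thesis actually used.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(rows, labels, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the average log-loss."""
    n, n_features = len(rows), len(rows[0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for x, y in zip(rows, labels):
            # Prediction error for this sample: predicted probability minus label.
            err = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias) - y
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def default_probability(x, weights, bias):
    """Estimated probability that a firm with factor scores x defaults."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


# Hypothetical factor scores (e.g. profitability, supply-chain stability).
# Label 1 means the firm defaulted, 0 means it did not; all values are invented.
rows = [[1.2, 0.8], [0.9, 1.1], [1.0, 0.5],
        [-1.0, -0.7], [-0.8, -1.2], [-1.1, -0.4]]
labels = [0, 0, 0, 1, 1, 1]

weights, bias = fit_logistic(rows, labels)
print(default_probability([1.0, 1.0], weights, bias))    # low risk expected
print(default_probability([-1.0, -1.0], weights, bias))  # high risk expected
```

In practice the factor scores would come from a factor analysis of the screened financial and supply-chain indicators, and the fitted model would be checked on held-out firms before being used for rating.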
分类号: | F83 |
论文总页数: | 49 |
参考文献总数: | 56 |
Release date: | 2019-06-03 |
Title: | 国际视角下建筑行业协会合作对建筑职业培训效果影响的研究 |
Name: | |
Student ID: | 1701211055 |
Thesis language: | chi |
Degree level: | Master's |
Institution: | Peking University |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Defense date: | 2019-05-20 |
English title: | Research on effect of construction industry vocational training from the perspective of international cooperation among NGOs |
English keywords: | International cooperation between industry associations; Vocational training; Game theory |
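Abstract: | The technological level of China's construction industry is relatively low, and practitioners' safety awareness and professional knowledge fall short of those in developed countries. This leads to frequent construction safety accidents that endanger workers' lives and harm economic development. In addition, domestic construction vocational training has not been effective enough, so the skills and professional quality of construction workers currently fail to meet the needs of the industry's development. Effective training can improve construction workers' professional skills and professional quality. Overseas experience with vocational training shows that good training outcomes result from the multi-party participation, positive interaction, and organic integration of workers, government, enterprises, and industry associations. |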
English abstract: | Outdated construction technology, weak safety awareness, and a lack of professional skills among migrant workers lead to construction accidents in China, which pose a severe threat to lives, families, and the economy in general. There is no sufficient training program for migrant workers, which makes the situation even worse. Because effective training programs are lacking, migrant workers do not possess the necessary safety skills and are hence unable to meet the requirements of overseas construction projects. Vocational training aims to improve the skills and knowledge of migrant workers. Studies of foreign vocational training conclude that effective training results from the involvement of workers, organizations, enterprises, and the government, who have positive interaction and better |
Classification number: | F26 |
Total pages: | 62 |
Total references: | 74 |
Release date: | 2019-06-11 |
Title: | 中国技术写作认证考试设计与实证 |
Name: | |
Student ID: | 1401210700 |
Release status: | Public |
Degree: | |
Department: | |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Foreign Languages |
Advisor 2 name: | |
Advisor 2 affiliation: | School of Software and Microelectronics |
Defense date: | 2018-11-30 |
English title: | The Design and Verification of the Technical Writing Certification for Chinese Technical Writers |
English keywords: | Technical writing; Competency requirements; Certification examination |
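Abstract: | As China's economy has grown, many enterprises and universities have realized that technical writing plays an increasingly important role in product sales and user satisfaction. They have begun to pay attention to talent cultivation in universities and talent supply to enterprises, and they urgently need a benchmark for talent selection to help enterprises find qualified people. At present, Europe and the United States have relatively mature technical writing certification examinations, such as the CPTC certification examination of the Society for Technical Communication in the United States and the TCTrainNet certification examination of the German association for technical communication. However, transplanting these examinations directly to the Chinese market is inappropriate, for the following reasons. First, the content of foreign certification examinations does not correspond to the job requirements and competency requirements of technical writing positions in China. Second, advances in the times and in technology have placed new demands on technical writing, such as content design, writing requirements, and quality control. Third, foreign certification examinations emphasize conveying theoretical knowledge of technical writing and barely assess practical skills. To address these problems, the author proposes designing a certification examination according to the needs of China's technical writing industry and sets out the research approach and methods. First, this thesis defines the competency structure mainly in terms of "work tasks" and explains the competencies that candidates need to master. Drawing on enterprise recruitment information, interviews with industry practitioners, technical writing courses, and existing technical writing examinations, the author concludes that technical writing practitioners need to master five competencies: analysis, design, writing, quality control, and publishing. Second, based on these design foundations, the author drafts the syllabus of the Chinese technical writing certification examination and cross-validates it by expert evaluation to verify its validity. Next, the author discusses the applicability of each question type in light of the characteristics of the examination content and proposes a design method for each question type. Then, the author discusses the evaluation criteria for this study based on the characteristics of technical writing and existing examination scoring criteria. Finally, following the examination syllabus and examination methods, the author conducts three experiments. The subjects of the first experiment are technical writing practitioners with more than two years of work experience; the subjects of the second are practitioners who have worked in technical writing for less than half a year; and the subjects of the third are students of the 2017 cohort in Computer-Aided Translation at Peking University. The test results verify the reliability and validity of the sample paper. The results show that the examination syllabus and examination design method developed in this study are valid, reliable, and feasible. The designed examination meets both the needs of the Chinese market and the new demands that the new era places on talent. The author hopes that the examination design proposed in this thesis will inspire and encourage more enterprises and industries to pay attention to the development of the technical writing industry and to talent cultivation. |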
English abstract: | As China's economy has developed, many enterprises and universities have become aware that technical writing plays an increasingly important role in product sales and customer satisfaction. How to train technical writers and evaluate their output has become a concern. At present, there are many certification exams related to technical writing, such as the CPTC certificate exam of the Society for Technical Communication and the TCTrainNet certificate exam of tekom. However, these exams cannot be applied directly in China. First, the content of foreign certification exams does not always fit Chinese job requirements. Second, the progress of the era and of technology has posed new challenges to technical writing, such as content design, writing requirements, and quality control. Third, foreign certification exams focus more on theoretical knowledge, while practical assessment is barely involved. This paper proposes a design for a technical writing certificate according to the demands of China's technical writing industry and sets out the research methods. To begin with, this paper defines the composition of capabilities through job analysis. From enterprise recruitment information, industry interviews, technical writing courses, and existing technical writing certificates, five major capabilities are identified: analysis, design, writing, quality control, and release. Then, this paper determines the outline and details of the Chinese technical writing certificate. The author uses the expert evaluation method to cross-validate the examination outline. Next, the author discusses the applicability of each question type according to the content of the technical writing examination and puts forward a design method for each question type. Afterwards, the author discusses the evaluation criteria of this study based on the characteristics of technical writing and existing scoring criteria. Finally, building on the previous work, the author carries out three experiments.
The first group of experimental subjects is technical writers who have worked for more than two years; the second is technical writers who have been engaged in technical writing for less than half a year; and the third is students majoring in Computer-Aided Translation at Peking University. The test results verify the reliability and validity of the sample test. The results indicate that the design of the technical writing certificate is effective, credible, and feasible. The design meets the needs of the Chinese market. The author hopes that the design proposed in this paper can inspire and encourage more enterprises and industries to pay attention to the development of the technical writing industry and to talent cultivation within it. |
Classification number: | G40 |
Total pages: | 84 |
Total references: | 57 |
Release date: | 2018-11-30 |
Title: | 医学英语词汇学习系统研究与设计 |
Name: | |
Student ID: | 1501210657 |
Major: | |
Degree level: | Master's |
Institution: | Peking University |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Advisor 2 name: | |
Advisor 2 affiliation: | School of Software and Microelectronics |
Defense date: | 2018-11-30 |
English title: | Research and Design of a Medical English Vocabulary Learning System |
English keywords: | medical English vocabulary; vocabulary learning efficiency; adaptive recommendation; vocabulary memory networks; vocabulary repetition rate |
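Abstract: | Medical English vocabulary differs from general English vocabulary: it has distinctive word-formation patterns, pronounced morphemic features, and stronger associations between words. A survey shows that current medical English vocabulary learning resources cannot fully meet learners' needs. Teaching resources are mainly paper materials, which are quite limited, and problems such as low learning efficiency, limited learning content, and low learner motivation persist; an effective medical English vocabulary teaching system is lacking. Regarding learning efficiency, three issues remain unresolved. First, current medical English vocabulary teaching ignores individual differences among learners, and existing teaching methods can hardly recommend vocabulary for study and review dynamically according to each learner's progress. Second, existing English vocabulary learning software neither fully exploits the features of medical English vocabulary nor captures the key points of teaching it, and its teaching process is not fully suited to medical English vocabulary. Third, medical English vocabulary recurs at a low rate, so learners retain it poorly. To solve these problems, this study draws on medical English vocabulary teaching theory and second language acquisition theory, takes advantage of the mobile internet, designs a medical English vocabulary learning system, and proposes three ways to improve learning efficiency. First, it builds an adaptive recommendation model for medical English vocabulary that recommends words dynamically by jointly computing the feature-based influence factors of medical English vocabulary. Second, based on the characteristics of medical English vocabulary, it selects teaching content modules carefully, highlights the teaching focus, optimizes the teaching process, and builds a medical English vocabulary memory network. Third, it recycles vocabulary along multiple dimensions, using several repetition methods to raise the repetition rate. To verify the teaching effect of the system, this study conducted a controlled teaching experiment with 50 second-year non-English-major students at a school in Beijing. The experiment shows that the learning approach designed in this study effectively promotes the acquisition and retention of medical English vocabulary and morphemes and improves learners' word-guessing ability. The system effectively improves medical English vocabulary learning efficiency, eases the limitations of paper-based medical resources, compensates for the limited content of classroom instruction, meets learners' individual needs, and cultivates learners' interest and motivation through diverse learning methods. It offers some reference value for mobile teaching of medical English vocabulary. |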
English abstract: | Unlike general English vocabulary, medical English vocabulary has specific word-formation patterns, and the semantic associations among its words are stronger. Moreover, its morphemic features are distinct. A survey of medical English learners shows that current medical English vocabulary learning resources cannot fully meet their learning needs. The teaching resources are mainly paper materials, which have many limitations. At present, medical English vocabulary learning suffers from problems such as low learning efficiency, limited learning materials, and low learner initiative, and an effective medical English vocabulary learning system is lacking. With regard to low learning efficiency, the following three problems have not been effectively solved. First, current medical English vocabulary teaching ignores individual differences among learners, and existing teaching methods cannot dynamically recommend medical vocabulary according to each learner's situation. Second, existing English vocabulary learning software fails to fully utilize the features of medical English vocabulary and to grasp the key points of medical English vocabulary teaching, and its teaching processes are not fully applicable to medical English vocabulary. Third, the repetition rate of medical English vocabulary is low, which leads to poor retention. To solve these problems, this study draws on medical vocabulary teaching theory and second language acquisition theory, takes advantage of the mobile internet, designs a medical English vocabulary learning system, and proposes the following three ways to improve medical English vocabulary learning efficiency.
First, this study establishes an adaptive recommendation model for medical English vocabulary and realizes dynamic recommendation by comprehensively calculating the influencing factors of medical English vocabulary. Second, based on the distinct features of medical English vocabulary, this study designs appropriate teaching modules, highlights the teaching focus, optimizes the teaching processes, and builds medical English vocabulary memory networks. Third, this study designs various ways to increase the repetition rate of medical English vocabulary. To verify the teaching effect of the system, this study conducted a comparative experiment with 50 non-English-major sophomores at a university in Beijing. The results show that the learning methods effectively promote the acquisition and retention of medical English vocabulary and morphemes and improve learners' ability to guess word meanings. The system effectively improves medical English vocabulary learning efficiency, alleviates the limitations of paper-based medical resources, complements the limited teaching content of traditional medical English vocabulary lessons, meets learners' personalized needs, and cultivates learners' interest and initiative through diversified learning methods. The system has some reference value for mobile teaching of medical English vocabulary. |
Classification number: | H08 |
Total pages: | 82 |
Total references: | 66 |
Release date: | 2018-11-30 |
Title: | 基于多模态理论和图式理论的雅思听说学习系统的研究与设计 |
Name: | |
Student ID: | 1501210821 |
Release status: | Public |
Degree: | |
Department: | |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Advisor 2 name: | |
Advisor 2 affiliation: | School of Software and Microelectronics |
Defense date: | 2018-11-30 |
English title: | Research and Design of IELTS Listening and IELTS Speaking Preparation Application Based on Schema Theory and Multimodality Theory |
English keywords: | IELTS Listening; IELTS Speaking; Schema Theory; Multimodality Theory |
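Abstract: | IELTS is an English language proficiency test for people who plan to study, work, or settle in countries where English is the language of communication; it comprises four parts: listening, speaking, reading, and writing. This thesis focuses on the listening and speaking parts of the Academic test. As more and more students choose to study abroad, the popularity of IELTS keeps rising, and a wide variety of IELTS preparation software has appeared on the market to help students prepare. However, most of this software serves only as a question bank and focuses on practice questions, neglecting the improvement of students' English listening comprehension and speaking ability. Students also encounter many difficulties when preparing for the IELTS listening and speaking tests: they often work through many sets of past papers, yet their scores do not improve. The reason is that they merely drill questions and check answers, and the difficulties they meet go unresolved. Across the whole practice process, the main difficulties are as follows. First, before practice, students do not receive enough comprehensible input. Second, during practice, they do not accurately master the answering techniques for IELTS questions. Third, after practice, they do not receive enough feedback; they do not summarize and analyze their errors promptly and arrange targeted practice; and they receive no further skill training for the IELTS listening and speaking tests. Starting from the characteristics of the IELTS listening and speaking tests and the difficulties students encounter, this work reviews current teaching systems and, based on multimodality theory and schema theory and combined with relevant teaching practice and the characteristics of mobile learning, researches and designs an IELTS listening and speaking learning system. In the listening learning system, the study and testing of listening vocabulary and synonym substitution are arranged before practice; after practice, the transcript is presented multimodally, and dictation exercises and transcript study are provided. In the speaking learning system, the study and testing of speaking vocabulary and the study of complex sentence patterns are arranged before practice; answering techniques are taught during practice; and after practice, model answers are presented multimodally according to the answer framework, with model-answer study provided. Based on this design, the thesis compares the learning scheme of this system with those of previous learning systems and, through experiments, questionnaires, and data analysis, validates the proposed scheme in terms of cognitive load and achievement of learning goals. The results show that, at a similar cognitive load, the scheme designed in this system better helps students achieve their learning goals. Before practice, the system helps students obtain enough comprehensible input; during practice, it builds and reinforces schemas of answering techniques; after practice, it resolves existing errors and problems and helps students obtain sufficient feedback and skill training. This strengthens students' internalization of knowledge, helps them form a virtuous cycle of practice, and makes full use of every set of past papers, so that students gradually raise their scores through practice while genuinely improving their English listening comprehension and speaking ability. |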
外文摘要: | The International English Language Testing System (IELTS) is the world’s most popular English language proficiency test for higher education and global migration, which assesses all English skills including reading, writing, listening and speaking. This paper focuses on the listening and speaking part of IELTS Academic. As more and more students choose to study abroad, the IELTS test is becoming increasingly popular. As a result, a variety of IELTS preparation systems have been developed to help students prepare for IELTS test. However, most existing systems only work as a repository with a focus on taking IELTS exercises, which ignores the cultivation of students’ listening ability and speaking ability. In the process of preparing for the exam, the students also encountered many difficulties. Usually, they have done a lot of exercises, but the results still failed to improve. The reason is because the students just blindly do exercises and check answers. And the difficulties encountered were not solved. From the point of view of the process of doing exercises, the students have the following difficulties: First, before doing exercises, students failed to obtain enough comprehensible input. Second, when doing exercises, students have not accurately grasped the answering technique of the IELTS. Third, after exercises, except insufficient feedback, error analysis and corresponding exercises, they failed to accept further skills training for the IELTS listening and speaking test. Based on Chinese students’ problems in preparation for the IELTS listening and IELTS speaking test, the analyses of the existing systems and the characteristics of IELTS listening and IELTS speaking test, this system is designed on the basis of Multimodality Theory and Schema Theory, combined with relevant teaching practices and the characteristics of mobile learning. In the IELTS listening learning system, listening vocabulary and synonyms are studied and tested before exercises. 
After exercises, the transcript is displayed in a multimodal way, and dictation exercises and transcript study are provided. In the IELTS speaking learning system, the learning and testing of spoken vocabulary and the learning of complex sentence patterns are arranged before exercises. During exercises, the learning of answering techniques is provided. After exercises, model essays are displayed multimodally in accordance with the answer framework, and the study of these model essays is provided. Based on the above design, this paper compares the learning scheme of this system with those of existing systems. Through experiments, questionnaires and data analysis, the proposed learning scheme was verified in terms of cognitive load and learning goal achievement. The results show that, with a similar cognitive load, the learning scheme designed in this paper is more conducive to students’ achievement of learning goals. The system helps students obtain enough comprehensible input before doing exercises, build and strengthen answering-technique schemata while doing exercises, and resolve errors and problems and obtain sufficient feedback and further skills training after exercises. As a result, the system can enhance students’ internalization of knowledge, help students form a virtuous cycle of practice, make full use of each set of past papers, and thus improve students’ English listening comprehension and speaking ability while gradually improving their IELTS scores. |
分类号: | G43 |
论文总页数: | 111 |
参考文献总数: | 85 |
公开日期: | 2018-11-30 |
题名: | 基于模拟方法的技术写作同源开发教学研究 |
姓名: | |
学号: | 1501210755 |
公开时间: | 公开 |
学位: | |
院系: | |
导师1姓名: | |
导师1单位: | 软件与微电子学院 |
导师2姓名: | |
导师2单位: | 软件与微电子学院 |
论文答辩日期: | 2018-11-30 |
外文题名: | Research on Single Sourcing Teaching Based on Simulation-based Method |
外文关键词: | Technical writing Single sourcing teaching Simulation-based teaching method Instructional design |
论文摘要: | 技术文档开发需求日益增长,同源开发方法应运而生。作为一种文档开发方法论,同源开发强调了通过内容模块化实现文档系统性复用的重要思想。同源开发是北京大学技术传播教学体系中的重要组成部分,教学目标以掌握其基本原理、思路及流程为核心,帮助学生完成线性文档的拆解与模块化文档的生成。 总结近几年的教学经验,学生在掌握同源开发上仍存在三项突出问题:1) 主题识别困难,识别结果准确度低;2) 写作过程中的技术原理掌握不到位;3) 主题组织条理不清,文档架构混乱。这些问题与同源开发学习内容、教学方式及教学工具有着密不可分的关系。 本文通过调查与访谈进一步探究教学问题的症结,总结学习者及行业需求,调研现有教学工具,创新性地设计了CCMS教学模拟器SuperEasyDITA,填补同源开发教学工具的空白,并基于该工具,对症下药展开教学方法设计,具体采用了:1) 主题逆向拆解匹配、正向分析识别的双向教学模式,解决主题识别困难;2) 写作方式难度渐进、写作过程技术提醒、写作成品切换修改,提升技术原理掌握程度;3) 学生自主构建情境、同伴协作讨论架构,优化文档组织架构思路与方法;4) 引导性反馈与讨论,加深对同源开发原理、思路与流程的整体理解,深化各过程学习要点。 为验证教学方法的有效性,本研究依托北京大学2017、2018级技术传播专业课程选修学生开展了教学实验,其中,实验组采用基于模拟器的教学方法,对照组采用传统教学方法。研究结果表明,教学模拟器SuperEasyDITA满足了文档同源开发基本功能需求,在教学有用性、易用性及创新性上都较传统教学工具有显著优势;基于模拟器的教学方法有助于学生掌握XML相关技术原理,提升写作过程中技术原理的掌握程度;有助于学生识别主题类型,提升主题识别准确度;有助于学生组织主题,改善文档组织架构;在整体教学效果上提升教学效率的同时解决了教学问题,能真正有效帮助学生更好地掌握同源开发知识体系。 |
外文摘要: | As the demand for technical documentation grows, the document development methodology known as single sourcing has emerged. Single sourcing emphasizes the systematic reuse of documents through content modularization. Single sourcing teaching is an important part of Peking University's technical communication curriculum. Its teaching objectives center on the basic principles, ideas and processes of single sourcing, and on helping students disassemble linear documents and generate modular documents. Summarizing the teaching experience of recent years, this paper finds that students still have three prominent problems in learning single sourcing: 1) difficulty in identifying topic types, with low recognition accuracy; 2) difficulty in understanding XML-related technical knowledge; 3) difficulty in organizing topics, resulting in unclear document structure. These problems are closely linked to the single sourcing learning content, teaching methods and teaching tools. This paper explores the crux of these problems through surveys and interviews, summarizes learner and industry needs, analyzes existing teaching tools, and innovatively designs a CCMS teaching simulator called SuperEasyDITA to fill the gap in single sourcing teaching tools. Based on SuperEasyDITA, this paper designs a teaching method aimed at solving the current problems. 
Specifically, it adopts: 1) topic disassembly and matching from standard modular documents, and topic analysis and recognition from linear documents, which addresses the difficulty of topic recognition; 2) writing modes of gradually increasing difficulty, instant technical reminders during the writing process, and observation, switching and modification of the final document, which improve the mastery of technical knowledge; 3) student-constructed document use situations and collaborative peer discussion, which strengthen document organizing ideas and methods; 4) instructive feedback and discussion, which deepen the understanding of the overall principles, ideas and processes. To verify the effectiveness of this method, teaching experiments were carried out with students enrolled in the 2017 and 2018 technical communication courses at Peking University. The experimental group used the simulator-based teaching method and the control group used the traditional teaching method. The results show that the teaching simulator SuperEasyDITA satisfies the basic functional requirements of single sourcing and has significant advantages over traditional teaching tools in teaching usefulness, ease of use and innovation. The simulator-based teaching method helps students improve topic recognition accuracy, understand XML and related technical knowledge in the writing process, and organize topics and improve document organization. Overall, it improves teaching efficiency while solving the teaching problems, and can effectively help students better master the knowledge of single sourcing. |
分类号: | H08 |
论文总页数: | 95 |
参考文献总数: | 69 |
公开日期: | 2018-11-30 |
题名: | 指称理论对于生成语法的必要性 |
姓名: | |
学号: | 1401213083 |
公开时间: | 1年后 |
学位: | |
院系: | |
导师1姓名: | |
导师1单位: | 外国语学院 |
论文答辩日期: | 2018-06-06 |
外文题名: | On the Position of Reference Theory in Generative Grammar |
外文关键词: | Generative Grammar reference notion of satisfaction necessity FI principle |
论文摘要: | 索绪尔认为语言是一个结构系统,能指和所指是语言符号互补的两个方面。弗雷格在对意义和指称进行区分的基础上,认为语言符号表达意义,指称个体的人或事物。乔姆斯基在处理语言的语义问题时,曾多次否认指称是人类语言系统的组成部分。但通过对生成语言学发展历程的梳理,本文发现,指称对于该语言学理论具有十分重要的作用。本文首先对指称理论进行讨论,梳理了弗雷格、罗素和斯特劳森三位代表性哲学家关于指称的观点,并结合塔斯基的满足概念,尝试将指称理论与生成语法的句法运算联系起来,继而以此理论关联为切入点,探讨指称对于生成语法的必要性。在早期以范畴为基础的规则系统,即短语结构语法中,句子被改写成由句法范畴构成的结构系统,然后在每个范畴内选取具体的词语,构成实际使用的语言。这样,这种语言生成方法就不会涉及到指称的问题。而在后来的原则-参数理论中,如果不考虑指称,DP和IP就无法满足扩展的投射原则,从而会导致句法运算的失败。而在最简方案中,完全解释原则要求语言单位在句法运算的每一步都能得到完全解释,即将每一语言单位在每一步运算所产生的结果都解释为意义和语音的结合体。而在最简方案中,如果不考虑指称,DP以及IP,包括时态、情态动词等,都无法得到完全解释,这样句法运算就会“崩溃”(crash)。基于此,本文得出结论,指称对于生成语言学是十分必要的。 |
外文摘要: | Saussure considers language as a structured system, with signified and signifier as its two complementary facets. Frege’s theory of reference, based on the distinction between sense and referent, claims that a language sign expresses its sense and denotes its referent. Chomsky, in his treatment of semantic problems, repeatedly rejects reference as part of the human language system. However, a brief survey of the historical development of Generative Grammar reveals that the reference relation cannot be neglected; instead, it plays a very important role in language computation. This thesis investigates the necessity of reference in Generative Grammar. By surveying the reference theories of Frege, Russell and Strawson, this thesis finds that DP can be defined by more primitive elements, namely undetermined variables, and that only by assigning a truth value to the variable does a DP denote a person or an object in the world. Tarski’s notion of satisfaction defines truth through syntax and, when connected with the definition of DP, can be used to test whether the reference relation is necessary for Generative Grammar. In the early category-based rule system, reference is not involved in language computation. According to this system, a sentence is rewritten as a syntactic structure composed of syntactic categories; a word is then picked from each category to produce a terminal sentence. In the Principles and Parameters model, syntactic levels like DP and IP cannot satisfy their respective sentential functions without considering reference, which violates the Extended Projection Principle; the language computation therefore cannot proceed, because projection is the basic mode of language computation in this model. In the Minimalist Program of the Principles and Parameters model, DP and IP cannot receive full interpretation without considering reference. 
Therefore, the syntactic computation will crash, because the Full Interpretation (FI) Principle is a general property of natural language. Based on these arguments, the thesis concludes that reference is necessary for Generative Grammar. |
分类号: | H04 |
论文总页数: | 64 |
参考文献总数: | 49 |
公开日期: | 2019-06-06 |
题名: | 英汉翻译中的变通与忠实 |
姓名: | |
学号: | 1601213263 |
公开时间: | 公开 |
学位: | |
院系: | |
导师1姓名: | |
导师1单位: | 中国人民大学外国语学院 |
论文答辩日期: | 2018-05-27 |
外文题名: | Flexibility and Fidelity in E-C Translation |
论文摘要: | 在翻译实践中,译者发现,除了理解原文外,翻译的主要工作在于克服语言差异——同一个意思,英语里这样说,汉语里则要换一种说法才能理解。语言间的差异永远存在,怎样变换“说法”以达意就成了翻译的永恒课题。当然,翻译并不总是在变,本翻译报告提出的论点是:变通和忠实是翻译中的两大原则。首先,作为从一种语言到另一种语言的转换,翻译总体上是一个语言上的归化过程,势必涉及到语言上的变通,才能调和两种语言在语法、表达习惯等方面的差异,达到翻译的主要目的:传达意义;除语言变通之外,翻译自然应有不“变”之处,即应忠实于原作的地方,本报告将“忠实”这一概念的内涵界定为意义、语言风格、术语三方面的忠实。为阐述这一论点,报告针对《消费时代的迷思》一书的语言特点举例探讨了多种变通策略,如抽象名词的翻译、插入语的处理、 |
分类号: | H059 |
论文总页数: | 276 |
参考文献总数: | 14 |
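Call number: | 039/M2018(103) |
Release date: | 2018-05-27 |
Title: | 基于深度学习的文本语句扩展系统的设计与实现 (Design and Implementation of a Deep-Learning-Based Text Sentence Expansion System) |
Author: | |
Student ID: | 1501210770 |
Major: | |
Degree level: | Master's |
Degree-granting institution: | Peking University |
Advisor name: | |
Advisor affiliation: | School of Software and Microelectronics |
Defense date: | 2018-05-26 |
Classification number: | TP3 |
Total pages: | 60 |
Number of references: | 38 |
References: |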
[1] Mccoy K F. Simple NLP Techniques for Expanding Telegraphic Sentences[J]. Sentences Natural Language Processing for Communication Aids,1997, 2007.
[2] Artificial Neural Networks[J]. Encyclopedia of Microfluidics & Nanofluidics:23-33. [3] Rosenblatt F. The perceptron: a probabilistic model for information storage and organization in the brain[M] Neurocomputing: foundations of research. MIT Press, 1988:386-408. [4] Jeffrey L. Elman. Finding Structure in Time[J]. Cognitive Science,1990, 14(2):179-211. [5] Gregor K, Danihelka I, Graves A, et al. DRAW: a recurrent neural network for image generation[J]. Computer Science, 2015:1462-1471. [6] Mikolov T, Karafiát M, Burget L, et al. Recurrent neural network based language model[C]// INTERSPEECH 2010, Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September. DBLP, 2010:1045-1048. [7] Li L, Jin L, Jiang Z, et al. Biomedical named entity recognition based on extended Recurrent Neural Networks[C]// IEEE International Conference on Bioinformatics and Biomedicine. IEEE, 2015:649-652. [8] Cho K, Van Merrienboer B, Gulcehre C, et al. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[J]. Computer Science, 2014. [9] Fan E G. Extended Tanh-function Method and its Applications to Nonlinear Equations[J]. Physics Letters A, 2000, 277(4):212-218. [10] Hecht-Nielsen R. Theory of the backpropagation neural network[M]. Neural networks for perception (Vol. 2). Harcourt Brace & Co. 1992:593-605 vol.1. [11] Schmidhuber J. Deep learning in neural networks[M]. Elsevier Science Ltd. 2015. [12] Schuster M, Paliwal K K. Bidirectional recurrent neural networks[J]. IEEE Transactions on Signal Processing, 2002, 45(11):2673-2681. [13] Hochreiter S. LSTM can solve hard long time lag problems[C]// International Conference on Neural Information Processing Systems. MIT Press, 1996:473-479. [14] Hochreiter S, Schmidhuber J. Long Short-Term Memory[J]. Neural Computation, 1997, 9(8):1735-1780. [15] Graves A, Schmidhuber J. 
Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Netw, 2005, 18(5):602-610. [16] Gers F A, Schraudolph N N. Learning precise timing with lstm recurrent networks[M]. JMLR.org, 2003. [17] Gers F A, Schmidhuber J, Cummins F. Learning to forget: continual prediction with LSTM[J]. Neural Computation, 2000, 12(10):2451-2471. [18] Cho K, Van Merrienboer B, Gulcehre C, et al. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[J]. Computer Science, 2014. [19] Fukushima K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position[J]. Biological Cybernetics, 1980, 36(4):193-202. [20] Lecun Y, Boser B, Denker J S, et al. Backpropagation Applied to Handwritten Zip Code Recognition[J]. Neural Computation, 2014, 1(4):541-551. [21] Yin W, Schütze H, Xiang B, et al. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs[J]. Computer Science, 2015. [22] Wang L, Cao Z, Melo G D, et al. Relation Classification via Multi-Level Attention CNNs[C]// Meeting of the Association for Computational Linguistics. 2016:1298-1307. [23] Zhu J, Qiao J, Dai X, et al. Relation Classification via Target-Concentrated Attention CNNs[J]. 2017:137-146. [24] Bengio Y, Ducharme R, Vincent P, et al. A neural probabilistic language model.[M] Innovations in Machine Learning. Springer Berlin Heidelberg, 2006:137-186. [25] Bojanowski P, Grave E, Joulin A, et al. Enriching Word Vectors with Subword Information[J]. 2016. [26] Pennington J, Socher R, Manning C. Glove: Global Vectors for Word Representation[C]// Conference on Empirical Methods in Natural Language Processing. 2014:1532-1543. [27] Sutskever I, Vinyals O, Le Q V. Sequence to Sequence Learning with Neural Networks[J]. 2014, 4:3104-3112. [28] Jaitly N, Sussillo D, Le Q V, et al. A Neural Transducer[J]. Computer Science, 2016. [29] Vinyals O, Le Q. 
A Neural Conversational Model[J]. Computer Science, 2015. [30] Jean S, Cho K, Memisevic R, et al. On Using Very Large Target Vocabulary for Neural Machine Translation[J]. Computer Science, 2015. [31] Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate[J]. Computer Science, 2014. [32] Jaitly N, Sussillo D, Le Q V, et al. A Neural Transducer[J]. Computer Science, 2016. [33] Britz D, Goldie A, Luong M T, et al. Massive Exploration of Neural Machine Translation Architectures[J]. 2017. [34] Papineni S. BLEU: A Method for Automatic Evaluation of Machine Translation[C]// Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2002. [35] Gehring J, Auli M, Grangier D, et al. Convolutional Sequence to Sequence Learning[J]. 2017. [36] Srivastava N, Hinton G, Krizhevsky A, et al. Dropout: a simple way to prevent neural networks from overfitting[J]. Journal of Machine Learning Research, 2014, 15(1):1929-1958. [37] Hinton G E, Srivastava N, Krizhevsky A, et al. Improving neural networks by preventing co-adaptation of feature detectors[J]. Computer Science, 2012, 3(4): 212-223. [38] Kingma D, Ba J. Adam: A Method for Stochastic Optimization[J]. Computer Science, 2014. |
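Call number: | 017/M2018(311) |
Release date: | 2021-05-26 |
Title: | 基于多人在线战术竞技游戏的虚拟团队数据分析与研究 (Data Analysis and Research on Virtual Teams Based on Multiplayer Online Battle Arena Games) |
Name: | |
Student ID: | 1401210506 |
Major: | |
Degree level: | Master's |
Degree-granting institution: | Peking University |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Thesis defense date: | 2018-05-26 |
Abstract: | Computational social science (CSS) is an interdisciplinary field combining computer technology and social science. This study is a concrete instance of the field's research on individual and group behavior and is supported by funding in this field. Its goal is to quantify, track, and predict the common and anomalous behavioral trajectories of individual and team performance in gamified virtual-team environments, with the aim of helping virtual and real-world teams improve their overall performance and supporting personalized incentives on online gamified platforms. The novelty and significance of this work lie in its targeted solutions to four kinds of problems in research on individual behavior and performance. Individual behavior is usually modeled in a one-size-fits-all manner; this thesis builds personalized models of individuals from multiple perspectives, including role, experience, skill, and team network structure. Individual behavior models usually contain no temporal dynamics; all the models in this thesis take into account how human behavioral trajectories evolve over time. Individual behavior models usually ignore social network effects; Chapter 5 focuses on the effects of different network structures. Individual behavior models usually lack generality, reproducibility, testability, and interpretability; the methods in this thesis are all interpretable and reproducible, and the experimental results show that the conclusions generalize across game platforms. The thesis first builds a model of how individual performance evolves over time, analyzing player data from the multiplayer online battle arena (MOBA) game League of Legends. Through regression analysis of long-term behavior and game-session analysis of short-term behavior, the data reveal conclusions that run counter to common intuition: individual performance deteriorates within a short-term game session, and improvement in individual performance is not directly related to long-term experience, although experience can mitigate short-term performance deterioration. The thesis uses machine learning algorithms to build a nested model that accurately predicts when a player will choose to continue or end the current game session, revealing the key factors behind that decision. Building on this temporal model, the thesis then constructs a model of how individual performance evolves with role selection, using player data from the MOBA game Dota 2. Roles are defined through statistical analysis, and the results show a short-term warm-up effect in individual behavior across roles. The model classifies individuals by experience, skill, and role, and the experimental results reveal patterns of success for individual players. Finally, again building on the temporal model, the thesis models the effect of network structure on individual and team performance, using not only massive MOBA game data but also data on players' real friendship relations. Team network structures are subdivided, and network science, economic principles, and mathematical statistics are applied to analyze how individual and team behavior and performance evolve over time. The results show that low-ability teams benefit from positive externalities generated by the players who make up the network structure, which improves the behavior and performance of both the individuals in the team and the team as a whole. High-level teams need to deliberately pair low-level individuals with high-level individuals, internalizing negative externalities to help improve team and individual performance. The experimental results also show that close ties within a team help mitigate the short-term performance deterioration effect. Although this is a domain-specific study, its theoretical results, dynamic models, and analytical methods can all be applied to more abstract contexts of describing and explaining human behavior. |
Classification number: | TP3 |
Total pages: | 127 |
Total references: | 72 |
Reference list: |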
[1] Ajzen I. The theory of planned behavior, organizational behavior and human decision processes.[J]. Journal of Leisure Research, 1991, 50(2):176-211.
[2] Hamari J, Koivisto J, Sarsa H. Does Gamification Work? -- A Literature Review of Empirical Studies on Gamification.[C] Hawaii International Conference on System Sciences. IEEE, 2014:3025-3034. [3] Farzan R, Dimicco J M, Millen D R, et al. Results from deploying a participation incentive mechanism within the enterprise.[C] Sigchi Conference on Human Factors in Computing Systems. ACM, 2008:563-572. [4] Hey T. The Fourth Paradigm – Data-Intensive Scientific Discovery.[J]. Proceedings of the IEEE, 2011, 99(8):1334-1337. [5] Lazer D, Pentland A, Adamic L, et al. Life in the network: the coming age of computational social science.[J]. Science, 2016, 323(5915):721-723. [6] Conte R, Gilbert N, Bonelli G, et al. Manifesto of computational social science.[J]. European Physical Journal Special Topics, 2012, 214(1):325-346. [7] Lazer D, Pentland A, Adamic L, et al. Social science. Computational social science.[J]. Science, 2009, 323(5915):721-3. [8] Centola D. The Spread of Behavior in an Online Social Network Experiment.[J]. Science, 2010, 329(5996):1194-1197. [9] Calvó-Armengol A, Jackson M O. Like Father, Like Son: Social Network Externalities and Parent-Child Correlation in Behavior.[J]. American Economic Journal Microeconomics, 2009, 1(1):124-150. [10] Lewis K, Gonzalez M, Kaufman J. Social selection and peer influence in an online social network.[J]. Proceedings of the National Academy of Sciences of the United States of America, 2012, 109(1):68-72. [11] Chudoba K M, Wynn E, Lu M, et al. How virtual are we? Measuring virtuality and understanding its impact in a global organization.[J]. Information Systems Journal, 2005, 15(4):279–306. [12] Townsend A M, Hendrickson A R. Virtual Teams: Technology and the Workplace of the Future.[J]. Academy of Management Executive, 1998, 12(3):17-29. [13] Richard H J, Nancy K. Group Behavior and Performance.[M]// Handbook of Social Psychology. 2010:1258-63. [14] Hertel G, Niedner S, Herrmann S. 
Motivation of software developers in Open Source projects: an Internet-based survey of contributors to the Linux kernel.[J]. Research Policy, 2003, 32(7):1159-1177. [15] Clark J, Leavitt A, Williams D. Online Games, Community Aspects of.[M] The International Encyclopedia of Digital Communication and Society. John Wiley & Sons, Inc. 2015. [16] Huang Y, Ye W, Bennett N, et al. Functional or social?:exploring teams in online games.[C] Conference on Computer Supported Cooperative Work. 2013:399-408. [17] Ducheneaut N, Moore R J. The social side of gaming: a study of interaction patterns in a massively multiplayer online game.[C] ACM Conference on Computer Supported Cooperative Work. ACM, 2004:360-369. [18] Shen C. Network patterns and social architecture in Massively Multiplayer Online Games: Mapping the social world of EverQuest II.[J]. New Media & Society, 2014, 16(4):672-691. [19] Assmann J J, Drescher M A, Gallenkamp J V, et al. MMOGs as Emerging Opportunities for Research on Virtual Organizations and Teams.[C] Americas Conference on Information Systems, Amcis 2010, "sustainable It Collaboration Around the Globe.", Lima, Peru, August. DBLP, 2010:335. [20] Goh S, Wasko M. The effects of leader-member exchange on member performance in virtual world teams.[J]. Journal of the Association for Information Systems, 2012, 13(10):861-885. [21] Nardi B, Harris J. Strangers and Friends: Collaborative Play in World of Warcraft.[C] ACM Conference on Computer Supported Cooperative Work, CSCW 2006, Banff, Alberta, Canada, November. DBLP, 2006:149-158. [22] Kou Y, Gui X. Playing with strangers: understanding temporary teams in league of legends.[C] ACM Sigchi Symposium on Computer-Human Interaction in Play. ACM, 2014:161-169. [23] Park K, Cha M, Kwak H, et al. Achievement and Friends: Key Factors of Player Retention Vary Across Player Levels in Online Multiplayer Games[C]// International Conference on World Wide Web Companion. 
International World Wide Web Conferences Steering Committee, 2017:445-453. [24] Bardzell S, Bardzell J, Pace T, et al. Blissfully productive: grouping and cooperation in world of warcraft instance runs.[C] ACM Conference on Computer Supported Cooperative Work. ACM, 2008:357-360. [25] Tyack A, Wyeth P, Johnson D. The Appeal of MOBA Games: What Makes People Start, Stay, and Stop[C]// Symposium on Computer-Human Interaction in Play. ACM, 2016:313-325. [26] Benefield G A, Shen C, Leavitt A. Virtual Team Networks: How Group Social Capital Affects Team Success in a Massively Multiplayer Online Game.[C] ACM Conference on Computer-Supported Cooperative Work & Social Computing. ACM, 2016:679-690. [27] Kim J, Keegan B C, Park S, et al. The Proficiency-Congruency Dilemma: Virtual Team Design and Performance in Multiplayer Online Games.[J]. Computer Science, 2015:4351-4365. [28] Leavitt A, Keegan B C, Clark J. Ping to Win?: Non-Verbal Communication and Team Performance in Competitive Online Multiplayer Games.[C] CHI Conference on Human Factors in Computing Systems. ACM, 2016:4337-4350. [29] Kim Y J, Engel D, Mcarthur N, et al. What Makes a Strong Team?: Using Collective Intelligence to Predict Team Performance in League of Legends.[C] ACM Conference on Computer Supported Cooperative Work and Social Computing. ACM, 2017:2316-2329. [30] Huang J, Zimmermann T, Nagapan N, et al. Mastering the art of war:how patterns of gameplay influence skill in Halo.[C] Sigchi Conference on Human Factors in Computing Systems. 2013:695-704. [31] Vicencio-Moreira R, Mandryk R L, Gutwin C. Now You Can Compete With Anyone: Balancing Players of Different Skill Levels in a First-Person Shooter Game.[C] ACM Conference on Human Factors in Computing Systems. ACM, 2015:2255-2264. [32] Sievertsen H H, Gino F, Piovesan M. Cognitive fatigue influences students’ performance on standardized tests.[J]. Proceedings of the National Academy of Sciences of the United States of America, 2016, 113(10):2621. 
[33] Borghini G, Astolfi L, Vecchiato G, et al. Measuring neurophysiological signals in aircraft pilots and car drivers for the assessment of mental workload, fatigue and drowsiness.[J]. Neuroscience & Biobehavioral Reviews, 2014, 44:58-75. [34] Muraven M, Baumeister R F. Self-regulation and depletion of limited resources: does self-control resemble a muscle?[J]. Psychological Bulletin, 2000, 126(2):247-59. [35] Kooti F, Moro E, Lerman K. Twitter Session Analytics: Profiling Users’ Short-Term Behavioral Changes.[M] Social Informatics. Springer International Publishing, 2016:71-86. [36] Singer P, Ferrara E, Kooti F, et al. Evidence of Online Performance Deterioration in User Sessions on Reddit[J]. Plos One, 2016, 11(8):e0161636. [37] Scerbo M W. Stress, Workload and Boredom in Vigilance: A Problem and an Answer.[J]. Stress Workload & Fatigue, 2001. [38] Warm J S, Matthews G, Finomore V S Jr. Vigilance, workload, and stress.[J]. Performance under stress, 2008:115-41. [39] Boksem M A, Tops M. Mental fatigue: costs and benefits.[J]. Brain Research Reviews, 2008, 59(1):125-139. [40] Marcora S M, Staiano W, Manning V. Mental fatigue impairs physical performance in humans.[J]. Journal of Applied Physiology, 2009, 106(3):857-64. [41] Lim J, Wu W C, Wang J, et al. Imaging brain fatigue from sustained mental workload: an ASL perfusion study of the time-on-task effect[J]. Neuroimage, 2010, 49(4):3426-3435. [42] Pattyn N, Neyt X, Henderickx D, et al. Psychophysiological investigation of vigilance decrement: boredom or cognitive fatigue?[J]. Physiology & Behavior, 2008, 93(1-2):369. [43] Lorist M M, Boksem M A S, Ridderinkhof K R. Impaired cognitive control and reduced cingulate activity during mental fatigue.[J]. Brain Research Cognitive Brain Research, 2005, 24(2):199. [44] Boksem M A, Meijman T F, Lorist M M. Effects of mental fatigue on attention: an ERP study[J]. Brain Res Cogn Brain Res, 2005, 25(1):107-116. [45] Boksem M A, Meijman T F, Lorist M M. 
Mental fatigue, motivation and action monitoring.[J]. Biological Psychology, 2006, 72(2):123-132. [46] Demerouti E, Bakker A B, Nachreiner F, et al. The job demands-resources model of burnout.[J]. J Appl Psychol, 2001, 86(3):499-512. [47] G. Robert J. Hockey, A. John Maule, Peter J. Clough, et al. Effects of negative mood states on risk in everyday decision making.[J]. Cognition & Emotion, 2000, 14(6):823-855. [48] Sanders A F. Elements of human performance:, Reaction processes and attention in human skill.[M] Elements of Human Performance: Reaction Processes and Attention in Human Skill. Lawrence Erlbaum Associates, 1998:231-234. [49] Van d L D, Frese M, Meijman T F. Mental fatigue and the control of cognitive processes: effects on perseveration and planning.[J]. Acta Psychologica, 2003, 113(1):45. [50] Danziger S, Levav J, Avnaimpesso L. Extraneous factors in judicial decisions.[J]. Proceedings of the National Academy of Sciences of the United States of America, 2011, 108(17):6889. [51] Vohs K D, Baumeister R F, Schmeichel B J, et al. Making choices impairs subsequent self-control: a limited-resource account of decision making, self-regulation, and active initiative.[J]. Journal of Personality & Social Psychology, 2008, 94(5):883-98. [52] Mullettegillman O A, Leong R L, Kurnianingsih Y A. Cognitive Fatigue Destabilizes Economic Decision Making Preferences and Strategies.[J]. 2015, 10(7). [53] Page S E. The Difference:How the Power of Diversity Creates Better Groups, Firms, Schools, and Societies (New Edition).[M]. Princeton University Press, 2008. [54] Jia P, Mirtabatabaei A, Friedkin N E, et al. Opinion Dynamics and the Evolution of Social Power in Influence Networks.[J]. Siam Review, 2013, 57(3):367-397. [55] Woolley A W, Chabris C F, Pentland A, et al. Evidence for a collective intelligence factor in the performance of human groups.[J]. Science, 2010, 330(6004):686-688. [56] Ferrara E, Alipourfard N, Burghardt K, et al. 
Dynamics of Content Quality in Collaborative Knowledge Production.[J]. 2017. [57] Halfaker A, Keyes O, Kluver D, et al. User Session Identification Based on Strong Regularities in Inter-activity Time.[C] International World Wide Web Conferences Steering Committee, 2015:410-418. [58] Ho T K. The Random Subspace Method for Constructing Decision Forests[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 1998, 20(8):832-844. [59] Friedman J, Hastie T, Tibshirani R. The Elements of Statistical Learning.[J]. Journal of the Royal Statistical Society, 2001, 167(1):267-268. [60] Friedman J H. Greedy Function Approximation: A Gradient Boosting Machine.[J]. Annals of Statistics, 2001, 29(5):1189-1232. [61] Freund Y, Schapire R, Abe N. A short introduction to boosting.[J]. Journal-Japanese Society For Artificial Intelligence, 1999, 14:771-780. [62] Schapire R E, Singer Y. Improved Boosting Algorithms Using Confidence-rated Predictions.[J]. Machine Learning, 1999, 37(3):297-336. [63] Radicchi F, Fortunato S, Markines B, et al. Diffusion of scientific credits and the ranking of scientists.[J]. Physical Review E Statistical Nonlinear & Soft Matter Physics, 2009, 80(2):056103. [64] Sinatra R, Wang D, Deville P, et al. Quantifying the evolution of individual scientific impact.[J]. Science, 2016, 354(6312):aaf5239-aaf5239. [65] Rodi G C, Loreto V, Servedio V D P, et al. Optimal Learning Paths in Information Networks.[J]. Scientific Reports, 2015, 5:10286. [66] Memmert D, Lemmink K A, Sampaio J. Current Approaches to Tactical Performance Analyses in Soccer Using Position Data.[J]. Sports Medicine, 2016:1-10. [67] Cha M, Haddadi H, Benevenuto F, et al. Measuring User Influence in Twitter: The Million Follower Fallacy.[C] International Conference on Weblogs and Social Media, Icwsm 2010, Washington, Dc, Usa, May. DBLP, 2010. [68] Hong L, Dan O, Davison B D. 
Predicting popular messages in Twitter.[C] International Conference on World Wide Web, WWW 2011, Hyderabad, India, March 28 - April. DBLP, 2011:57-58. [69] Movshovitz-Attias D, Movshovitz-Attias Y, Steenkiste P, et al. Analysis of the reputation system and user contributions on a question answering website: StackOverflow.[C] Ieee/acm International Conference on Advances in Social Networks Analysis and Mining. ACM, 2013:886-893. [70] Pobiedina N, Neidhardt J, Moreno M D C C, et al. On Successful Team Formation: Statistical Analysis of a Multiplayer Online Game.[C] Business Informatics. IEEE, 2013:55-62. [71] Becker R, Chernihov Y, Shavitt Y, et al. An analysis of the Steam community network evolution.[C]// Electrical & Electronics Engineers in Israel. IEEE, 2012:1-5. [72] Blackburn J, Kourtellis N, Skvoretz J, et al. Cheating in Online Games: A Social Network Perspective.[J]. Acm Transactions on Internet Technology, 2014, 13(3):9. |
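Call number: | 017/M2018(336) |
Release date: | 2019-05-26 |
Title: | 基于神经网络的影视剧向量表示模型 (A Video Content Embedding Model Using Neural Networks) |
Author: | |
Student ID: | 1501210674 |
Language: | chi |
Release time: | After 3 years |
Degree: | |
Department: | |
Advisor affiliation: | School of Software and Microelectronics |
Defense date: | 2018-05-26 |
Title (foreign language): | A Video Content Embedding Model Using Neural Networks |
Abstract: | As video websites continue to grow, both the volume of film and television data and the number of users have risen sharply, creating strong demand for tasks such as automatic classification and recommendation of films and television series. Traditionally, the classification information on video websites comes mostly from human editors, while recommender systems rely mainly on user behavior data and collaborative filtering. Because annotation labor is limited and data are sparse, the scalability of manual classification is a major bottleneck, and recommendations for unpopular titles or new users are also limited. This thesis uses neural networks to reduce the dimensionality of, and integrate, heterogeneous text data from different sources, such as tags and plot synopses of films and television series, mapping the raw text into a semantic space to obtain content-based low-dimensional vector representations. In deep learning, such a distributed vector representation model is called an embedding model; in recent years it has received wide attention and study in natural language processing and has achieved breakthroughs on many tasks. The thesis first studies how to model text data at different granularities, surveys the concepts and methods of distributed semantic representation models at the word, phrase, sentence, and paragraph levels, and discusses how to apply them to films and television series. Second, based on neural networks, it builds a vector representation model of film and television content; using an improved negative-sampling training strategy, it fuses text metadata of different granularities and from different sources into vector representations in a single consistent semantic space. The study shows that a distributed vector representation model based on neural networks can effectively model the content of existing films and television series and can be applied to newly added titles. The model can be used for tasks such as automatic recommendation and clustering. |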
Abstract (foreign language): | With the continued growth of online video providers, the number of movies and television series available online has risen significantly, along with the amount of user data. This has created great demand for tasks such as the automatic classification and recommendation of video content. Traditionally, the classification information on video sites often comes from human editors, while recommendation systems rely mainly on user behavior data and collaborative filtering algorithms. Because labeling man-hours are limited and data are sparse, the scalability of manual classification is a major bottleneck, and the recommendation results for unpopular titles or new users are also limited. In this dissertation, neural networks are used to reduce the dimensionality of, and integrate, heterogeneous text data from different sources, such as the tags and synopses of movies and television series. By mapping raw text into a semantic space, we obtain content-based low-dimensional vector representations. This kind of distributed vector representation model is called an embedding model in deep learning. In recent years it has received extensive attention and study in natural language processing and has achieved breakthroughs on many tasks. First, this dissertation studies how to model text data at different granularities, reviews the concepts and methods of distributed semantic representation models for words, phrases, sentences, and paragraphs, and discusses how to apply these models to movies and television series. Second, a vector representation model of video content is built using neural networks. Text metadata of different granularities and from different sources are merged and mapped into vector representations in a consistent semantic space via an improved negative sampling training strategy. 
The study shows that a neural distributed vector representation model can effectively model the content of existing movies and television series and can be readily applied to newly added content. The model can be applied to automatic recommendation, clustering, and other tasks. |
Classification number: | TP3 |
Total pages: | 61 |
Number of references: | 60 |
References: |
Andrew G, Arora R, Bilmes J A, et al. 2013. Deep canonical correlation analysis. ICML, 1247-1255.
Bach F R, Jordan M I. 2002. Kernel independent component analysis. Journal of Machine Learning Research, Issue 3, 1-48. Baroni M, Dinu G, Kruszewski G. 2016. Don't count, predict! a systematic comparison of context-counting vs. context-predicting semantic vectors. Proc. ACL, Volume 1, 238–247. Bengio Y, Ducharme R, Vincent P, et al. 2003. A neural probabilistic language model. Journal of Machine Learning Research, Issue 3, 1137-1155. Blei D. M., Ng A. Y., Jordan M. I. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research, Issue 3, 993–1022. Bojanowski P., Grave E., Joulin A, Mikolov, T., 2017. Enriching word vectors with subword information. arXiv: 1607.04606. Boureau Y-L, Ponce J, LeCun Y. 2010. A theoretical analysis of feature pooling in visual recognition. ICML, 111-118. Bullinaria J A, Levy J P. 2012. Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD. Behavior research methods , 44(3), 890-907. Chandar S, Lauly S, Larochelle H, et al. 2014. An autoencoder approach to learning bilingual word representations. Proceedings of NIPS 2014. Cho K, van Merrienboer B, Gulcehre C, et al. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv:1406.1078. Chung J, Gulcehre C, Cho K, et al. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv: 1412.3555. Cohn D A, Hofmann T. 2000. The missing link -- A probabilistic model of document content and hypertext connectivity.. NIPS, 430-436. Collobert R, Weston J. 2008. A unified architecture for natural language processing: deep neural networks with multitask learning. ICML, 160–167. Collobert R, Weston J, Bottou L, et al. 2011. Natural language processing (almost) from scratch. Journal of Machine Learning Research, Issue 12, 2493--2537. Conneau A, Lample G, Ranzato M, et al. 2017. Word translation without parallel data. arXiv: 1710.04087. 
Conneau A, Schwenk H, Barrault L, et al. 2017. Very Deep Convolutional Networks for Text Classification. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Volume 1, 1107-1116. Cybenko G. 1989. Approximations by superpositions of sigmoidal functions. Mathematics of Control, Signals, and Systems, 2(4), 303-314. Deerwester S, Dumais S T, Furnas G W, et al. 1990. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391–407, 477, 482. Faruqui M, Dyer C. 2014. Improving vector space word representations using multilingual correlation. Proceedings of EACL 2014. Fawcett T. 2006. An introduction to roc analysis. Pattern recognition letters, 27(8), 861-874. Feng F, Wang X, Li R. 2014. Cross-modal retrieval with correspondence autoencoder. ACM Multimedia 2014, 7-16. Firth J R. 1957. A synopsis of linguistic theory. s.l.:s.n. Golub G H, Reinsch C. 1970. Singular value decomposition and least squares solutions. Numerische mathematik, 14(5), 403-420. Goodfellow I, Bengio Y, Courville A. 2016. Deep Learning. s.l.:MIT Press. Gouws S, Bengio Y, Corrado G. 2015. BilBOWA: Fast Bilingual Distributed Representations without Word Alignments. arXiv: 1410.2455. Hermann K M, Blunsom P. 2013. Multilingual distributed representations without word alignment. arXiv:1312.6173. Hinton G E, Srivastava N, Krizhevsky A, et al. 2012. Improving neural networks by preventing co-adaptation of feature detectors. arXiv:1207.0580. Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural computation , 9(8), 1735-1780. Hofmann T. 1999. Probabilistic Latent Semantic Indexing. Proceedings of the Twenty-Second Annual International SIGIR Conference on Research and Development in Information Retrieval, 289-296. Hotelling H. 1936. Relations between two sets of variates. Biometrika , 28(3/4), 321-377. Insall M, Rowland T, Weisstein E W. 2018. Embedding. 
[Online] Available at: http://mathworld.wolfram.com/Embedding.html [Accessed 2018-04-01]. Kalchbrenner N, Grefenstette E, Blunsom P. 2014. A convolutional neural network for modeling sentences. arXiv: 1606.04640. Karpathy A. 2018. CS231n: Convolutional Neural Networks for Visual Recognition. [Online] Available at: http://cs231n.github.io [Accessed 2018-04-01]. Karpathy A, Fei-Fei L. 2014. Deep visual-semantic alignments for generating image descriptions. CoRR 2014. Kim Y. 2014. Convolutional neural networks for sentence classification. EMNLP 2014, 1746–1751. Kingma D P, Ba J L. 2015. Adam: A method for stochastic optimization. ICLR 2015. Kiros R, Zhu Y, Salakhutdinov R, et al. 2015. Skip-Thought Vectors. arXiv: 1506.06726. Lebret R, Collobert R. 2013. Word emdeddings through hellinger PCA. arXiv: 1312.5542. LeCun Y, Bottou L, Bengio Y, et al. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE, Volume 86, 2278-2324. Le Q V, Mikolov T. 2014. Distributed representations of sentences and documents. ICML, 1188-1196. Levy O, Goldberg Y. 2014. Neural word embedding as implicit matrix factorization. Proceedings of NIPS 2014, 2177-2185. Levy O, Goldberg Y. 2015. Improving distributional similarity with lessons learned from word embeddings. TACL, Issue 3, 211-225. Li Y, Yang M, Zhang Z. 2015. Multi-View Representation Learning: A Survey from Shallow Methods to Deep Methods. arXiv: 1610.01206. Maas A L, Hannun A Y, Ng A Y. 2013. Rectifier Nonlinearities Improve Neural Network Acoustic Models. Proceedings of the 30th International Conference on Machine Learning ., JMLR: 28. Mikolov T, Chen K, Corrado G, et al. 2013a. Efficient estimation of word representations in vector space. arXiv: 1301.3781.. Mikolov T, Le Q V, Sutskever I. 2013b. Exploiting similarities among languages for machine translation. International Conference on Learning Representations. Mikolov T, Sutskever I, Chen K, et al., 2013c. 
Distributed representations of words and phrases and their compositionality. Proceedings of NIPS 2013, 3111-3119. Mitra B, Craswell N. 2017. Neural Models for Information Retrieval. arXiv: 1705.01509. Nair V, Hinton G E. 2010. Rectified linear units improve restricted boltzmann machines. s.l., s.n. Pennington J, Socher R, Manning C D. 2014. Glove: Global vectors for word representation. Proceedings of EMNLP 2014, 1532-1543.. Rehurek R. 2011. Fast and Faster: A comparison of two streamed matrix decomposition methods. arXiv: 1102.5597. Rumelhart D E, Hinton G E, Williams R J. 1986. Learning representations by back-propagating errors. Nature, Issue 323, 533–536. Schmidhuber J. 2014. Deep Learning in Neural Networks: An Overview. Technical Report IDSIA-03-14. arXiv:1404.7828. Smith S L, Turban D H P, Hamblin S, et al. 2017. Offline bilingual word vectors, orthogonal transformations and the inverted softmax. arXiv: 1702.03859. Socher R, Perelygin A, Wu J Y,. et al. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. Proceedings of EMNLP, 1631-1642. Weston J, Chopra S, Adams K. 2014. #TagSpace: semantic embeddings from hashtags. Proceedings of EMNLP, 1822-1827. Zhang X, Zhao J, LeCun Y. 2015. Character-level convolutional networks for text classification. NIPS, 649–657. Zhao Z, Liu T, Li S, et al. 2017. Ngram2vec – learning improved word representations from ngram co-occurrence statistics. Proceedings of EMNLP 2017, 244—253. van der Maaten L, Hinton G. 2008. Visualizing High-Dimensional Data Using t-SNE. Journal of Machine Learning Research, Issue 9, 2579-2605. |
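Call number: | 017/M2018(368) |
Release date: | 2021-05-26 |
Title: | 面向移动端的用户检索实体抽取系统设计与实现 (Design and Implementation of Entity Extraction System in User Query For Mobile Terminal Devices) |
Author: | |
Student ID: | 1501210487 |
Language: | chi |
Release time: | After 3 years |
Degree: | |
Department: | |
Advisor affiliation: | School of Software and Microelectronics |
Defense date: | 2018-05-26 |
Title (foreign language): | Design and Implementation of Entity Extraction System in User Query For Mobile Terminal Devices |
Abstract: | Entity extraction, a fundamental task in natural language processing, has seen a series of breakthroughs with the rise of deep learning. As a foundational component of tasks such as question answering, human-machine dialogue, and machine translation, its role is irreplaceable. Recently, with the rise of artificial intelligence and growing demand for intelligent semantic interaction, entity extraction from user queries has become a very important capability. Compared with traditional named entity recognition, it must cover a broader range of domains, meet stricter precision and accuracy requirements, and handle more complex user interaction logic. With the entity recognition results, we can carry out a series of resource requests and service dispatches to fulfill users' needs and guide their latent needs, which is a very important part of new forms of text interaction. With this goal, this thesis implements two systems, one online and one offline. Their core is the entity extraction function, supplemented by the necessary pattern-matching modules to serve users' trending needs and to correct the model's recognition errors. For entity extraction, we mainly use the TensorFlow framework to train, tune, and deploy the model. For the baseline, the thesis takes the novel step of adopting a seq2seq architecture to implement the basic framework for named entity recognition; the baseline model is then tuned with respect to training data size, input granularity, normalization, and the attention mechanism; finally, the model structure is improved and optimized in three respects: the word-vector generation method, the attention mechanism, and new model types. Together these improve the model's performance by more than 10 points. During algorithm iteration, we obtained the best results by combining models and enhancing word vectors. Finally, we tested the model on Microsoft's public named entity recognition test set and achieved fairly good results. Practical work on CNN encoders, a deeper study of attention mechanisms, and a survey of entity disambiguation models are left as future research directions. Second, for model deployment on mobile devices, the thesis carries out in-depth optimization on both the software and hardware sides. On the software side, we perform model compression and data-structure optimization; on the hardware side, we perform dependency separation and hardware adaptation. Overall, this addresses well the high memory usage and low execution efficiency that deep learning models face when deployed on mobile devices, and many of the solutions are worth drawing on. |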
Abstract (foreign language): | As a basic task of natural language processing, entity extraction has seen breakthroughs with the rise of deep learning. Named entity extraction plays an irreplaceable role in QA systems, interactive chat, machine translation, and similar tasks. Recently, with the rise of AI and growing demand for intelligent semantic interaction, entity extraction has emerged as a focal point in user query processing. Compared with traditional named entity recognition, it covers a broader range of domains, imposes stricter requirements on precision and recall, and involves more sophisticated interaction routines. Based on the extraction results, we can complete a series of resource requests and service dispatches, not only to meet users' demands but also to stimulate their latent needs. |
Classification number: | TP3 |
Total pages: | 124 |
Number of references: | 72 |
参考文献: |
[1] Maha Althobaiti, Udo Kruschwitz and Massimo Poesio. Combining Minimally-supervised Methods
for Arabic Named Entity Recognition. 2015,3. [2] Jimmy Lei Ba, Jamie Ryan Kiros and Geoffrey E Hinton. Layer Normalization. 2016. [3] Dzmitry Bahdanau, Kyunghyun Cho and Yoshua Bengio. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR, 2014, abs/1409.0473. http://arxiv.org/abs/1409. 0473. [4] Yoshua Bengio, Réjean Ducharme, Pascal Vincent et al. A Neural Probabilistic Language Model. Journal of Machine Learning Research, 2004-02-05: 1137–1155. http://dblp.uni-trier.de/ db/journals/jmlr/jmlr3.html#BengioDVJ03. [5] Daniel M. Bikel, Richard Schwartz and Ralph M. Weischedel. An Algorithm that Learns What’s in a Name. Machine Learning, 1999, 34(1-3): 211–231. [6] Andrew Borthwick, John Sterling, Eugene Agichtein et al. Description of the MENE named entity system as used in MUC-7. 1998. [7] Andrew Borthwick, John Sterling, Eugene Agichtein et al. Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition. In: 1998: 152–160. [8] Randall L Calvert. Robustness of the Multidimensional Voting Model: Candidate Motivations, Uncertainty, and Convergence. American Journal of Political Science, 1985, 29(1): 69. [9] Aitao Chen, Fuchun Peng, Roy Shan et al. Chinese named entity recognition with conditional probabilistic models. 2006. [10] Jason P. C. Chiu and Eric Nichols. Named Entity Recognition with Bidirectional LSTM-CNNs. Computer Science, 2015. [11] Key Sun Choi, Key Sun Choi and Key Sun Choi. Unsupervised named entity classification models and their ensembles. In: International Conference on Computational Linguistics, 2002: 1–7. [12] Michael Collins. Unsupervised Models for Named Entity Classification. In: Joint Sigdat Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999: 100–110. [13] Ronan Collobert, Jason Weston, Michael Karlen et al. Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research, 2011, 12(1): 2493–2537. 
[14] Tim Cooijmans, Nicolas Ballas, César Laurent et al. Recurrent Batch Normalization. CoRR, 2016, abs/1603.09025. http://arxiv.org/abs/1603.09025. [15] Chuanhai Dong, Jiajun Zhang, Chengqing Zong et al. Character-Based LSTM-CRF with RadicalLevel Features for Chinese Named Entity Recognition. Springer International Publishing, 2016. [16] Radu Florian. Named entity recognition as a house of cards: classifier stacking. In: Conference on Natural Language Learning, 2002: 1–4. [17] Jonas Gehring, Michael Auli, David Grangier et al. Convolutional Sequence to Sequence Learning. 2017. [18] Franck Genet and Franck Genet. Tagging unknown proper names using decision trees. In: Meeting on Association for Computational Linguistics, 2000: 77–84. [19] Yoav Goldberg. The unreasonable effectiveness of Character-level Language Models. [20] Michael Gutmann and Aapo Hyv?rinen. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. Journal of Machine Learning Research, 2010, 9: 297–304. [21] Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka et al. A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks. 2016. [22] Kaiming He, Xiangyu Zhang, Shaoqing Ren et al. Deep Residual Learning for Image Recognition. CoRR, 2015, abs/1512.03385. http://arxiv.org/abs/1512.03385. [23] Geoffrey E. Hinton, Alex Krizhevsky and Sida D. Wang. Transforming Auto-Encoders. 2011, 6791: 44–51. [24] Geoffrey E Hinton, Sara Sabour and Nicholas Frosst. Matrix capsules with EM routing. In: International Conference on Learning Representations, 2018. https://openreview.net/forum?id= HJWLfGWRb. [25] Sergey Ioffe and Christian Szegedy. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. 2015: 448–456. [26] Sergey Ioffe and Christian Szegedy. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR, 2015, abs/1502.03167. http://arxiv.org/abs/1502. 03167. 
[27] Armand Joulin, Edouard Grave, Piotr Bojanowski et al. Bag of Tricks for Efficient Text Classification. 2016: 427–431. [28] Yoon Kim, Yacine Jernite, David Sontag et al. Character-Aware Neural Language Models. Computer Science, 2015. [29] Trausti Kristjansson, Aron Culotta, Paul Viola et al. Interactive Information Extraction with Constrained Conditional Random Fields. In: Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applications of Artificial Intelligence, July 25-29, 2004, San Jose, California, Usa, 2004: 412–418. [30] Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian et al. Neural Architectures for Named Entity Recognition. CoRR, 2016, abs/1603.01360. http://arxiv.org/abs/1603.01360. [31] César Laurent, Gabriel Pereyra, Philémon Brakel et al. Batch Normalized Recurrent Neural Networks. 2015: 2657–2661. [32] Nicholas Leonard. Language modeling a billion words. [33] Dongyun Liang, Weiran Xu, Yinge Zhao et al. Combining Word-Level and Character-Level Representations for Relation Classification of Informal Text. In: The Workshop on Representation Learning for Nlp, 2017: 43–47. [34] Xuezhe Ma and Eduard Hovy. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF. 2016. [35] Andrei Mikheev, Claire Grover and Marc Moens. Description Of The Ltg System Used For Muc-7. In: 1998. [36] Tomas Mikolov, Kai Chen, Greg Corrado et al. Efficient Estimation of Word Representations in Vector Space. Computer Science, 2013. [37] Andriy Mnih and Yee Whye Teh. A fast and simple algorithm for training neural probabilistic language models. 2012: 419–426. [38] Richard Morgan, Roberto Garigliano, Paul Callaghan et al. University of Durham: description of the LOLITA system as used in MUC-6. In: Conference on Message Understanding, 1995: 71–85. [39] A?ron van den Oord, Sander Dieleman, Heiga Zen et al. WaveNet: A Generative Model for Raw Audio. CoRR, 2016, abs/1609.03499. http://arxiv.org/abs/1609.03499. 
[40] Jeffrey Pennington, Richard Socher and Christopher Manning. Glove: Global Vectors for Word Representation. In: Conference on Empirical Methods in Natural Language Processing, 2014: 1532– 1543. [41] Tran Quan, Andrew Mackinlay and Antonio Jimeno Yepes. Named Entity Recognition with stack residual LSTM and trainable bias decoding. 2017. [42] Lisa F Rau. Extracting company names from text. In: Artificial Intelligence Applications, 1991. Proceedings., Seventh IEEE Conference on, 1991: 29–32. [43] Lisa F Rau. Method for extracting company names from text. US, 1994. [44] Nils Reimers and Iryna Gurevych. Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP). Copenhagen, Denmark, 2017-09: 338–348. http://aclweb.org/anthology/D17-1035. [45] Sara Sabour, Nicholas Frosst and Geoffrey E Hinton. Dynamic Routing Between Capsules. 2017. [46] Samuel L. Smith, Pieter-Jan Kindermans and Quoc V. Le. Don’t Decay the Learning Rate, Increase the Batch Size. CoRR, 2017, abs/1711.00489. http://arxiv.org/abs/1711.00489. [47] Rohini Srihari, Niu Cheng and Li Wei. A Hybrid Approach for Named Entity and Sub-Type Tagging. In: Applied Natural Language Processing Conference, 2000: 247–254. [48] Rupesh Kumar Srivastava, Klaus Greff and Jürgen Schmidhuber. Training very deep networks. Computer Science, 2015. [49] Emma Strubell, Patrick Verga, David Belanger et al. Fast and Accurate Sequence Labeling with Iterated Dilated Convolutions. CoRR, 2017, abs/1702.02098. http://arxiv.org/abs/1702. 02098. [50] Chen Sun, Abhinav Shrivastava, Saurabh Singh et al. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era. CoRR, 2017, abs/1707.02968. http://arxiv.org/abs/1707.02968. [51] Ashish Vaswani, Noam Shazeer, Niki Parmar et al. Attention Is All You Need. 2017. [52] A Waibel, T Hanazawa, G Hinton et al. 
Phoneme recognition using time-delay neural networks. IEEE Press, 1990: 328–339. [53] Haochang Wang, Tiejun Zhao and Jianmiao Liu. Multi-Agent Classifiers Fusion Strategy for Biomedical Named Entity Recognition, 2008: 311–315. [54] Dekai Wu, Grace Ngai and Marine Carpuat. A Stacked, Voted, Stacked Model for Named Entity Recognition. In: Conference on Natural Language Learning at Hlt-Naacl, 2003: 200–203. [55] Zichao Yang, Diyi Yang, Chris Dyer et al. Hierarchical Attention Networks for Document Classification. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2017: 1480–1489. [56] Fisher Yu and Vladlen Koltun. Multi-Scale Context Aggregation by Dilated Convolutions. CoRR, 2015, abs/1511.07122. http://arxiv.org/abs/1511.07122. [57] Suxiang Zhang, Juan Wen and Xiaojie Wang. Word Segmentation and Named Entity Recognition for SIGHAN Bakeoff3. 2006. [58] Xiang Zhang and Yann LeCun. Text Understanding from Scratch. CoRR, 2015, abs/1502.01710. http://arxiv.org/abs/1502.01710. [59] Xiang Zhang, Junbo Zhao and Yann Lecun. Character-level Convolutional Networks for Text Classification. 2015: 649–657. [60] Yimin Zhang and Joe F. Zhou. A trainable method for extracting Chinese entity names and their relations. In: The Workshop on Chinese Language Processing: Held in Conjunction with the Meeting of the Association for Computational Linguistics, 2000: 66–72. [61] Junsheng Zhou, Liang He, Xinyu Dai et al. Chinese Named Entity Recognition with a Multi-Phase Model. 2012. [62] ZHOU, Junsheng, Weiguang et al. Chinese Named Entity Recognition via Joint Identification and Categorization. Chinese Journal of Electronics, 2013. [63] 冯元勇, 孙乐, 张大鲲 et al. 基于小规模尾字特征的中文命名实体识别研究. 电子学报, 2008, 36(9): 1833–1838. [64] 黄德根, 马玉霞 and 杨元生. 基于互信息的中文姓名识别方法. 大连理工大学学报, 2004, 44(5): 744–748. [65] 季姮 and 罗振声. 基于反比概率模型和规则的中文姓名自动辨识系统. In: 全国计算语言学联 合学术会议, 2001. [66] 季姮 and 罗振声. 基于统计和规则的中文姓名自动辨识. 语言文字应用, 2001, (1): 14–18. 
[67] 孙茂松, 黄昌宁, 高海燕 et al. 中文姓名的自动辨识. 中文信息学报, 1995, 9(2): 16–27. [68] 孙茂松 and 邹嘉彦. 汉语自动分词研究评述. 当代语言学, 2001, 3(1): 22–32. [69] 向晓雯, 史晓东 and 曾华琳. 一个统计与规则相结合的中文命名实体识别系统. 计算机应用, 2005, 25(10): 2404–2406. [70] 张小衡 and 王玲玲. 中文机构名称的识别与分析. 中文信息学报, 1997, 11(4): 21–32. [71] 郑家恒, 李鑫 and 谭红叶. 基于语料库的中文姓名识别方法研究. 中文信息学报, 2000, 14(1): 7–12. [72] 周俊生, 戴新宇, 尹存燕 et al. 基于层叠条件随机场模型的中文机构名自动识别. 电子学报, 2006, 34(5): 804–809. |
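Call number: | 017/M2018(372) |
Release date: | 2021-05-26 |
Title: | 基于笔画的中文字向量模型设计与研究 |
Name: | |
Student ID: | 1501211040 |
Thesis language: | chi |
Major: | |
Release time: | Public |
Degree level: | Master's |
Degree: | |
Institution: | Peking University |
Department: | |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Thesis defense date: | 2018-05-26 |
Title (foreign language): | Design and research of Chinese Word Embedding Model Based on Strokes |
Keywords: | |
Keywords (foreign language): | |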
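Abstract: | Data representation is a fundamental problem in machine learning: the first step of any machine learning task is to digitize the input samples. Unlike digital signals such as audio, images, and video, natural language is inherently highly structured and abstract, so the first job in any natural language task is to turn text into numbers. As technology has developed, the ways of representing language have steadily improved. From the original one-hot encoding to today's distributed representations, word vectors carry increasingly rich information. Existing statistical models, however, still cannot represent out-of-vocabulary and low-frequency words effectively. Research on Chinese word vectors is constrained by the distinctive "pictographic" nature of Chinese characters, and there is not yet an effective method for exploiting stroke information. Building on the CBOW framework of word2vec, this thesis proposes a stroke-based character embedding model for Chinese. By studying how strokes combine to form characters, it constructs high-quality vectors for out-of-vocabulary and low-frequency Chinese characters. The model uses the following methods: it vectorizes strokes using the context of the current character and learns how stroke combinations form characters; it introduces an attention mechanism to enrich the learned stroke-composition patterns; and it uses a CNN to capture information about character components and compound characters. In addition, drawing on the idea of generative adversarial networks, the thesis starts from the Skip-gram model of word2vec and attempts to inject stroke information into the character vectors adversarially. The evaluation compares the character vectors produced by the model with those produced by word2vec and glove in terms of precision and recall on tasks such as Chinese word segmentation and named entity recognition. On named entity recognition, the proposed character vectors reach an F1 score of 81.6%, against 80.2% for word2vec and 81.2% for glove. On word segmentation, the corresponding figures are 96.23%, 96.30%, and 96.31%. The analysis shows that the proposed model captures the stroke information of Chinese characters effectively and makes two contributions: using a CNN to capture how strokes form characters, and introducing attention to compute each stroke's contribution to a character. |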
Abstract (foreign language): | Data representation is a basic problem in machine learning. The first step in any machine learning task is to digitize the sample data. Unlike audio, image, and video data, natural language is inherently highly structured and abstract. Therefore, the primary step of a natural language task is to digitize the language. As technology has developed, the techniques for representing natural language have improved considerably. From one-hot encoding to distributed representations, the information that word embeddings contain has become much richer. However, existing statistical models cannot effectively represent out-of-vocabulary words and low-frequency words. Because of the "pictographic" character of written Chinese, there is not yet an effective way to use stroke information when digitizing Chinese characters. Building on CBOW, we propose a novel Chinese word embedding model based on stroke combinations. We aim to provide high-quality embeddings for unseen and low-frequency words by studying the rules by which Chinese characters are formed. The Stroke2Vec model has the following innovations: using context information to digitize strokes, learning the rules by which strokes combine into Chinese characters, and enriching the stroke patterns with an attention mechanism and convolutional neural networks. We then evaluate by comparing our model with Word2Vec and GloVe on Named Entity Recognition, Chinese Word Segmentation, and Part-Of-Speech tasks. On the NER task, the F1 scores are 81.6%, 80.2%, and 81.2%, respectively. On the CWS task, the F1 scores are 96.23%, 96.30%, and 96.31%. Meanwhile, inspired by GANs, we extend the Skip-gram model of word2vec in an attempt to incorporate stroke information into the word vectors during training. |
Classification number: | TP3 |
Total pages: | 52 |
Total number of references: | 47 |
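Call number: | 017/M2018(401) |
Release date: | 2018-05-26 |
Title: | 英语智能写作个性化辅助系统的设计与实现 |
Name: | |
Student ID: | 1501210804 |
Major: | |
Degree level: | Master's |
Institution: | Peking University |
Advisor 1 name: | |
Advisor 1 affiliation: | School of Software and Microelectronics |
Thesis defense date: | 2018-05-26 |
Keywords (foreign language): | Vocabulary Network; Sentence Recommendation; Article Recommendation; English Writing Level; Computer-assisted English Writing |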
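Abstract: | English writing plays an increasingly important role in everyday and workplace communication and in English learning. On the one hand, ideas and information can be conveyed effectively only when the content of a text is described richly and accurately; on the other hand, for English learners whose native language is not English, writing also improves proficiency, and writing in large quantities is a basic requirement of the "Length Approach" theory of English teaching. Writing is nevertheless difficult for English learners, and many writing assistance systems have appeared to address this difficulty. Unlike those systems, the system presented here is a personalized assistant that is based on each student's individual learning status and writing level and that supports writing along several dimensions: words, sentences, and whole texts. |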
Abstract (foreign language): | English writing plays an increasingly important role in daily life, especially in work communication and English learning. On the one hand, the content of an article must be described richly and accurately to convey ideas and information; on the other hand, writing is an important strategy for improving English for non-native English speakers, and writing a lot is a basic requirement of the "Length Approach", an English teaching theory. However, writing is very difficult for English learners, and many writing-assistance software packages and systems have been developed to address this difficulty. Unlike those systems and software, this system is based on each student's individual learning status and writing level. It is a personalized Writing Assistant System that helps students write at the word, sentence, and topic levels. |
Classification number: | TP3 |
Total pages: | 73 |
Total number of references: | 45 |
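Call number: | 017/M2018(402) |
Release date: | 2018-05-26 |
Title: | 基于深度学习的英文手写识别的设计与实现 |
Author: | |
Language: | chi |
Release time: | After 3 years |
Degree: | |
Department: | |
Advisor affiliation: | School of Software and Microelectronics |
Defense date: | 2018-05-26 |
Title (foreign language): | Design and Implementation of English Handwritten Recognition Based on Deep Learning |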
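Abstract: | Writing is one of the key markers of humanity's entry into civilized society, and it drives the progress and development of human society. With today's advanced technology, converting these ancient symbols on paper into content that modern computers can recognize, store, and retrieve is of great significance. In recent years, with the rapid development of deep learning, computer recognition of individual English characters has reached very high accuracy. However, because of differences in personal handwriting style and strokes that run together between adjacent characters, recognizing an entire handwritten English string remains a challenging problem. |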
Abstract (foreign language): | Writing is one of the important signs of humanity's entry into civilized society, and it promotes the progress and development of human society. Today, with advanced science and technology, it is of great significance to transform these ancient symbols on paper into content that modern computers can identify, store, and retrieve. In recent years, with the rapid development of deep learning, computer recognition of single English characters has reached a high accuracy rate. However, recognizing a whole handwritten English string remains a challenging problem because of differences in personal writing style and the adhesion of strokes between characters. |
Classification number: | TP3 |
Total pages: | 69 |
Number of references: | 54 |
References: |
[1] 刘排排. 空中手写字符串识别算法研究[硕士学位论文]. 北京交通大学, 2015.
[2] 武裕朴, 赵景台. 印刷体汉字识别方法综述[J]. 机器人, 1981, 3(5):6-12. [3] Jain |