Content-based Map of Science using Cross-lingual Document Embedding - A Comparison of US-Japan Funded Projects

Leiden Repository

Content-based Map of Science using Cross-lingual Document Embedding - A Comparison of US-Japan Funded Projects

Type: Article in monograph or in proceedings
Title: Content-based Map of Science using Cross-lingual Document Embedding - A Comparison of US-Japan Funded Projects
Author: Kawamura T.Watanabe K.Egami S.Matsumoto N.Jibu M.
Journal Title: STI 2018 Conference Proceedings
Start Page: 385
End Page: 394
Publisher: Centre for Science and Technology Studies (CWTS)
Issue Date: 2018-09-11
Keywords: Scientometrics
Abstract: Maps depicting the structure of science help us understand the development of science and technology. However, as it is difficult to apply inter-citation and co-citation analysis to recently published papers and ongoing projects that have few or no references, our previous work developed a content-based map by locating research papers and funding projects using word/document embedding. Because difficulties arise when comparing the content-based map in different languages, this paper improves our content-based map by developing a method for generating multi-dimensional vectors in the same space from cross-lingual (English and Japanese) documents. Using 1,000 IEEE papers, we confirmed a similarity of 0.76 for matching bilingual contents. Finally, we constructed a map from 34,000 projects of the National Science Foundation and Japan Society for the Promotion of Science from 2012 to 2015, and we indicate the findings obtained from a comparison of the US-Japan funded projects.
Handle: http://hdl.handle.net/1887/65248
 

Files in this item

Description Size View
application/pdf STI2018_paper_72.pdf 1.314Mb View/Open

This item appears in the following Collection(s)