
http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.

대용량 자원 기반 과학기술 핵심개체 탐지를 위한 정보추출기술 통합에 관한 연구
최윤수,정창후,최성필,류범종,김재훈,Choi, Yun-Soo,Cheong, Chang-Hoo,Choi, Sung-Pil,You, Beom-Jong,Kim, Jae-Hoon 한국과학기술정보연구원 과학기술정보센터 2009 Journal of Information Science Theory and Practice Vol.40 No.4
Large-scaled information extraction plays an important role in advanced information retrieval as well as question answering and summarization. Information extraction can be defined as a process of converting unstructured documents into formalized, tabular information, which consists of named-entity recognition, terminology extraction, coreference resolution and relation extraction. Since all the elementary technologies have been studied independently so far, it is not trivial to integrate all the necessary processes of information extraction due to the diversity of their input/output formation approaches and operating environments. As a result, it is difficult to handle scientific documents to extract both named-entities and technical terms at once. In this study, we define scientific as a set of 10 types of named entities and technical terminologies in a biomedical domain. in order to automatically extract these entities from scientific documents at once, we develop a framework for scientific core entity extraction which embraces all the pivotal language processors, named-entity recognizer, co-reference resolver and terminology extractor. Each module of the integrated system has been evaluated with various corpus as well as KEEC 2009. The system will be utilized for various information service areas such as information retrieval, question-answering(Q&A), document indexing, dictionary construction, and so on.
최윤수,김재명,윤창범,Choi, Yun-Soo,Kim, Jae-Myeong,Yun, Chang-Beom 대한공간정보학회 2010 한국공간정보학회지 Vol.18 No.1
Recently, many countries are promoting the rapid development of marine for securing territorial sea. The importance of territorial sea has being emphasized as territorial disputes among countries has been increasing. The South Korea should be encouraged to expand the territory due to territorial disputes with neighborhood countries. The purpose of this study is to derive a improvement plan for the efficient control of the territorial base point through analyzing the existing territorial base point and checking the territorial base point. Therefore, we proposed a variety of new plans for accurate positioning by sea level observation and MBES, reduced the surveying procedure through analyzing the existing territorial base points and provided a basis for the territorial base points which can be the 2nd grade national control points by improving a grade. We also suggested that the territorial base points and sub-territorial base points' database should be given standardized number for increasing the efficient management of other national control points and territorial base points. Finally, we suggested a improved work regulation about analysis and maintenance for territorial base points, the information activity of territorial base points and the new plan of community relations. This study will be a basis for the foundation of maritime territory which could be superior to other countries and the expansion of maritime territory.
최윤수,김도현,Choi, Yun-Soo,Kim, Do-Hyeon 한국벤처창업학회 2011 벤처창업연구 Vol.6 No.2
While the role of universities in the modern society has long been a key agenda of discussion, Korean universities have recently tried to get good results in league tables, mostly published by major media. Although the criteria of evaluation are developed through benchmarking and thoughtful discussion, they are still in the process of development and fragile to the question whether they really suggest the future direction of universities. The study seeks to find whether the number of UIC(university industry co-authored articles), as suggested by a few scholars, is meaningful and feasible as an alternative or complement to current performance indices of universities. The adoption of UIC implicitly means that a key role of university should be developing and diffusing knowledge to the society by working with industry participants. We found limited evidence that the index of UIC has meaningful discrepancy with current indices for research, which opens up a new discussion.
최윤수,김재명,김현수,박병문,Choi, Yun-Soo,Kim, Jae-Myeong,Kim, Hyun-Soo,Park, Byung-Moon 대한공간정보학회 2012 한국공간정보학회지 Vol.20 No.2
최근 국내 여러 지방자치단체들은 해양개발이 활성화됨에 따라 관할해역 확보에 총력을 기울이고 있으며, 인접 지방자치단체간 분쟁의 증가로 인하여 관할해역 구분의 기준이 되는 해상경계의 중요성이 급격히 부각되고 있다. 특히, 해상경계 획정기준의 부재는 인접 지방자치단체들 간 분쟁을 발생시킬 수 있는 요소를 많이 내포함에 따라서 해상경계 획정기준 마련을 위한 노력이 절실히 필요하다고 할 수 있다. 본 논문에서는 과학적이고 합리적인 해상경계 획정기준을 마련하기 위하여 기 고시된 "해상경계 확인을 위한 수로측량업무규정"을 조사 분석하여 개선사항을 도출하였다. 그 결과 첫째, 해상경계 획정의 개념을 수립하였고 둘째, 해상경계 획정기준의 범위와 내용 및 절차를 구체적으로 설정하였으며, 마지막으로 지방자치단체들 간 해상경계 분쟁의 사전예방 및 사후 합법적 해결을 목적으로 해상경계조정위원회의 설치, 구성, 직무, 조정결과의 효력 등과 같은 해상경계 분쟁조정방법을 제시하였다.
과학 기술 문헌 분석을 위한 기계학습 기반 범용 전문용어 인식 시스템
최윤수,송사광,전홍우,정창후,최성필,Choi, Yun-Soo,Song, Sa-Kwang,Chun, Hong-Woo,Jeong, Chang-Hoo,Choi, Sung-Pil 한국정보처리학회 2011 정보처리학회논문지D Vol.18 No.5
문헌에서의 전문용어 인식 연구는 정보검색, 정보추출, 시맨틱 웹, 질의응답 분야 등의 연구를 위한 선행 연구로서, 지금까지 대부분 특정 분야, 특히 생의학 분야에서 집중되어 연구되어 왔다. 그러나 기존 연구들이 특정 도메인 또는 문헌 내부 통계 정보를 활용함으로써 범용적인 전문용어 인식에 한계점을 보여 왔기 때문에, 본 연구에서는 웹 검색 결과와 사전, 후보용어의 문형 특징 등을 활용하는 기계 학습 기반 범용 전문용어 인식 방법을 제안하였다. 제안한 방법을 문헌의 지역 통계 정보를 사용하는 방법(C-value)과 비교 실험하여 80.8%의 F-값으로 6.5%의 성능향상을 보였다. 다양한 응집도 자질들을 접목한 두 번째 실험에서는 Normalized Google Distance 방법과 접목한 방식이 F-값 81.8%의 성능으로 최고의 성능을 나타냈다. 기계 학습 방법으로는 로지스틱 회귀분석, C4.5, SVMs 등을 적용하였는데, 일반적으로 이진 분류에 좋은 성능을 보이는 SVMs과 로지스틱 회귀분석 방법보다 결정 트리 방식의 C4.5가 전반적으로 좋은 성능을 보였다. Terminology recognition system which is a preceding research for text mining, information extraction, information retrieval, semantic web, and question-answering has been intensively studied in limited range of domains, especially in bio-medical domain. We propose a domain independent terminology recognition system based on machine learning method using dictionary, syntactic features, and Web search results, since the previous works revealed limitation on applying their approaches to general domain because their resources were domain specific. We achieved F-score 80.8 and 6.5% improvement after comparing the proposed approach with the related approach, C-value, which has been widely used and is based on local domain frequencies. In the second experiment with various combinations of unithood features, the method combined with NGD(Normalized Google Distance) showed the best performance of 81.8 on F-score. We applied three machine learning methods such as Logistic regression, C4.5, and SVMs, and got the best score from the decision tree method, C4.5.



