Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge

During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents...

Full description

Autores:

Tipo de recurso:

Fecha de publicación:: 2013

Institución:: Universidad de Medellín

Repositorio:: Repositorio UDEM

Idioma:: eng

id	REPOUDEM2_42df65daea8b7d5eb2be55b679298507
oai_identifier_str	oai:repository.udem.edu.co:11407/2319
network_acronym_str	REPOUDEM2
network_name_str	Repositorio UDEM
repository_id_str
spelling	2016-06-23T21:52:09Z2016-06-23T21:52:09Z201323259000http://hdl.handle.net/11407/2319During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents belonging to the organizations (e.g. technical reports, legacy documents, and procedure manuals). Some properties of such documents are: different representation formats, high degree of ambiguity, and particular linguistics elements. The analysis and processing of such documents becomes complex because of these properties and, in turn, the complexity makes difficult both identifying the domain knowledge and understanding the associated processes. As a solution for an automated transformation, in this paper we define linguistics features to enable identification and composition of information units from a procedure manual, as a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. Also, the features can be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School.engKnowledge Systems Institute Graduate Schoolhttps://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272ScopusProcessing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledgeConference Paperinfo:eu-repo/semantics/conferenceObjecthttp://purl.org/coar/resource_type/c_c94finfo:eu-repo/semantics/restrictedAccesshttp://purl.org/coar/access_right/c_16ecFaculty of Engineering, Universidad de Medellín, Medellín, ColombiaFaculty of Mines, Universidad Nacional de Colombia, Medellín, ColombiaLosada B.M.Jaramillo C.M.Z.Domain knowledgeNatural languageRequirements elicitationTechnical documentsTexts processing11407/2319oai:repository.udem.edu.co:11407/23192020-05-27 17:34:32.316Repositorio Institucional Universidad de Medellinrepositorio@udem.edu.co
dc.title.spa.fl_str_mv	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
title	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
spellingShingle	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge Domain knowledge Natural language Requirements elicitation Technical documents Texts processing
title_short	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
title_full	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
title_fullStr	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
title_full_unstemmed	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
title_sort	Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
dc.contributor.affiliation.spa.fl_str_mv	Faculty of Engineering, Universidad de Medellín, Medellín, Colombia Faculty of Mines, Universidad Nacional de Colombia, Medellín, Colombia
dc.subject.keyword.eng.fl_str_mv	Domain knowledge Natural language Requirements elicitation Technical documents Texts processing
topic	Domain knowledge Natural language Requirements elicitation Technical documents Texts processing
description	During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents belonging to the organizations (e.g. technical reports, legacy documents, and procedure manuals). Some properties of such documents are: different representation formats, high degree of ambiguity, and particular linguistics elements. The analysis and processing of such documents becomes complex because of these properties and, in turn, the complexity makes difficult both identifying the domain knowledge and understanding the associated processes. As a solution for an automated transformation, in this paper we define linguistics features to enable identification and composition of information units from a procedure manual, as a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. Also, the features can be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School.
publishDate	2013
dc.date.created.none.fl_str_mv	2013
dc.date.accessioned.none.fl_str_mv	2016-06-23T21:52:09Z
dc.date.available.none.fl_str_mv	2016-06-23T21:52:09Z
dc.type.eng.fl_str_mv	Conference Paper
dc.type.coar.fl_str_mv	http://purl.org/coar/resource_type/c_c94f
dc.type.driver.none.fl_str_mv	info:eu-repo/semantics/conferenceObject
dc.identifier.issn.none.fl_str_mv	23259000
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/11407/2319
identifier_str_mv	23259000
url	http://hdl.handle.net/11407/2319
dc.language.iso.none.fl_str_mv	eng
language	eng
dc.relation.isversionof.spa.fl_str_mv	https://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0
dc.relation.ispartofen.eng.fl_str_mv	Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272
dc.rights.coar.fl_str_mv	http://purl.org/coar/access_right/c_16ec
dc.rights.accessrights.none.fl_str_mv	info:eu-repo/semantics/restrictedAccess
eu_rights_str_mv	restrictedAccess
rights_invalid_str_mv	http://purl.org/coar/access_right/c_16ec
dc.publisher.spa.fl_str_mv	Knowledge Systems Institute Graduate School
dc.source.spa.fl_str_mv	Scopus
institution	Universidad de Medellín
repository.name.fl_str_mv	Repositorio Institucional Universidad de Medellin
repository.mail.fl_str_mv	repositorio@udem.edu.co
_version_	1851059156657635328

Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge

Publicaciones similares