Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge

During the requirements elicitation (RE) process, transformations among languages occur from natural language, in which the stakeholders express their domain and needs, to a controlled language. One source of domain information for the transformation process is the technical documents...

Authors:
Losada B.M.
Jaramillo C.M.Z.
Resource type:
Conference Paper
Publication date:
2013
Institution:
Universidad de Medellín
Repository:
Repositorio UDEM
Language:
eng
OAI Identifier:
oai:repository.udem.edu.co:11407/2319
Online access:
http://hdl.handle.net/11407/2319
Keywords:
Domain knowledge
Natural language
Requirements elicitation
Technical documents
Texts processing
Rights
restrictedAccess
License
http://purl.org/coar/access_right/c_16ec
spelling 2016-06-23T21:52:09Z
2016-06-23T21:52:09Z
2013
23259000
http://hdl.handle.net/11407/2319
During the requirements elicitation (RE) process, transformations among languages occur from natural language, in which the stakeholders express their domain and needs, to a controlled language. One source of domain information for the transformation process is the technical documents belonging to organizations (e.g., technical reports, legacy documents, and procedure manuals). Such documents have different representation formats, a high degree of ambiguity, and particular linguistic elements. These properties make the analysis and processing of such documents complex, and the complexity in turn makes it difficult both to identify the domain knowledge and to understand the associated processes. As a solution for automated transformation, in this paper we define linguistic features that enable the identification and composition of information units from a procedure manual, a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. The features can also be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School.
eng
Knowledge Systems Institute Graduate School
https://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0
Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272
Scopus
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
Conference Paper
info:eu-repo/semantics/conferenceObject
http://purl.org/coar/resource_type/c_c94f
info:eu-repo/semantics/restrictedAccess
http://purl.org/coar/access_right/c_16ec
Faculty of Engineering, Universidad de Medellín, Medellín, Colombia
Faculty of Mines, Universidad Nacional de Colombia, Medellín, Colombia
Losada B.M.
Jaramillo C.M.Z.
Domain knowledge
Natural language
Requirements elicitation
Technical documents
Texts processing
11407/2319
oai:repository.udem.edu.co:11407/2319
2020-05-27 17:34:32.316
Repositorio Institucional Universidad de Medellin
repositorio@udem.edu.co
dc.title.spa.fl_str_mv Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
dc.contributor.affiliation.spa.fl_str_mv Faculty of Engineering, Universidad de Medellín, Medellín, Colombia
Faculty of Mines, Universidad Nacional de Colombia, Medellín, Colombia
dc.subject.keyword.eng.fl_str_mv Domain knowledge
Natural language
Requirements elicitation
Technical documents
Texts processing
description During the requirements elicitation (RE) process, transformations among languages occur from natural language, in which the stakeholders express their domain and needs, to a controlled language. One source of domain information for the transformation process is the technical documents belonging to organizations (e.g., technical reports, legacy documents, and procedure manuals). Such documents have different representation formats, a high degree of ambiguity, and particular linguistic elements. These properties make the analysis and processing of such documents complex, and the complexity in turn makes it difficult both to identify the domain knowledge and to understand the associated processes. As a solution for automated transformation, in this paper we define linguistic features that enable the identification and composition of information units from a procedure manual, a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. The features can also be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School.
dc.date.created.none.fl_str_mv 2013
dc.date.accessioned.none.fl_str_mv 2016-06-23T21:52:09Z
dc.date.available.none.fl_str_mv 2016-06-23T21:52:09Z
dc.type.eng.fl_str_mv Conference Paper
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_c94f
dc.type.driver.none.fl_str_mv info:eu-repo/semantics/conferenceObject
dc.identifier.issn.none.fl_str_mv 23259000
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/11407/2319
dc.language.iso.none.fl_str_mv eng
dc.relation.isversionof.spa.fl_str_mv https://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0
dc.relation.ispartofen.eng.fl_str_mv Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_16ec
dc.rights.accessrights.none.fl_str_mv info:eu-repo/semantics/restrictedAccess
dc.publisher.spa.fl_str_mv Knowledge Systems Institute Graduate School
dc.source.spa.fl_str_mv Scopus
repository.name.fl_str_mv Repositorio Institucional Universidad de Medellin
repository.mail.fl_str_mv repositorio@udem.edu.co