Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge
During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents...
- Autores:
- Tipo de recurso:
- Fecha de publicación:
- 2013
- Institución:
- Universidad de Medellín
- Repositorio:
- Repositorio UDEM
- Idioma:
- eng
- OAI Identifier:
- oai:repository.udem.edu.co:11407/2319
- Acceso en línea:
- http://hdl.handle.net/11407/2319
- Palabra clave:
- Domain knowledge
Natural language
Requirements elicitation
Technical documents
Texts processing
- Rights
- restrictedAccess
- License
- http://purl.org/coar/access_right/c_16ec
id |
REPOUDEM2_42df65daea8b7d5eb2be55b679298507 |
---|---|
oai_identifier_str |
oai:repository.udem.edu.co:11407/2319 |
network_acronym_str |
REPOUDEM2 |
network_name_str |
Repositorio UDEM |
repository_id_str |
|
spelling |
2016-06-23T21:52:09Z2016-06-23T21:52:09Z201323259000http://hdl.handle.net/11407/2319During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents belonging to the organizations (e.g. technical reports, legacy documents, and procedure manuals). Some properties of such documents are: different representation formats, high degree of ambiguity, and particular linguistics elements. The analysis and processing of such documents becomes complex because of these properties and, in turn, the complexity makes difficult both identifying the domain knowledge and understanding the associated processes. As a solution for an automated transformation, in this paper we define linguistics features to enable identification and composition of information units from a procedure manual, as a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. Also, the features can be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School.engKnowledge Systems Institute Graduate Schoolhttps://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272ScopusProcessing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledgeConference Paperinfo:eu-repo/semantics/conferenceObjecthttp://purl.org/coar/resource_type/c_c94finfo:eu-repo/semantics/restrictedAccesshttp://purl.org/coar/access_right/c_16ecFaculty of Engineering, Universidad de Medellín, Medellín, ColombiaFaculty of Mines, Universidad Nacional de Colombia, Medellín, ColombiaLosada B.M.Jaramillo C.M.Z.Domain knowledgeNatural languageRequirements elicitationTechnical documentsTexts processing11407/2319oai:repository.udem.edu.co:11407/23192020-05-27 17:34:32.316Repositorio Institucional Universidad de Medellinrepositorio@udem.edu.co |
dc.title.spa.fl_str_mv |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
title |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
spellingShingle |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge Domain knowledge Natural language Requirements elicitation Technical documents Texts processing |
title_short |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
title_full |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
title_fullStr |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
title_full_unstemmed |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
title_sort |
Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge |
dc.contributor.affiliation.spa.fl_str_mv |
Faculty of Engineering, Universidad de Medellín, Medellín, Colombia Faculty of Mines, Universidad Nacional de Colombia, Medellín, Colombia |
dc.subject.keyword.eng.fl_str_mv |
Domain knowledge Natural language Requirements elicitation Technical documents Texts processing |
topic |
Domain knowledge Natural language Requirements elicitation Technical documents Texts processing |
description |
During the requirements elicitation (RE) process, transformations among languages occur from natural language-in which the stakeholders express their domain and needs-to a controlled language. One source of domain information to be used by the transformation process is related to technical documents belonging to the organizations (e.g. technical reports, legacy documents, and procedure manuals). Some properties of such documents are: different representation formats, high degree of ambiguity, and particular linguistics elements. The analysis and processing of such documents becomes complex because of these properties and, in turn, the complexity makes difficult both identifying the domain knowledge and understanding the associated processes. As a solution for an automated transformation, in this paper we define linguistics features to enable identification and composition of information units from a procedure manual, as a central task for the language transformation process in RE. The identified features are classified into rhetorical, morphosyntactic, and semantic features. Also, the features can be used to identify and represent organizational domain knowledge. Copyright © 2013 by Knowledge Systems Institute Graduate School. |
publishDate |
2013 |
dc.date.created.none.fl_str_mv |
2013 |
dc.date.accessioned.none.fl_str_mv |
2016-06-23T21:52:09Z |
dc.date.available.none.fl_str_mv |
2016-06-23T21:52:09Z |
dc.type.eng.fl_str_mv |
Conference Paper |
dc.type.coar.fl_str_mv |
http://purl.org/coar/resource_type/c_c94f |
dc.type.driver.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
dc.identifier.issn.none.fl_str_mv |
23259000 |
dc.identifier.uri.none.fl_str_mv |
http://hdl.handle.net/11407/2319 |
identifier_str_mv |
23259000 |
url |
http://hdl.handle.net/11407/2319 |
dc.language.iso.none.fl_str_mv |
eng |
language |
eng |
dc.relation.isversionof.spa.fl_str_mv |
https://www.scopus.com/record/display.uri?eid=2-s2.0-84937679594&origin=inward&txGid=0 |
dc.relation.ispartofen.eng.fl_str_mv |
Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE Volume 2013-January, Issue January, 2013, Pages 268-272 |
dc.rights.coar.fl_str_mv |
http://purl.org/coar/access_right/c_16ec |
dc.rights.accessrights.none.fl_str_mv |
info:eu-repo/semantics/restrictedAccess |
eu_rights_str_mv |
restrictedAccess |
rights_invalid_str_mv |
http://purl.org/coar/access_right/c_16ec |
dc.publisher.spa.fl_str_mv |
Knowledge Systems Institute Graduate School |
dc.source.spa.fl_str_mv |
Scopus |
institution |
Universidad de Medellín |
repository.name.fl_str_mv |
Repositorio Institucional Universidad de Medellin |
repository.mail.fl_str_mv |
repositorio@udem.edu.co |
_version_ |
1814159169426554880 |