Detalhes da Produção

TipoArtigo Publicado
GrupoProdução Bibliográfica
DescriçãoSILVA, Edilberto M. ; do Prado, Hércules Antonio ; FERNEDA, Edilson. Text mining: crossing the chasm between the academy and the industry. Management Information Systems, v. , p. 351-361, 2002.
AutorHercules Antonio do Prado
Ano2002

Informações Complementares

Ano do artigo2002
Descricão e Informacões AdicionaisThe existence of a chasm between the development phase and the adoption of new technologies has been widely recognized. Some reasons that make hard the transition academy-industry for new technology are: (a) the weak usability commonly presented by emergent technology in regard to the required ease of ordinary users; (b) few successful experiences reported; and (c) the lack of an adequate methodology to new tools. In this paper we argue that text mining technology is exactly in the chasm point and study the hypothesis (c) mentioned above. The start point of our argumentation is the contradiction posed by the extraordinary amount of information in text form - about 80% of all existing information in a company - while the amount of text mining/web mining applications does not go beyond 7%. At the same time, we observe that the available technological alternatives present an excellent level of maturity, with many functions and adequate interfaces for the common user. The research was carried out by means of a case study in which we used texts issued by a journalistic agency. In order to explore our hypothesis, we applied the CRISP-DM method that was originally conceived for data mining. The contribution of this work includes the examination of the methodological hypothesis for the lack of text mining applications, an experience report in which we describe the steps carried out to apply CRISP-DM to text mining, and the findings in the target domain.
Divulgacão CientíficaNAO
Homepage do Trabalhohttp://www.wessex.ac.uk/conferences/2002/datamining02
IdiomaInglês
ISSN14706326
local de publicacaoBologna, Itália
Meio de DivulgaçãoIMPRESSO
NaturezaCOMPLETO
Página Final361
Página Inicial351
País de PublicaçãoItália
RelevânciaNAO
Título do ArtigoText mining: crossing the chasm between the academy and the industry
Título do Períodico ou RevistaManagement Information Systems