Clarin-k center - Ixa taldea

Clarin-k center Ixa taldea

ANALHITZA will help you extracting from text in Basque, Spanish or English, some linguistic information, such as:

nouns, adjectives, verbs, adverbs...
person names, location names...
sequences of two, three and four words
... and much more!

The text could be the one that you have in a file, something that you will copy it here, or from a web page, but it should be encoded in UTF8. To use ANALHITZA, enter the text you want to analyze using one of the 3 below options, and then choose the language of your text (Basque, Spanish or English). After waiting a moment, you will get the results on an Excel file. Thus, you will be able to adapt the results to meet your requirements.

Input file	Created	Ouput file
txt	January 01 1970 01:00:00	log.txt
txt	January 01 1970 01:00:00	naf_stats
txt	January 01 1970 01:00:00	dir_input
txt	January 01 1970 01:00:00	dir_output
.txt	January 01 1970 01:00:00	log_funt.txt
txt	January 01 1970 01:00:00	tresnak
a.txt	January 01 1970 01:00:00	analhitza.php
txt	January 01 1970 01:00:00	index.php
txt	January 01 1970 01:00:00	irudiak
txt	January 01 1970 01:00:00	zaborra
ader.txt	January 01 1970 01:00:00	index_header.php
ioa.txt	January 01 1970 01:00:00	dibulgazioa.php
txt	January 01 1970 01:00:00	bideoa.php
txt	January 01 1970 01:00:00	NAFStats.sh
tzuk.txt	January 01 1970 01:00:00	beste_batzuk.php
_akatsa_dagoenean_probatzeko.txt	January 01 1970 01:00:00	IRAKURRI_akatsa_dagoenean_probatzeko.txt
.sh.txt	January 01 1970 01:00:00	NAFStats.sh.zar
txt	January 01 1970 01:00:00	robots.txt
k.txt	January 01 1970 01:00:00	funtzioak.php
o_testtxt	January 01 1970 01:00:00	probarako_testuak
a_masiboa.txt	January 01 1970 01:00:00	analhitza_masiboa.php
a.php.otxt	January 01 1970 01:00:00	analhitza.php.orig
txt	January 01 1970 01:00:00	bideoa
txt	January 01 1970 01:00:00	corpusak
txt	January 01 1970 01:00:00	bloga
txt	January 01 1970 01:00:00	index
ted_ttxt	January 01 1970 01:00:00	unformatted_text
txt	January 01 1970 01:00:00	bideoak.php
txt	January 01 1970 01:00:00	sarrera.php
enak.txt	January 01 1970 01:00:00	argitalpenak.php
a_info.txt	January 01 1970 01:00:00	analhitza_info.php
oter.txt	January 01 1970 01:00:00	index_footer.php
oooo.txt	January 01 1970 01:00:00	emaitza_oooo.xls
ktxt	January 01 1970 01:00:00	gordetzekoak
a.phptxt	January 01 1970 01:00:00	analhitza.php.kk
a_masiboa_form.txt	January 01 1970 01:00:00	analhitza_masiboa_form.php
txt	January 01 1970 01:00:00	mantenu.php
a.php.ztxt	January 01 1970 01:00:00	analhitza.php.zar2
a_2.txt	January 01 1970 01:00:00	analhitza_2.php
txt	January 01 1970 01:00:00	indizea
txt	January 01 1970 01:00:00	deskargak
txt	January 01 1970 01:00:00	cgi
.txt	January 01 1970 01:00:00	php_info.php
txt	January 01 1970 01:00:00	estiloak
ddle.txt	January 01 1970 01:00:00	index_middle.php
txt	January 01 1970 01:00:00	borratu.php
ddle_kendutakoa.txt	January 01 1970 01:00:00	index_middle_kendutakoa.php
txt	January 01 1970 01:00:00	multiling
tml_dom.txt	January 01 1970 01:00:00	simple_html_dom.php
ext_file.txt	January 01 1970 01:00:00	upload_text_file.php
txt	January 01 1970 01:00:00	scripts

ANALHITZA processes automatically the text using ixaKat (for Basque) and Ixa pipes (for Spanish and English), which are two modular chains of linguistic processors.
ANALHITZA, making use of language technologies, has been designed with the purpose of offering to researchers from the fields of humanities and social sciences a simple way to obtain reliable and easily manipulable linguistic data. If you have defined a research topic in one of these fields, if you have a corpus and if you are interested in using ANALHITZA, contact us and we will advise you: mikel.iruskieta@ehu.eus
You are free to use any information from this website, but we would appreciate an acknowledgement. The proper way to cite the ANALHITZA is the following:
Otegi, A. Imaz, O. Díaz de Ilarraza, A. Iruskieta, M. Uria, L. 2017 ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research. Procesamiento del Lenguaje Natural 58: 77-84.

Processing...

Processing...

Processing...

Demos

Videos