Yaseen et al., 2014 - Google Patents
Extracting the roots of Arabic words without removing affixesYaseen et al., 2014
View PDF- Document ID
- 14561400322817934564
- Author
- Yaseen Q
- Hmeidi I
- Publication year
- Publication venue
- Journal of Information Science
External Links
Snippet
Most research in Arabic roots extraction focuses on removing affixes from Arabic words. This process adds processing overhead and may remove non-affix letters, which leads to the extraction of incorrect roots. This paper advises a new approach to dealing with this issue by …
- 238000000034 method 0 abstract description 27
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30731—Creation of semantic tools
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/02—Electrically-operated educational appliances with visual presentation of the material to be studied, e.g. using film strip
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Slabakova | The scalpel model of third language acquisition | |
Assiri et al. | Towards enhancement of a lexicon-based approach for Saudi dialect sentiment analysis | |
Jiménez-Zafra et al. | Combining resources to improve unsupervised sentiment analysis at aspect-level | |
Qasim et al. | Concept map construction from text documents using affinity propagation | |
Yaseen et al. | Extracting the roots of Arabic words without removing affixes | |
Azmi et al. | Aara’–a system for mining the polarity of Saudi public opinion through e-newspaper comments | |
Gorbaniuk et al. | The factor structure of Polish personality–descriptive adjectives: An alternative psycho–lexical study | |
Mustafa et al. | Kurdish stemmer pre-processing steps for improving information retrieval | |
Tocoglu et al. | TREMO: A dataset for emotion analysis in Turkish | |
Serigos | Using distributional semantics in loanword research: A concept-based approach to quantifying semantic specificity of Anglicisms in Spanish | |
Goebel et al. | Summary of the competition on legal information, extraction/entailment (COLIEE) 2023 | |
Qian et al. | Detecting new Chinese words from massive domain texts with word embedding | |
Bonikowski et al. | From ends to means: The promise of computational text analysis for theoretically driven sociological research | |
Leydesdorff | In search of epistemic networks | |
Mihi et al. | MSTD: Moroccan sentiment twitter dataset | |
Shafiee et al. | Similarity versus relatedness: A novel approach in extractive Persian document summarisation | |
Parent | The most frequent English homonyms | |
Fournier-Tombs et al. | Big data and democratic speech: Predicting deliberative quality using machine learning techniques | |
Wu | Enriching a thesaurus as a better question-answering tool and information retrieval aid | |
Mhamed et al. | A deep CNN architecture with novel pooling layer applied to two Sudanese Arabic sentiment data sets | |
Goldwasser et al. | Introduction: Graphemic classifiers in complex script systems | |
Shatnawi et al. | A framework for retrieving Arabic documents based on queries written in Arabic slang language | |
Muncer et al. | The bigram trough hypothesis and the syllable number effect in lexical decision | |
Dhar et al. | Hybrid approach for text categorization: A case study with Bangla news article | |
Hlayel et al. | An algorithm to improve the performance of string matching |