Daelemans et al., 1996 - Google Patents

Part-of-speech tagging for Dutch with MBT, a memory-based tagger generator

Daelemans et al., 1996

Document ID: 10273434213070740366
Author: Daelemans W; Zavrel J; Berck P
Publication year: 1996
Publication venue: Informatiewetenschap

External Links

Cited by

Snippet

We present a part of speech tagger (morphosyntactic disambiguator) for Dutch, constructed by means of the Memory-Based Tagger generation method. In this approach, inductive learning methods are used to derive a tagger, lexicon and unknown word category guesser …

Continue reading at www.academia.edu (PDF) (other versions)

238000004458 analytical method 0 abstract description 8

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G06F17/279—Discourse representation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Daelemans et al.	1996	MBT: A memory-based part of speech tagger-generator
Bikel et al.	1999	An algorithm that learns what's in a name
Mansouri et al.	2008	Named entity recognition approaches
Brill et al.	1997	An overview of empirical natural language processing
EP0597630B1 (en)	2002-07-31	Method for resolution of natural-language queries against full-text databases
Kanakaraddi et al.	2018	Survey on parts of speech tagger techniques
Varma et al.	2009	IIIT Hyderabad at TAC 2009.
JP2001043236A (en)	2001-02-16	Synonym extracting method, document retrieving method and device to be used for the same
Budi et al.	2003	Association rules mining for name entity recognition
CN114254653A (en)	2022-03-29	Scientific and technological project text semantic extraction and representation analysis method
Krizhanovsky et al.	2013	An approach to automated construction of a general-purpose lexical ontology based on Wiktionary
Patman et al.	2003	Names: A new frontier in text mining
Wan et al.	2021	Enhancing metaphor detection by gloss-based interpretations
Antony et al.	2023	A survey of advanced methods for efficient text summarization
Horváth et al.	1999	Application of different learning methods to Hungarian part-of-speech tagging
Montoyo et al.	2000	Word sense disambiguation with specification marks in unrestricted texts
Daelemans et al.	1996	Part-of-speech tagging for Dutch with MBT, a memory-based tagger generator
Džeroski et al.	1999	Learning to lemmatise Slovene words
Khoufi et al.	2014	Chunking Arabic texts using conditional random fields
Budi et al.	2007	Application of association rules mining to Named Entity Recognition and co-reference resolution for the Indonesian language
Franz	1995	Learning PP attachment from corpus statistics
Talpur et al.	2023	Researching on Analysis and creating Corpus from Primary level Sindhi language Book for Sindhi
Raza et al.	2022	Saraiki Language Word Prediction And Spell Correction Framework
Tsai et al.	2002	Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Awwalu et al.	2021	A corpus based transformation-based learning for Hausa text parts of speech tagging