SIGIR 2008
[1] An Unsupervised Framework for Extracting and Normalizing Product Attributes from Multiple Web Sites
[2] Enhancing Keyword-Based Botanical Information Retrieval with Information Extraction
[3] An Alignment-based Pattern Representation Model for Information Extraction
WWW 2009
[4] StatSnowball: a Statistical Approach to Extracting Entity Relationships
[5] Incorporating Site-Level Knowledge to Extract Structured Data from Web Forums
[6] SOFIE: A Self-Organizing Framework for Information Extraction
[7] Extracting Key Terms From Noisy and Multi-theme Documents
[8] Extracting Article Text from the Web with Maximum Subsequence Segmentation
[9] Extracting Data Records from the Web Using Tag Path Clustering
[10] News Article Extraction with Template-Independent Wrapper
[11] Estimating Web Site Readability Using Content Extraction
CIKM2007
[12] Autonomously Semantifying Wikipedia
CIKM 2008
[13] Using Structured Text for Large-Scale Attribute Extraction
[14] Extremely Fast Text Feature Extraction for Classification and Indexing
[15] Metadata Extraction and Indexing for Map Search in Web Documents
[16] Extracting Non-Redundant Association Rules from Multi-Level Datasets
[17] Using Tag Semantic Network for Keyphrase Extraction in Blogs
[18] CoreEx: Heuristic Content Extraction from Online News Articles
[19] Academic Conference Homepage Understanding Using Constrained Hierarchical Conditional Random Fields
[20] Identifying Table Boundaries in Digital Documents via Sparse Line Detection
ICDE 2008
[21] An Algebraic Approach to Rule-Based Information Extraction
[22] Efficient Information Extraction over Evolving Text
[23] Automatic Extraction of Useful Facet Terms from Text Documents
[24] Extracting Loosely Structured Data Records Through Mining Strict Patterns
[25] LabelEx: A Scalable Approach for Extracting Form Labels
VLDB 2008
[26] StreamTX: Extracting Tuples from Streaming XML Data
[27] Scalable Ad-hoc Entity Extraction from Text Collections
[28] Learning to Extract Form Labels
[29] Large-Scale Collaborative Analysis and Extraction of Web Data
SIGKDD 2008
[30] Information Extraction from Wikipedia: Moving Down the Long Tail
[31] A Unified Approach for Schema Matching, Coreference, and Canonicalization
SIGMOD/POD 2008
[32] Toward Best-effort Information Extraction
[33] Damia: Data Mashups for Intranet Applications
ICDM2007
[34] Extracting Product Comparisons from Discussion Boards
ICDM 2006
[35] Extracting Keyphrases using Semantic Networks Structure Analysis
[36] High-Performance Unsupervised Relation Extraction from Large Corpora
本文来自CSDN博客,转载请标明出处:http://blog.csdn.net/ICTExtr9/archive/2009/07/08/4330426.aspx