


default search action
7th ICDM 2007: Omaha, Nebraska, USA
- Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), October 28-31, 2007, Omaha, Nebraska, USA. IEEE Computer Society 2007, ISBN 0-7695-3018-4

Regular Papers
- Sumeet Agarwal

, Shantanu Godbole, Diwakar Punjani, Shourya Roy:
How Much Noise Is Too Much: A Study in Automatic Text Classification. 3-12 - Shin Ando

:
Clustering Needles in a Haystack: An Information Theoretic Analysis of Minority and Outlier Detection. 13-22 - Benjamin Arai, Song Lin, Dimitrios Gunopulos

:
Efficient Data Sampling in Heterogeneous Peer-to-Peer Networks. 23-32 - Brett W. Bader, Richard A. Harshman, Tamara G. Kolda

:
Temporal Analysis of Semantic Graphs Using ASALSAN. 33-42 - Robert M. Bell, Yehuda Koren:

Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights. 43-52 - Axel Blumenstock, Franz Schweiggert, Markus Müller:

Rule Cubes for Causal Investigations. 53-62 - Björn Bringmann, Albrecht Zimmermann:

The Chosen Few: On Identifying Valuable Patterns. 63-72 - Deng Cai, Xiaofei He, Jiawei Han:

Spectral Regression: A Unified Approach for Sparse Subspace Learning. 73-82 - Toon Calders, Nele Dexters

, Bart Goethals
:
Mining Frequent Itemsets in a Stream. 83-92 - Shing-Kit Chan, Wai Lam, Xiaofeng Yu:

A Cascaded Approach to Biomedical Named Entity Recognition Using a Unified Model. 93-102 - Yanhua Chen, Manjeet Rege, Ming Dong, Jing Hua:

Incorporating User Provided Constraints into Document Clustering. 103-112 - Yixin Chen, Henry L. Bart Jr., Xin Dang, Hanxiang Peng:

Depth-Based Novelty Detection and Its Application to Taxonomic Research. 113-122 - David A. Cieslak, Nitesh V. Chawla

:
Detecting Fractures in Classifier Performance. 123-132 - Ying Cui, Xiaoli Z. Fern, Jennifer G. Dy:

Non-redundant Multi-view Clustering via Orthogonalization. 133-142 - Jing Gao, Wei Fan, Jiawei Han:

On Appropriate Assumptions to Mine Data Streams: Analysis and Practice. 143-152 - Mohammad Al Hasan, Vineet Chaoji

, Saeed Salem, Jérémy Besson, Mohammed Javeed Zaki
:
ORIGAMI: Mining Representative Orthogonal Graph Patterns. 153-162 - Huahai He, Ambuj K. Singh:

Efficient Algorithms for Mining Significant Substructures in Graphs with Quality Guarantees. 163-172 - Tianyi Jiang, Alexander Tuzhilin

:
Dynamic Micro Targeting: Fitness-Based Approach to Predicting Individual Preferences. 173-182 - Ruoming Jin, Yuri Breitbart, Chibuike Muoh:

Data Discretization Unification. 183-192 - Wei Jin, Rohini K. Srihari, Hung Hay Ho, Xin Wu:

Improving Knowledge Discovery in Document Collections through Combining Text Retrieval and Link Analysis Techniques. 193-202 - Vasileios Kandylas, S. Phineas Upham, Lyle H. Ungar:

Finding Cohesive Clusters for Analyzing Knowledge Communities. 203-212 - Rong Liu, Yong Shi:

Succinct Matrix Approximation and Efficient k-NN Classification. 213-222 - Xiaoming Liu, Zhaohui Wang, Zhilin Feng, Jinshan Tang

:
A Pairwise Covariance-Preserving Projection Method for Dimension Reduction. 223-231 - Bo Long, Xiaoyun Xu, Zhongfei (Mark) Zhang, Philip S. Yu:

Community Learning by Graph Approximation. 232-241 - Claudio Lucchese

, Salvatore Orlando
, Raffaele Perego
:
Parallel Mining of Frequent Closed Patterns: Harnessing Modern Computer Architectures. 242-251 - David R. Musicant, Janara M. Christensen, Jamie F. Olson:

Supervised Learning by Training on Aggregate Outputs. 252-261 - Feng Pan, Adam Roberts, Leonard McMillan

, David Threadgill
, Wei Wang
:
Sample Selection for Maximal Diversity. 262-271 - Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan:

Mining Statistical Information of Frequent Fault-Tolerant Patterns in Transactional Databases. 272-281 - Daniele Quercia, Stephen Hailes, Licia Capra:

Lightweight Distributed Trust Propagation. 282-291 - Jie Tang, Duo Zhang, Limin Yao:

Social Network Extraction of Academic Researchers. 292-301 - Dacheng Tao, Xuelong Li

, Xindong Wu, Stephen J. Maybank:
General Averaged Divergence Analysis. 302-311 - Nikolaj Tatti

:
Maximum Entropy Based Significance of Itemsets. 312-321 - Chao Wang, Venu Satuluri, Srinivasan Parthasarathy

:
Local Probabilistic Models for Link Prediction. 322-331 - Pu Wang, Jian Hu, Hua-Jun Zeng, Lijun Chen, Zheng Chen:

Improving Text Classification by Using Encyclopedia Knowledge. 332-341 - Richard C. Wang, William W. Cohen:

Language-Independent Set Expansion of Named Entities Using the Web. 342-350 - Xiaozhe Wang, Anthony Wirth

, Liang Wang:
Structure-Based Statistical Features and Multivariate Time Series Clustering. 351-360 - Junjie Wu, Hui Xiong, Jian Chen, Wenjun Zhou

:
A Generalization of Proximity Functions for K-Means. 361-370 - Liang Xiong, Fei Wang, Changshui Zhang:

Multilevel Belief Propagation for Fast Inference on Markov Random Fields. 371-380 - Dragomir Yankov, Eamonn J. Keogh, Umaa Rebbapragada:

Disk Aware Discord Discovery: Finding Unusual Time Series in Terabyte Sized Datasets. 381-390 - Zhongyuan Zhang, Tao Li, Chris H. Q. Ding, Xiang-Sun Zhang:

Binary Matrix Factorization with Applications. 391-400
Short Papers
- Sujeevan Aseervatham, Emmanuel Viennet, Younès Bennani

:
A Semantic Kernel for Semi-structured DocumentS. 403-408 - Ira Assent, Ralph Krieger, Emmanuel Müller

, Thomas Seidl
:
DUSC: Dimensionality Unbiased Subspace Clustering. 409-414 - Suhrid Balakrishnan, David Madigan:

Finding Predictive Runs with LAPS. 415-420 - Arindam Banerjee, Hanhuai Shan:

Latent Dirichlet Conditional Naive-Bayes Models. 421-426 - Deng Cai, Xiaofei He, Jiawei Han:

Efficient Kernel Discriminant Analysis via Spectral Regression. 427-432 - Mete Celik

, James M. Kang, Shashi Shekhar:
Zonal Co-location Pattern Discovery with Dynamic Parameters. 433-438 - Bi Chen, Qiankun Zhao, Bingjun Sun, Prasenjit Mitra:

Predicting Blogging Behavior Using Temporal and Social Networks. 439-444 - Chen Chen, Xifeng Yan, Feida Zhu

, Jiawei Han:
gApprox: Mining Frequent Approximate Patterns from a Massive Network. 445-450 - Weizhu Chen, Jun Yan, Benyu Zhang, Zheng Chen, Qiang Yang:

Document Transformation for Multi-label Feature Selection in Text Categorization. 451-456 - Haibin Cheng, Pang-Ning Tan

, Jon Sticklen, William F. Punch:
Recommendation via Query Centered Random Walk on K-Partite Graph. 457-462 - Kun Deng, Chris Bourke, Stephen Scott, Julie Sunderman, Yaling Zheng:

Bandit-Based Algorithms for Budgeted Learning. 463-468 - Ronen Feldman, Moshe Fresko, Jacob Goldenberg, Oded Netzer, Lyle H. Ungar:

Extracting Product Comparisons from Discussion Boards. 469-474 - Xiaoli Z. Fern, Chaitanya Komireddy, Margaret M. Burnett:

Mining Interpretable Human Strategies: A Case Study. 475-480 - Gemma C. Garriga, Hannes Heikinheimo, Jouni K. Seppänen:

Cross-Mining Binary and Numerical Attributes. 481-486 - Karam Gouda, Mosab Hassaan

, Mohammed Javeed Zaki
:
Prism: A Primal-Encoding Approach for Frequent Sequence Mining. 487-492 - Qi He

, Kuiyu Chang, Ee-Peng Lim:
Using Burstiness to Improve Clustering of Topics in News Streams. 493-498 - Alexander Hinneburg, Hans-Henning Gabriel, André Gohr

:
Bayesian Folding-In with Dirichlet Kernels for PLSI. 499-504 - Shen-Shyang Ho

, Roman A. Polyak:
Confident Identification of Relevant Objects Based on Nonlinear Rescaling Method and Transductive Inference. 505-510 - Han-Shen Huang, Yu-Ming Chang, Chun-Nan Hsu:

Training Conditional Random Fields by Periodic Step Size Adaptation for Large-Scale Text Mining. 511-516 - Ruizhang Huang, Wai Lam:

Semi-supervised Document Clustering via Active Learning with Pairwise Constraints. 517-522 - Tsuyoshi Idé

, Spiros Papadimitriou, Michail Vlachos
:
Computing Correlation Anomaly Scores Using Stochastic Nearest Neighbors. 523-528 - Frederik Janssen, Johannes Fürnkranz

:
On Meta-Learning Rule Learning Heuristics. 529-534 - Ming Jia, Shaozhi Ye, Xing Li, Julie A. Dickerson:

Web Site Recommendation Using HTTP Traffic. 535-540 - Ruoming Jin, Scott McCallen, Eivind Almaas

:
Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks. 541-546 - Nitin Jindal, Bing Liu:

Analyzing and Detecting Review Spam. 547-552 - David M. Kaplan

, David M. Blei:
A Computational Approach to Style in American Poetry. 553-558 - Yoshinobu Kawahara

, Takehisa Yairi, Kazuo Machida:
Change-Point Detection in Time-Series Data Based on Subspace Identification. 559-564 - Longin Jan Latecki

, Qiang Wang, Suzan Köknar-Tezel, Vasileios Megalooikonomou:
Optimal Subsequence Bijection. 565-570 - Srivatsan Laxman, Prasad Naldurg, Raja Sripada, Ramarathnam Venkatesan:

Connections between Mining Frequent Itemsets and Learning Generative Models. 571-576 - Tao Li, Chris H. Q. Ding, Michael I. Jordan

:
Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization. 577-582 - Yinglung Liang, Yanyong Zhang, Hui Xiong, Ramendra K. Sahoo:

Failure Prediction in IBM BlueGene/L Event Logs. 583-588 - Masoud Makrehchi, Mohamed S. Kamel

:
A Text Classification Framework with a Local Feature Ranking for Learning Social Networks. 589-594 - Hassan H. Malik, John R. Kender:

Optimizing Frequency Queries for Data Mining Applications. 595-600 - David Minnen, Charles L. Isbell Jr., Irfan A. Essa, Thad Starner:

Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery. 601-606 - Nam Nguyen, Rich Caruana:

Consensus Clusterings. 607-612 - Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Stephen B. Pope:

High-Speed Function Approximation. 613-618 - Jing Peng, Stefan A. Robila

:
Weighted Additive Criterion for Linear Dimension Reduction. 619-624 - Wen Pu, Ning Liu, Shuicheng Yan, Jun Yan, Kunqing Xie, Zheng Chen:

Local Word Bag Model for Text Categorization. 625-630 - Chedy Raïssi, Pascal Poncelet:

Sampling for Sequential Pattern Mining: From Static Databases to Data Streams. 631-636 - Calum S. Robertson, Shlomo Geva

, Rodney C. Wolff:
Can the Content of Public News Be Used to Forecast Abnormal Stock Market Behaviour? 637-642 - Jianhua Ruan

, Weixiong Zhang
:
An Efficient Spectral Algorithm for Network Community Discovery and Its Applications to Biological and Social Networks. 643-648 - Jerry Scripps, Pang-Ning Tan

, Abdol-Hossein Esfahanian:
Exploration of Link Structure and Community-Based Node Roles in Network Analysis. 649-654 - Pannagadatta K. Shivaswamy, Wei Chu, Martin Jansche:

A Support Vector Approach to Censored Targets. 655-660 - Muhammad Subianto

, Arno Siebes:
Understanding Discrete Classifiers with a Case Study in Gene Prediction. 661-666 - Atsuhiro Takasu, Daiji Fukagawa, Tatsuya Akutsu

:
Statistical Learning Algorithm for Tree Similarity. 667-672 - Gert Van Dijck, Marc M. Van Hulle

, Jo Van Vaerenbergh:
A Novel Criterion for Onset Detection: Differential Information Redundancy with Application to Human Movement Initiation. 673-678 - Florian Verhein, Sanjay Chawla:

Using Significant, Positively Associated and Relatively Class Correlated Rules for Associative Classification of Imbalanced Datasets. 679-684 - Jilles Vreeken

, Matthijs van Leeuwen, Arno Siebes:
Preserving Privacy through Data Generation. 685-690 - Qian Wan, Aijun An

:
Transitional Patterns and Their Significant Milestones. 691-696 - Xuerui Wang, Andrew McCallum, Xing Wei:

Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval. 697-702 - Pinata Winoto

, Yiu-ming Cheung
, Jiming Liu:
Mechanism Design for Clustering Aggregation by Selfish Systems. 703-708 - Ho Jin Woo, Won Suk Lee:

estMax: Tracing Maximal Frequent Itemsets over Online Data Streams. 709-714 - Dragomir Yankov, Eamonn J. Keogh, Kin Fai Kan:

Locally Constrained Support Vector Clustering. 715-720 - Yang Yu, Zhi-Hua Zhou, Kai Ming Ting:

Cocktail Ensemble for Regression. 721-726 - Qi Zhang, Jinze Liu, Wei Wang

:
Incremental Subspace Clustering over Multiple Data Streams. 727-732 - Yan Zhang, Xindong Wu:

Noise Modeling with Associative Corruption Rules. 733-738 - Ding Zhou, Sergey A. Orshanskiy, Hongyuan Zha, C. Lee Giles

:
Co-ranking Authors and Documents in a Heterogeneous Network. 739-744 - Ding Zhou, Isaac G. Councill, Hongyuan Zha, C. Lee Giles

:
Discovering Temporal Communities from Social Network Documents. 745-750 - Feida Zhu

, Xifeng Yan, Jiawei Han, Philip S. Yu:
Efficient Discovery of Frequent Approximate Sequential Patterns. 751-756 - Xingquan Zhu

, Peng Zhang, Xiaodong Lin, Yong Shi:
Active Learning from Data Streams. 757-762 - Xingquan Zhu

:
Lazy Bagging for Classifying Imbalanced Data. 763-768

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














