Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Proposal for a Track at the 2006 ACM Symposium on Applied Computing on Data Mining Hasan Jamil Department of Computer Science Mississippi State University, USA jamil@cs.wayne.edu and Rosa Meo Department of Computer Science University of Torino, Italy meo@di.unito.it Data mining from traditional relational databases as well as from non-traditional ones such as semi-structured data, web data and scientific databases such as biological, earth and atmospheric, solar system, ecological, animal behavior, linguistic and sensor data have recently become a popular way of discovering hidden knowledge. In the context of relational and traditional data, methods such as association rules, chi square rules, ratio rules, implication rules, etc. have been proposed in multiple, varied contexts. In the context of non-traditional data, newer, more experimental yet novel techniques are being proposed. There is an agreement among the researchers across communities that data mining is a key ingredient for success in their respective area of research and development. Consequently, interest in developing new techniques for data mining has peaked and a tremendous stride is being made to answer interesting and fundamental questions in various disciplines using data mining. There is a new interest in developing techniques for obtaining solid data mining models from distributed databases with privacy and autonomy guarantees. In the past, researchers mainly focused on algorithmic issues in data mining and placed much emphasis on scalability. Recently, the focus has shifted towards a more declarative way of answering questions using data mining that gave rise to the concept of mining queries. Declarative queries facilitate building larger systems using small mining building blocks. In such a paradigm, the system assumes the responsibility of optimization and scalability. Such an approach will be extremely useful in developing solutions in complex systems such as scientific databases where numerous domain specific knowledge interact with mining queries. In such an environment, the choice of algorithm, execution method, and processing strategy using secondary information often become complicated and time consuming. A well-developed and robust implementation help eliminate relational and focus on their details. strategy for declarative systems can these obstacles in a way similar to deductive databases and let the users application rather than on low level As part of the 2006 ACM SAC meeting, we propose to organize a Track on "Data Mining" that will encourage submissions in all areas of data mining in traditional as well as emerging non-standard databases. We will emphasize submissions on declarative data mining, intelligent querying and associated issues such as optimization, indexing, query processing, languages and constraints. We also encourage submissions in the area of data preprocessing such as data cleaning, discretization and sampling. The study of new data models and techniques for privacy preserving data mining, and data security will be encouraged. This will allow also to exploit the synergy of mining in different databases, in parallel, distributed or grid environments. We aim at organizing at least five sessions consisting of about twenty papers in total. Because of our focus and emphasis on the issues presented before, this ACM track will be distinct from others such as SIGMOD, VLDB, ICDE and even SIGKDD, and PAKDD, and specialized workshops such as DMKD as their focus is too general and covers a broad range of issues. This will be our fifth such Track in ACM SAC. As you are aware, the previous editions of this Track on Data Mining were successful, and we would like to continue with the tradition and see this Track grow and evolve. If approved, as before, we will develop a web-site to manage the activities of the Track, and electronically advertise in the specialized user groups and research networks and institutions. We will advertise in DBWORLD list, SIGKDD list, KD-net, the European Network of Excellence in Knowledge Discovery, and our own list of data mining researchers that includes about 200 active researchers. We will develop a system to collect, review, select and put together an attractive program solely through the electronic media. In light of our experience and the increase in submissions last year, we plan to form a large program committee with almost 60 distinguished members including eminent researchers such as Jiawei Han, Carlo Zaniolo, Mohammed Zaki, and others for the purpose of reviewing submissions and developing an attractive Track program. Like previous years, we will also consider inviting selected authors to submit an extended edition of their contributions for a special issue of a journal or an edited book. We would like to continue our tradition and organize and manage the 2006 ACM SAC Data Mining jointly. Like the past year, Hasan Jamil and Rosa Meo will serve as the Program Co-Chairs for the 2006 track. We are also considering adding third Co-Chair to increase circulation, number of submission and increase diversity of geographic presentation. We will let SAC know about our decision soon. Dr. Hasan Jamil has significant experiences in organizing and managing such events and is involved in declarative data mining. Dr. Jamil has been involved in organizing AMAST Montreal and Sydney meetings. He was also one of the organizers of the 2000 IEEE BIBE Symposium and was the PC Chair for the 2001 and 2003 IEEE BIBE Symposiums. He was member of the PC of DaWaK 2001/2002 and 2001/2002 ACM SIGKDD workshop on Data Mining in Bioinformatics, DBFusion 2002, NGITS 2002, and so on. He is also a member of the IASTED Technical Committee on Databases, and the Chair of the IFIP TC 5 Special Interest Group on Bioinformatics. Dr. Hasan Jamil's research interests include databases and Bioinformatics. He has published several articles in leading database, logic programming and Bioinformatics conferences such as ACM SIGMOD, VLDB, ICDT, KR, ILPS, etc. He has organized the ACM SAC Data Mining track for the last four years. He also holds grants from NSF and USDA for his Bioinformatics projects. Dr. Jamil's home page at www.cs.wayne.edu/~jamil/ may be consulted for more information. Rosa Meo actively works in database and data mining research since the last ten years. She has developed database system prototypes for data mining and worked in European funded projects (V Framework) on data mining themes and inductive databases. She published papers on major database and data mining International Conferences and Journals, such as ACM TODS, Kluwer DMKD, IEEE IT, VLDB, ICDE, EDBT, etc. She has been co-chair in 2002 for DTDM (DataBase Technologies for Data Mining) Workshop at EDBT, and KDID (Knowledge Discovery in Inductive Database) at ECML/PKDD. This year (2005), she is a member of the PC of ACM SIGKDD, VLDB (core database and IIS tracks), IEEE ICDM, ECML/PKDD. Rosa Meo's home page at http://www.di.unito.it/~meo/ can be consulted for more information. As in the past, we hope to form a large Program Committee for the purpose of selecting outstanding papers and developing an interesting program. Tentatively, we propose the following PC for 2006 ACM SAC DM track. Proposed PC: Reda Alhajj Canada Elena Baralis alhajj@cpsc.ucalgary.ca, University of Calgary, baralis@polito.it, Politecnico di Torino, Italy Roberto Bayardo bayardo@alum.mit.edu, IBM Almaden Research Center, USA Christian Bohm christian.boehm@umit.at, UMIT, Austria Francesco Bonchi francesco.bonchi@isti.cnr.it, ISTI-CNR, Pisa, Italy Marco Botta botta@di.unito.it, University of Torino, Italy Jean-Francois Boulicaut Jean-Francois.Boulicaut@lisi.insa-lyon.fr, INSA LISI, Lyon, France Toon Calders toon.calders@ua.ac.be, University of Antwerp, BelgiumSaso Dzeroski Bruno Cremilleux bruno.cremilleux@info.unicaen.fr, GREYC Department d'Informatique, France Ding Chris chqding@lbl.gov, Lawrence Berkeley National Laboratory, USA Saso Dzeroski Saso.Dzeroski@ijs.si, Jozef Stefan Institute, Slovenia Johannes Gerke johannes@CS.Cornell.EDU, Cornell University, USA Fosca Giannotti fosca.giannotti@isti.cnr.it, CNUCE-CNR of Pisa, Italy Bart Goethals bart.goethals@cs.helsinki.fi, Helsinki Institute for Information Technology (HIIT), Finland Le Gruenwald ggruenwald@ou.edu, University of Oklahoma, USA Dimitrios Gunopulos dg@cs.ucr.edu, University of California, Riverside, USA Jiawei Han hanj@cs.uiuc.edu, University of Illinois at Urbana-Champaign David Hand djhand@doc.ic.ac.uk, Imperial College, London, UK Sherri Harms harmssk@unk.edu, University of Nebraska, Kearney, USA Tomasz Imielinski imielins@cs.rutgers.edu, Rutgers, the State University of New Jersey, USA Thorsten Joachims tj@cs.cornell.edu, Cornell University, USA Daniel A. Keim keim@informatik.uni-halle.de , Martin-LutherUniversity Halle-Wittenberg, Germany Kristian Kersting kersting@informatik.uni-freiburg.de, Albert-Ludwigs-University, Freiburg, Germany Marzena Kryszkiewicz mkr@ii.pw.edu.pl, Warsaw University of Technology, Poland Krzysztof Koperski krisk@insightful.com, Insightful Corporation Stefan Kramer skramer@informatik.uni-freiburg.de, Institut für Informatik, Albert-Ludwigs-Universität Freiburg, Germany Pier Luca Lanzi lanzi@elet.polimi.it, Politecnico di Milano, Italy Dominique Laurent laurent@univ-tours.fr, University of Tours, France Nada Lavrac Nada.Lavrac@ijs.si, Jozef Stefan Institute, Slovenia Donato Malerba malerba@di.uniba.it, University of Bari, Italy Giuseppe Manco manco@isi.cs.cnr.it, ICAR-CNR, Italy Andrew W. Moore awm@cs.cmu.edu, Carnegie Mellon University, USA Katharina Morik morik@kismet.cs.uni-dortmund.de, University of Dortmund, Germany Raymond T. Ng rng@cs.ubc.ca, University of British Columbia, USA Salvatore Orlando orlando@dsi.unive.it, Università di Ca' Foscari, Italy Stefano Paraboschi parabosc@elet.polimi.it, University of Bergamo, Italy Jian Pei jianpei@cse.buffalo.edu, University at Buffalo, The State University of New York, USA Giuseppe Psaila psaila@unibg.it, University of Bergamo, Italy Rauch Jan rauch@vse.cz, University of Economics, Czech Republic Christophe Rigotti christophe.rigotti@lisi.insa-lyon.fr, INSA LISI, Lyon, France Domenico Sacca' sacca@unical.it, Universita' della Calabria, Italy Lorenza Saitta lorenza.saitta@mfn.unipmn.it, AMEDEO AVOGADRO University, Eastern Piedmont, Italy Maria Luisa Sapino mlsapino@di.unito. it, University of Torino, Italy Sunita Sarawagi sunita@it.iitb.ac.in, KR School of Information Technology, IIT Bombay, India Savinov Alexandr alexandr.savinov@ais.fraunhofer.de, Fraunhofer AIS, Germany Arno Siebes Arno.Siebes@cs.uu.nl, Utrecht University, The Netherland Ramakrishnan Srikant srikant@us.ibm.com, IBM Almaden Research Center, USA Einoshin Suzuki suzuki@ynu.ac.jp, Yokohama National University, Japan Hannu TT Toivonen hannu.toivonen@cs.helsinki.fi, University of Helsinki, Finland Franco Turini turini@di.unipi.it, University of Pisa, Italy Jiong Yang jioyang@cs.uiuc.edu, University of Illinois at Urbana Champaign, USA Philip S. Yu psyu@us.ibm.com, IBM T.J. Watson Research Center, USA Raymond Wong wong@cse.unsw.edu.au, University of New South Wales, Australia Osmar Zaiane zaiane@cs.ualberta.ca, University of Alberta, Edmonton, Alberta, Canada Mohammed Zaki zaki@cs.rpi.edu, Rensselaer Polytechnic Institute, USA Kang Zhang kzhang@utdallas.edu, The University of Texas at Dallas, USA Carlo Zaniolo zaniolo@cs.ucla.edu, University California Los Angeles, USA