Greg Hamerly
Computer Science Department
School of Engineering & Computer Science (Rogers building)
105 Baylor Ave.
Baylor University
Waco, TX 76798-7356, USA
Phone: +1-254-710-6846
Fax: +1-254-710-3889
Email: hamerly at cs dot baylor dot edu
I am an assistant professor of computer science at Baylor University in Waco, Texas. On this webpage you can find more information about my research, teaching, schedule, publications, funding, and other things.
Research
My research is in machine learning, a sub-field of artificial intelligence. I am interested in unsupervised learning methods (clustering, primarily), and improving algorithms like k-means to be faster, give better-quality results, or to act more intelligently (such as by finding the number of clusters). I have applied machine learning to the task of optimizing computer program simulation, in the SimPoint project. I have done work with mixtures of naive Bayes models in unsupervised learning, for the detection of failures in hard drives. I'm very interested in the automation of learning, especially in the applications knowledge discovery in science, such as bioinformatics and astrophysics.
I am currently looking for graduate students to do funded research with me.
Here is my curriculum vitae, here are my publications, and here is my Ph.D. thesis.
Teaching
| CSI 3334 | Data structures and algorithms | Spring 2010, F09, S09, F08, S08, F07, S07, F06, S06, F05, F04 |
| CSI 4336 | Computer science theory | Fall 09, F08, F07, F06, F05 |
| CSI 4v96 | Competitive learning I/II/III | Spring 2010, F08, S08, F07, S07, F06, S06, F05 |
| CSI 5325 | Introduction to Machine Learning | Spring 2010, S09, S08, S07, S06, S05 |
Weekly schedule (Spring 2010)
Here is my usual weekly schedule for this semester. If you can't stop by my office during office hours, please email me or just stop by at another time; I am usually glad to meet with students at other times.
Here is my Google calendar that shows when I have scheduled events.
Publications
- , Making k-means even faster In proceedings of the 2010 SIAM international conference on data mining (SDM 2010), April 2010. [pdf]
- , , Hierarchical Stability-Based Model Selection For Clustering Algorithms In proceedings of the International Conference on Machine Learning and Applications, December 2009.
- , , Improving SimPoint accuracy for small simulation budgets with EDCM clustering In proceedings of the Second workshop on Statistical and Machine learning approaches to ARchitectures and compilaTion (SMART '08), January 2008. [pdf]
- , , , , , , Cross Binary Simulation Points In Proceedings of the International Symposium on Performance Analysis of Systems and Software (ISPASS-2007) March 2007. [pdf]
- , , PG-means: learning the number of clusters in data. In proceedings of the twentieth annual conference on neural information processing systems (NIPS), December 2006. [ps, pdf]
- , , , , , Using Machine Learning to Guide Architecture Simulation. Journal of Machine Learning Research, Volume 7, Pages 343-378, 2006. [abstract, pdf]
- , , , Comparing Multinomial and K-means clustering for SimPoint. In the 2006 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2006), 2006. [abstract, pdf]
- , , , , SimPoint 3.0: Faster and more flexible program analysis. Journal on Instruction-Level Parallelism (JILP), September, 2005. [pdf]
- , , , , SimPoint 3.0: Faster and more flexible program analysis. Workshop on Modeling, Benchmarking and Simulation (MoBS), June 2005. [abstract, pdf]
- , , , , SimPoint: Picking Representative Samples to Guide Simulation. Chapter 7 in the book Performance Evaluation and Benchmarking, edited by Lizy Kurian John and Lieven Eeckhout; published by CRC Press, 2005.
- , , , , , Motivation for variable length intervals and hierarchical phase behavior. In the 2005 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2005), March 2005. [abstract, pdf]
- , , , , , The strong correlation between code signatures and performance. In the 2005 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2005), March 2005. [abstract, pdf]
- , , Exploring perceptron-based register value prediction. In the second value-prediction and value-based optimization workshop, October 2004. [pdf]
- , , , How to use SimPoint to pick simulation points. In ACM SIGMETRICS Performance Evaluation Review, Volume 31(4), March 2004. [abstract, pdf]
- , , , , , Discovering and Exploiting Program Phases. In IEEE Micro: Micro's top picks from computer architecture conferences, November-December 2003 (Vol. 23, No. 6) [pdf].
- , , Learning the k in k-means. In proceedings of the seventeenth annual conference on neural information processing systems (NIPS), pages 281-288, December 2003. [ps, pdf] (Older UCSD technical report CS2002-0716 [ps])
- , , , Picking Statistically Valid and Early Simulation Points. In proceedings of the international conference on parallel architectures and compilation techniques (PACT), September 2003. [abstract, pdf]
- , , , , , Using SimPoint for Accurate and Efficient Simulation. In proceedings of the international conference on measurement and modeling of computer systems (SIGMETRICS), June 2003. [abstract, pdf]
- , , , , Automatically characterizing large scale program behavior. In proceedings of the tenth international conference on architectural support for programming languages and operating systems (ASPLOS), October 2002. [abstract, pdf]
- , , Alternatives to the k-means algorithm that find better clusterings. In proceedings of the ACM conference on information and knowledge management (CIKM), pages 600-607, November 2002. [ps] (Older UCSD technical report CS2002-0702 [ps])
- , , Bayesian approaches to failure prediction for disk drives. In proceedings of the eighteenth international conference on machine learning (ICML), June 2001. [ps]
Here are links to my coauthors and collaborators: , , , , , , , , , , , .
As far as I can tell, my Erdös number is 4. One such path is me → Charles Elkan → Russell Greiner → Michael S. O. Molloy → Paul Erdös.
Thesis
Defended on June 26, 2003.
Learning structure and concepts in data through data clustering.
[ps, 1.8MB]
[pdf, 3.6MB]
Thanks to Tom Stepleton at Sony Japan for catching a typo in one of my
equations.
Funding
My research and teaching work has been generously supported by the following:
- Intel Corporation
- Baylor University Young Investigator's Development Program
Current and former students
- Lei Meng
- Bing Yin, M.S. 2009, currently at Epic Systems
- Josh Johnston, M.S. 2007, working on a Ph.D. at Texas A&M
- Yu Feng, M.S. 2006, currently at Microsoft
Current/former Affiliations
- I am a member of ACM
- DTAI group at the KU Leuven computer science department
- AI lab at the UCSD computer science department
- Computer science department at CalPoly, San Luis Obispo
Links to other things I've done
- Some sites I frequent are in my bookmarks.
- Here is a list of software I have written and released.
- You can also find out more about me on my personal page.
Copyright © 2004 Greg Hamerly
Computer Science Department
Baylor University