Tan Qingzhao (Cynthia) 谭清照


Department of Computer Science and Engineering

College of Engineering,

The Pennsylvania State University,

111 Information Sciences and Technology Building,

University Park,

PA 16802, USA

Email: qtan@cse.psu.edu


General Information

In June 2002 I obtained my Bachelor's degree from School of Computer Science & Engineering, South China University of Technology (SCUT). In July 2004 I got my Master's degree from the Department of Computer Science, Hong Kong University of Science and Technology (HKUST). And in Fall 2008 I got my Ph.D. from Department of Computer Science and Engineering in Penn State University. I was co-advised by Prof. Prasenjit Mitra and Prof. C. Lee Giles.


Research Project

  • ChemXSeer: a project on cyberinfrastructure portal for kinetic chemistry.
  • ArchSeer: a digital library system for archeology.


  • Research Interest

  • Focused crawler, incremental crawler
  • Web information retrieval,  search engine


  • Publication

    Qingzhao Tan, Designing New Crawling and Indexing Techniques for Web Search Engines. PhD dissertation, Octobor, 2008. (link)

    Qingzhao Tan, Prasenjit Mitra, C. Lee Giles, Effectively Searching Maps in Web Documents, in Proceedings of the 31st European Conference on IR Research on Advances in information retrieval (ECIR2009). Toulouse, France, April, 2009.

    Qingzhao Tan, Prasenjit Mitra, C. Lee Giles, Designing Clustering-Based Web Crawling Policies for Search Engine Crawlers, in Proceedings of the 16th Conference on Information and Knowledge Management (CIKM2007). Lisbon, Portugal, November, 2007.

    Qingzhao Tan, Ziming Zhuang, Prasenjit Mitra, C. Lee Giles, Efficiently Detecting Webpage Updates Using Samples, in Proceedings of the 17th International Conference on Web Engineering (ICWE2007). Como, Italy, July, 2007.

    Qingzhao Tan, Ziming Zhuang, Prasenjit Mitra, C. Lee Giles, A Clustering-Based Sampling Approach for Refreshing Search Engine’s Database, in Proceedings of the 10th International Workshop on the Web and Databases (WebDB2007). Beijing, China, June, 2007.

    Qingzhao Tan, Ziming Zhuang, Prasenjit Mitra, C. Lee Giles, Designing Efficient Sampling Techniques to Detect Webpage Updates, in Proceedings of the 16th international conference on World Wide Web (WWW2007). Poster. Banff, Canada, May, 2007.

    Bingjun Sun, Qingzhao Tan, Prasenjit Mitra, C. Lee Giles, Extraction and Search of Chemical Formulae in Text Documents on the Web, in Proceedings of the 16th international conference on World Wide Web (WWW2007). Banff, Canada, May, 2007.

    Huajing Li, Qingzhao Tan, Wang-Chien Lee, Efficient Progressive Skylining for Peer-to-Peer Systems, Proceedings of  the 1st International Conference on Scalable Information Systems (InfoScale2006), Hong Kong, China, May 30 - June 1, 2006.

    Qingzhao Tan, Wang-Chien Lee, Baihua Zheng, Peng Liu, Dik Lun Lee, Balancing Performance and Confidentiality in Air Index, Proceedings of the Conference on Information and Knowledge Management (CIKM2005).

    Qingzhao Tan, Yiping Ke, Wilfred Ng, WUML: A Web Usage Manipulation Language for Querying Web Log Data, Proceedings of the 23rd International Conference on Conceptual Modeling (ER2004), Shanghai, China, November, 8-12, 2004. 

    Lin Deng, Xiaoyong Chai, Qingzhao Tan, Wilfred Ng, Dik-Lun Lee, Spying Out Real User Preferences for Metasearch Engine Personalization, Proceedings of the 6th WEBKDD workshop: Webmining and Web Usage Analysis (WEBKDD2004), in conjunction with the 10th ACM SIGKDD conference (KDD2004), Seattle, Washington, USA, August 22, 2004. 

    Qingzhao Tan, Xiaoyong Chai, Wilfred Ng, Dik-Lun Lee, Applying Co-training to Clickthrough Data for Search Engine Adaptation, Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA2004), Jeju Island, KOREA, March 17-19, 2004.


    Teaching Assistant

  • CSE541 (Fall 2005)
  • CSE441 (Spring 2006)


  • Useful links

    VLDB

    IEEE Computer Society

    ACM SIGMOD

    ACM SIGCOMM

    ACM SIGIR

    SIGMOD Proceedings

    WWW Proceedings

    SIGIR Proceedings

    ACM Online Calendar