Anagha Kulkarni

Assistant Professor
Computer Science Department
San Francisco State University
Ph: 415 338 2539
Office: Thornton Hall 970
ak@sfsu.edu

About

I am an Assistant Professor in the Computer Science Department at San Francisco State University. The work in my lab builds on information retrieval, natural language processing, and machine learning. My students and I are currently investigating problems in (a) core IR, where the focus is on balancing search precision and recall efficiently, (b) open-domain and live question answering, and (c) biomedical text analytics and retrieval.

I am also involved in PINC (Promoting INclusivity in Computing), a pilot program whose goals are (a) to improve diversity in computing and (b) to improve computing literacy in non-CS fields.

I serve as the sophomore advisor for the Computer Science Department, and have chaired the graduate admissions committee in the past.

Publications

(* indicates undergraduate and graduate student co-authors.)

Previde P.*, Thomas B.*, Wong M., Mallory E.*, Petkovic D., Altman R., and Kulkarni A. (2018) GeneDive: A Gene Interaction Search and Visualization Tool to Facilitate Precision Medicine. To appear in the Proceedings of the Pacific Symposium on Biocomputing. January 2018. Hawaii, USA.

Pithyaachariyakul C.* and Kulkarni A. (2018) Automated Question Answering System for Community-based Questions. To appear in the Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, Student Abstract and Poster Program. February 2018. New Orleans, USA.

Chuang M.* and Kulkarni A. (2017) Improving Shard Selection for Selective Search. In the Proceedings of the Asia Information Retrieval Societies Conference. November 2017. Jeju, Korea.

Chuang M.* and Kulkarni A. (2017) Balancing Precision and Recall with Selective Search. In the Proceedings of the Annual International Symposium on Information Management and Big Data. September 2017. Lima, Peru.

Pithyaachariyakul C.*, Khvalchik M.*, Kulkarni A. (2017) Automated Question Answering System. In the Proceedings of the Annual International Symposium on Information Management and Big Data. September 2017. Lima, Peru.

Khvalchik M.*, and Kulkarni A. (2017) Open-domain Non-factoid Question Answering. In the Proceedings of the International Conference on Text, Speech, and Dialogue. August 2017. Prague, Czechia.

Khvalchik M.*, Pithyaachariyakul C.*, and Kulkarni A. (2017) Answering the Hard Questions. In the Proceedings of the Conference on Language, Data, and Knowledge. June 2017, Galway, Ireland.

Wei W.*, Kulkarni A., Wong M. (2017) PF-Words: Biomedical Literature Based Protein Function Search. In the Proceedings of the International Conference on Bioinformatics and Computational Biology. March 2017, Honolulu, HI, USA.

Saylor B.*, Kulkarni A., Martinez N., and Yoon I. (2016) Optimizing Ecological Sustainability by Integrating Intuition and Machine Learning via Gamification. In the Proceedings of the International Conference on Computational Sustainability. July 2016, Ithaca, NY, USA.

Khvalchik M.* and Kulkarni A. (2016) San Francisco State University at LiveQA Track of TREC 2016. In the Proceedings of the Twenty-Fifth Text REtrieval Conference (TREC 2016). National Institute of Standards and Technology, special publication.

Chuang M.* and Kulkarni A. (2016) San Francisco State University at Total Recall Track of TREC 2016. In the Proceedings of the Twenty-Fifth Text REtrieval Conference (TREC 2016). National Institute of Standards and Technology, special publication.

Kulkarni A. and Callan J. (2015) Selective Search: Efficient and Effective Search of Large Textual Collections. ACM Transactions on Information Systems, 33(4). ACM. 2015.

Kulkarni A. (2015) ShRkC: Shard Rank Cutoff Prediction for Selective Search. In the Proceedings of the International Symposium on String Processing and Information Retrieval, pages 337-349, Sept 2015, London, UK.

Kulkarni A. (2015) Searching Large Textual Dataset With Limited Computational Resources. In the Proceedings of the Grace Hopper Conference. Oct 2015, Houston, USA.

Bhandari A.*, Klinkhammer J.*, and Kulkarni A. (2014) San Francisco State University at TREC 2014: Clinical Decision Support System Track and Microblog Track. In Proceedings of the Twenty-Third Text REtrieval Conference (TREC 2014). National Institute of Standards and Technology, special publication. 2015.

Kulkarni A. (2013) Efficient and Effective Large-scale Search. Carnegie Mellon University.

Kulkarni A., Tigelaar A., Hiemstra D. and Callan J. (2012) Shard Ranking and Cutoff Estimation for Topically Partitioned Collections: In the Proceedings of the ACM Conference on Information and Knowledge Management, pages 555-564, Oct 2012, Maui, USA.

Kulkarni A., Teevan J., Svore K. and Dumais S. (2012) Creating Temporally Dynamic Web Search Snippets: In the Proceedings of the Poster Session of Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1045-1046, Aug 2012, Portland, USA.

Kulkarni A., Teevan J., Svore K. and Dumais S. (2011) Understanding Temporal Query Dynamics: In the Proceedings of the ACM International Conference on Web Search and Data Mining, pages 167-176, Feb 2011, Hong Kong.

Kulkarni A. and Callan J. (2010) Document Allocation Policies for Selective Searching of Distributed Indexes: In the Proceedings of the ACM Conference on Information and Knowledge Management, pages 449-458, Oct 2010, Toronto, Canada.

Kulkarni A. and Callan J. (2010) Topic-based Index Partitions for Efficient and Effective Selective Search: In the Proceedings of SIGIR 2010 Workshop on Large-Scale Distributed Information Retrieval, July 2010, Geneva, Switzerland.

Kulkarni A. and Pedersen T. (2008) Name Discrimination and E-mail Clustering Using Unsupervised Clustering of Similar Concepts: Journal of Intelligent Systems (Special Issue: Recent Advances in Knowledge-Based Systems and Their Applications), 17(1-3), 37-50, 2008.

Kulkarni A. and Callan J. (2008) Dictionary Definitions based Homograph Identification using a Generative Hierarchical Model: In the Proceedings of the Association for Computational Linguistics: Human Language Technologies, pages 85-88, June 15-20, 2008, Columbus, Ohio, USA.

Kulkarni A., Heilman M., Callan J. and Eskenazi M. (2008) Word Sense Disambiguation for Vocabulary Learning: In the Proceedings of the International Conference on Intelligent Tutoring Systems, pages 500-509, June 23-27, 2008, Montreal, Canada.

Kulkarni A., Callan J. and Eskenazi M. (2007) Dictionary Definitions: The Likes and the Unlikes: In the Proceedings of the SLaTE Workshop on Speech and Language Technology in Education, October 1-3, 2007, Farmington, PA, USA.

Pedersen T. and Kulkarni A. (2007) Unsupervised Discrimination of Person Names in Web Contexts: In the Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics, pages 299-310, February 18-24, 2007, Mexico City.

Pedersen T. and Kulkarni A. (2007) Discovering Identities in Web Contexts with Unsupervised Clustering: In the Proceedings of the IJCAI-2007 Workshop on Analytics for Noisy Unstructured Text Data, January 8, 2007, Hyderabad, India.

Kulkarni A. and Pedersen T. (2006) How many different "John Smiths", and who are they?: In the Proceedings of the Student Abstract and Poster Session of the Twenty-First National Conference on Artificial Intelligence, pages 1885-1886, July 16-20, 2006, Boston, Massachusetts.

Pedersen T. and Kulkarni A. (2006) Automatic Cluster Stopping with Criterion Functions and the Gap Statistics: In the Proceedings of the Demonstration Session of the Human Language Technology Conference and the Annual Meeting of the North American Chapter of the Association for Computational Linguistics, pages 276-279, June 6, 2006, New York City.

Pedersen T., Kulkarni A., Angheluta R., Kozareva Z. and Solorio T. (2006) Improving Name Discrimination: A Language Salad Approach: In the Proceedings of the EACL 2006 Workshop on Cross-Language Knowledge Induction, April 3, 2006, Trento, Italy.

Pedersen T. and Kulkarni A. (2006) Selecting the "Right" Number of Senses Based on Clustering Criterion Functions: In the Proceedings of the Posters and Demo Program of the European Chapter of the Association for Computational Linguistics, pages 111-114, April 3-7, 2006, Trento, Italy.

Pedersen T., Kulkarni A., Angheluta R., Kozareva Z. and Solorio T. (2006) An Unsupervised Language Independent Method of Name Discrimination Using Second Order Co-Occurrence Vectors: In the Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics, Lecture Notes in Computer Science, Springer, pages 208-222, February 19-25, 2006, Mexico.

Kulkarni A. and Pedersen T. (2005) Name Discrimination and Email Clustering using Unsupervised Clustering and Labeling of Similar Contexts: In Proceedings of the Indian International Conference on Artificial Intelligence, pages 703-722, December 20-22, 2005, Pune, India.

Pedersen T. and Kulkarni A. (2005) Identifying Similar Words and Contexts in Natural Language with SenseClusters: In AAAI ’05: Proceedings of the National Conference on Artificial Intelligence, pages 1694-1695, July 2005, Pittsburgh, PA, USA. (Intelligent Systems Demonstration)

Kulkarni A. and Pedersen T. (2005) SenseClusters: Unsupervised Clustering and Labeling of Similar Contexts: In Proceedings of the Demonstration and Interactive Poster Session of the Annual Meeting of the Association for Computational Linguistics, pages 105-108, June 26, 2005, Ann Arbor.

Kulkarni A. (2005) Unsupervised Discrimination and Labeling of Ambiguous Names: In Proceedings of the Student Research Workshop of the 43rd Annual Meeting of the Association for Computational Linguistics, June 25-30, 2005, Ann Arbor, MI, USA.

Pedersen T., Purandare A., and Kulkarni A. (2005) Name Discrimination by Clustering Similar Contexts: In Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics, Lecture Notes in Computer Science, Springer, pages 226-237, February 13-19, 2005, Mexico.

Research Group

Mon-Shih Chuang works on large-scale search.

Meghana Dayanand works on biomedical retrieval.

Rajani Maski works on large-scale search problems in the context of open-source search engines.

Lowell Milleken works on biomedical retrieval.

Jose Ortiz-Costa works on automated question-answering.

Chanin Pithyaachariyakul works on automated question-answering.

Nikhil Paratkar works on large-scale search.

Amanda Nikkole Robinsen works on automated question-answering.

Brook Thomas works on biomedical retrieval with a focus on front-end.

Courses Taught

I teach the Search Engines course and the Introduction to Database Management Systems course on a regular basis. In the past I have taught Introduction to Computer Programming and Discrete Mathematical Structures for Computer Science. In Fall 2017 I started teaching a new course (CSc 698: Topics in Computing) for the PINC program. In this two-semester course, students work in groups on a project of their choice. Here are some of the project topics that students undertook in Fall 2017: social media data analysis to detect messaging biases related to stigmatized medical conditions, image processing for biological cells, classification of sequencing data for HIV subtypes, an educational game for teaching genetics, and alcohol level detection using facial image processing, audio analysis, and geolocation.