Adam St Arnaud, David Beck, and Grzegorz Kondrak. Identifying Cognate Sets Across Dictionaries of Related Languages. Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), Copenhagen, Denmark, September 2017.
Marc Franco-Salvador, Grzegorz Kondrak. and Paolo Rosso. Bridging the native language and the language variety identification tasks. The 21st International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES), Marseille, France, September 2017. [PDF]
Garrett Nicolai and Grzegorz Kondrak. Morphological Analysis without Expert Annotation. The 15th Meeting of the European Chapter of the Association of Computational Linguistics (EACL 2017), Valencia, Spain, April 2017.
Bradley Hauer, Garrett Nicolai and Grzegorz Kondrak. Bootstrapping Unsupervised Bilingual Lexicon Induction. The 15th Meeting of the European Chapter of the Association of Computational Linguistics (EACL 2017), Valencia, Spain, April 2017.
2016Garrett Nicolai and Grzegorz Kondrak. Leveraging Inflection Tables for Stemming and Lemmatization. The 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Berlin, Germany, August 2016.
Bradley Hauer, and Grzegorz Kondrak. Decoding Anagrammed Texts Written in an Unknown Language and Script. Transactions of the Association for Computational Linguistics (TACL), April 2016, pp. 75-86.
Mohammad Salameh, Colin Cherry and Grzegorz Kondrak. Integrating Morphological Desegmentation into Phrase-based Decoding. The Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2016), San Diego, CA, June 2016.
2015Garrett Nicolai and Grzegorz Kondrak. English orthography is not "close to optimal". The Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2015), Denver, CO, June 2015.
Lei Yao and Grzegorz Kondrak. Joint Generation of Transliterations from Multiple Representations. The Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2015), Denver, CO, June 2015.
Garrett Nicolai, Colin Cherry and Grzegorz Kondrak. Inflection Generation as Discriminative String Transduction. The Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2015), Denver, CO, June 2015.
Garrett Nicolai, Colin Cherry and Grzegorz Kondrak. Morpho-syntactic Regularities in Continuous Word Representations: A multilingual study. Workshop on Vector Space Modeling for Natural Language Processing (VSM-NLP), Denver, CO, June 2015.
Garrett Nicolai, Bradley Hauer, Mohammad Salameh, Adam St Arnaud, Ying Xu, Lei Yao, and Grzegorz Kondrak, Multiple System Combination for Transliteration. The 5th Named Entities Workshop (NEWS 2015). Beijing, China, July 2015.
2014Bradley Hauer, Ryan Hayward and Grzegorz Kondrak. Solving Substitution Ciphers with Combined Language Models. The 25th International Conference on Computational Linguistics (COLING 2014), Dublin, Ireland, August 2014.
Mohammad Salameh, Colin Cherry and Grzegorz Kondrak. Lattice Desegmentation for Statistical Machine Translation. The 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, MD, June 2014.
Garrett Nicolai, and Grzegorz Kondrak. Does the Phonology of L1 Show Up in L2 Texts? The 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, MD, June 2014.
2013Hua He, Denilson Barbosa and Grzegorz Kondrak. Identification of Speakers in Novels. The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria. August 2013.
Bradley Hauer and Grzegorz Kondrak. Automatic Generation of English Respellings. The Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013), Atlanta, GA, June 2013. (Best Student Paper Award) [TALK]
Garrett Nicolai, Bradley Hauer, Mohammad Salameh, Lei Yao, and Grzegorz Kondrak. Cognate and Misspelling Features for Native Language Identification. Eighth Workshop on Innovative Use of NLP for Building Educational Applications. NLI Shared Task. Atlanta, GA, June 2013.
Grzegorz Kondrak. Word similarity, cognation, and translational equivalence. Approaches to Measuring Linguistic Differences, Lars Borin and Anju Saxena (editors), pp. 375-386, De Gruyter Mouton, 2013. [PDF]
2012Grzegorz Kondrak, Xingkai Li, and Mohammad Salameh, Transliteration Experiments on Chinese and Arabic. The 4th Named Entities Workshop (NEWS 2012). Jeju, Korea, July 2012.
Aditya Bhargava and Grzegorz Kondrak. Leveraging supplemental representations for sequential transduction. The 13th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2012), Montreal, QC, June 2012.
Grzegorz Kondrak.
Similarity Patterns in Words (invited talk).
The EACL 2012 Workshop on
Uncovering Language History from Multilingual Resources,
Avignon, April 2012.
[PDF]
Aditya Bhargava and Grzegorz Kondrak. How do you pronounce your name? Improving G2P with transliterations. The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2011), Portland, OR, June 2011.
Cecil H. Brown, David Beck, Grzegorz Kondrak, James K. Watters and Soren Wichmann. Totozoquean. International Journal of American Linguistics, 77(3), July 2011, pp. 323-372.
Jessica Enright and Grzegorz Kondrak. The application of chordal graphs to inferring phylogenetic trees of languages. The 5th International Joint Conference on Natural Language Processing (IJCNLP 2011).
Bradley Hauer and Grzegorz Kondrak. Clustering Semantically Equivalent Words into Cognate Sets in Multilingual Lists. The 5th International Joint Conference on Natural Language Processing (IJCNLP 2011).
Aditya Bhargava, Bradley Hauer and Grzegorz Kondrak. Leveraging Transliterations from Multiple Languages. The 3rd Named Entities Workshop (NEWS 2011).
2010Shane Bergsma, Aditya Bhargava, Hua He, and Grzegorz Kondrak. Predicting the Semantic Compositionality of Prefix Verbs. Conference on Empirical Methods in Natural Language Processing (EMNLP 2010), pp. 293-303, Cambridge, MA, October 2010.
Sittichai Jiampojamarn, and Grzegorz Kondrak. Letter-Phoneme Alignment: An Exploration. The 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), pp. 780-788, Uppsala, Sweden. July 2010.
Aditya Bhargava and Grzegorz Kondrak. Language identification of names with SVMs. Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), pp. 693-696, Los Angeles, CA. June 2010.
Sittichai Jiampojamarn, Colin Cherry and Grzegorz Kondrak. Integrating Joint n-gram Features into a Discriminative Training Framework. Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), pp. 697-700, Los Angeles, CA. June 2010.
Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, and Grzegorz Kondrak. Transliteration Generation and Mining with Limited Training Resources. The 2nd Named Entities Workshop (NEWS 2010).
2009 Grzegorz Kondrak.
Identication of Cognates and Recurrent Sound Correspondences in
Word Lists.
Traitement
automatique des langues et langues anciennes
50(2), October 2009, pp. 201-235.
[PDF]
[BIB]
Qing Dou,
Shane Bergsma,
Sittichai Jiampojamarn,
and
Grzegorz Kondrak.
A Ranking Approach to Stress Prediction for Letter-to-Phoneme
Conversion.
Joint Conference of the 47th Annual Meeting of the Association for
Computational Linguistics and 4th International Joint Conference on
Natural Language Processing of the AFNLP
(ACL-IJCNLP 2009),
pp. 118-126, Singapore, August 2009.
[PDF]
[BIB]
Kenneth Dwyer
and
Grzegorz Kondrak.
Reducing the Annotation Effort for Letter-to-Phoneme Conversion.
Joint Conference of the 47th Annual Meeting of the Association for
Computational Linguistics and 4th International Joint Conference on
Natural Language Processing of the AFNLP
(ACL-IJCNLP 2009),
pp. 127-135, Singapore, August 2009.
[PDF]
[BIB]
Susan Bartlett,
Grzegorz Kondrak
and
Colin Cherry.
On the Syllabification of Phonemes.
Proceedings of
Human Language Technologies: The Annual Conference of the North
American Chapter of the Association for Computational Linguistics
(NAACL-HLT 2009),
pp. 308-316, Boulder, CO, June 2009.
[PDF]
[BIB]
Aditya Bhargava
and
Grzegorz Kondrak.
Multiple Word Alignment with Profile Hidden Markov Models.
Proceedings of Human Language Technologies: The Annual Conference
of the North American Chapter of the Association for Computational
Linguistics, Companion Volume: Student Research Workshop and Doctoral
Consortium
(NAACL-HLT 2009),
pp. 43-48,
Boulder, CO, June 2009.
[PDF]
[BIB]
Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer, and Grzegorz Kondrak. DirecTL: a Language-Independent Approach to Transliteration. The 1st Named Entities Workshop (NEWS 2009).
2008
Susan Bartlett,
Grzegorz Kondrak
and
Colin Cherry.
Automatic Syllabification with Structured SVMs for
Letter-To-Phoneme Conversion.
46th Annual Meeting of the Association for Computational
Linguistics: Human Language Technologies
(ACL-08: HLT), pp. 568-576,
Columbus, OH, June 2008.
(Best Student Paper Award)
[PDF]
[BIB]
Sittichai Jiampojamarn,
Colin Cherry
and
Grzegorz Kondrak.
Joint Processing and Discriminative Training for Letter-to-Phoneme
Conversion.
46th Annual Meeting of the Association for Computational
Linguistics: Human Language Technologies
(ACL-08: HLT), pp. 905-913,
Columbus, OH, June 2008.
[PDF]
[BIB]
Michelle Annett
and
Grzegorz Kondrak.
A Comparison of Sentiment Analysis Techniques: Polarizing Movie Blogs.
Proceedings of the
Twenty-First Canadian Conference on Artificial Intelligence,
pp. 25-35,
Windsor, ON, May 2008.
(Lecture Notes in
Artificial Intelligence 5032, Springer-Verlag)
[Abstract (HTML)]
[PostScript]
[PDF]
Shane Bergsma
and
Grzegorz Kondrak.
Multilingual Cognate Identification using Integer
Linear Programming.
Proceedings of the
International Workshop on
Acquisition and Management of Multilingual Lexicons,
pp. 11-18, Borovets, Bulgaria, September 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Sittichai Jiampojamarn,
Grzegorz Kondrak
and
Colin Cherry.
Biomedical Term Recognition Using Discriminative Training.
Proceedings of the International Conference on Recent Advances in
Natural Language Processing
(RANLP 2007),
pp. 310-316, Borovets, Bulgaria, September 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
Tarek Sherif
and
Grzegorz Kondrak.
Substring-Based Transliteration.
Proceedings of
the 45th Annual Meeting of the Association for Computational Linguistics
(ACL 2007),
pp. 944-951, Prague, Czech Republic, June 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Shane Bergsma
and
Grzegorz Kondrak.
Alignment-Based Discriminative String Similarity.
Proceedings of
the 45th Annual Meeting of the Association for Computational Linguistics
(ACL 2007),
pp. 656-663, Prague, Czech Republic, June 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Tarek Sherif
and
Grzegorz Kondrak.
Bootstrapping a Stochastic Transducer for Arabic-English
Transliteration Extraction.
Proceedings of
the 45th Annual Meeting of the Association for Computational Linguistics
(ACL 2007),
pp. 864-871, Prague, Czech Republic, June 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
John Nerbonne,
T. Mark Ellison
and
Grzegorz Kondrak.
Computing and Historical Phonology.
Proceedings of the
ACL
Workshop on
Computing and Historical Phonology,
(Ninth Meeting of the ACL
Special Interest Group for Computational
Morphology and Phonology),
pp. 1-5, Prague, Czech Republic, June 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Grzegorz Kondrak,
David Beck
and
Philip Dilts.
Creating a Comparative Dictionary of Totonac-Tepehua.
Proceedings of the
ACL
Workshop on
Computing and Historical Phonology,
(Ninth Meeting of the ACL
Special Interest Group for Computational
Morphology and Phonology),
pp. 134-141, Prague, Czech Republic, June 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Sittichai Jiampojamarn,
Grzegorz Kondrak
and
Tarek Sherif.
Applying Many-to-Many Alignments and HMMs to
Letter-to-Phoneme Conversion.
Proceedings of the
Annual Conference of the North American Chapter of the Association for
Computational Linguistics
(NAACL-HLT 2007),
pp. 372-379, Rochester, NY, April 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Jessica Enright
and
Grzegorz Kondrak.
A Fast Method for Parallel Document Identification.
Proceedings of Human Language Technologies:
The Conference of
the North American Chapter of the Association for Computational Linguistics
(HLT-NAACL 2007))
companion volume, pp. 29-32, Rochester, NY, April 2007.
[Abstract (HTML)]
[PostScript]
[PDF]
[BIB]
Grzegorz Kondrak
and Tarek Sherif.
Evaluation of Several Phonetic Similarity Algorithms
on the Task of Cognate Identification.
Proceedings of the COLING-ACL
Workshop on
Linguistic Distances,
pp. 43-50, Sydney, Australia, July 2006.
[Abstract (HTML)]
[PostScript]
[PDF]
Sittichai Jiampojamarn,
Grzegorz Kondrak
and
Colin Cherry.
Biomedical Term Recognition With the Perceptron HMM Algorithm.
Proceedings of the HLT-NAACL 2006
Workshop on Linking Natural Language
Processing and Biology: Towards Deeper Biological Literature Analysis
(BioNLP'06),
pp. 114-115, New York, June 2006.
[Abstract (HTML)]
[PostScript]
[PDF]
Theresa Jickels
and Grzegorz Kondrak.
Unsupervised Labeling of Noun Clusters.
Proceedings of the Nineteenth Canadian Conference on Artificial Intelligence
(Canadian AI 2006),
pp. 278-287,
Quebec City, June 2006.
(Lecture Notes in Computer
Science 4013, Springer-Verlag)
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak and
Bonnie J. Dorr.
Automatic Identification of Confusable Drug Names.
Artificial
Intelligence in Medicine
36(1), January 2006, pp. 29-42.
[Abstract (HTML)]
Grzegorz Kondrak.
N-gram similarity and distance.
Proceedings of
the Twelfth International Conference on
String Processing and Information Retrieval
(SPIRE 2005),
pp. 115-126, Buenos Aires, Argentina, November 2005.
[Abstract (HTML)]
[PostScript]
[PDF]
Farooq Ahmad
and Grzegorz Kondrak.
Learning a Spelling Error Model from Search Query Logs.
Proceedings of the Human Technology Conference and Conference on
Empirical Methods in Natural Language Processing
(HLT/EMNLP 2005),
pp. 955-962, Vancouver, British Columbia, October 2005.
[Abstract (HTML)]
[PostScript]
[PDF]
Diana Inkpen,
Oana Frunza
and Grzegorz Kondrak.
Automatic Identification of Cognates and False Friends
in French and English.
Proceedings of the International Conference on Recent Advances in
Natural Language Processing
(RANLP 2005),
pp. 251-257, Borovets, Bulgaria, September 2005.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Cognates and Word Alignment in Bitexts.
Proceedings of the Tenth Machine Translation Summit
(MT Summit X),
pp. 305-312, Phuket, Thailand, September 2005.
[Abstract (HTML)]
[PostScript]
[PDF]
Wesley Mackay
and Grzegorz Kondrak.
Computing Word Similarity and Identifying Cognates
with Pair Hidden Markov Models.
Proceedings of the Ninth Conference on Computational Natural Language Learning
(CoNLL 2005),
pp. 40-47, Ann Arbor, Michigan, June 2005.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak and
Bonnie J. Dorr.
Identification of Confusable Drug Names:
A New Approach and Evaluation Methodology.
Proceedings of the
Twentieth International Conference on Computational Linguistics
(COLING 2004)
pp. 952-958, Geneva, Switzerland, August 2004.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Combining Evidence in Cognate Identification.
Proceedings of the Seventeenth Canadian Conference on Artificial Intelligence
(Canadian AI 2004),
pp. 44-59, London, ON, May 2004.
(Lecture Notes in Computer
Science 3060, Springer-Verlag)
(Best Paper Award)
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Phonetic Alignment and Similarity.
Computers and the Humanities
37(3), August 2003, pp. 273-291.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak,
Daniel Marcu
and Kevin Knight.
Cognates Can Improve Statistical Translation Models.
Human Language Technology Conference of
the North American Chapter of the Association for Computational Linguistics
(
HLT-NAACL 2003)
companion volume, pp. 46-48, Edmonton, Alberta, May 2003.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Identifying Complex Sound Correspondences in Bilingual Wordlists.
Proceedings of
the Fourth International Conference on Computational Linguistics
and Intelligent Text Processing
(CICLING 2003),
pp. 432-443, Mexico City, February 2003.
(Lecture Notes in Computer
Science 2588, Springer-Verlag)
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Determining Recurrent Sound Correspondences
by Inducing Translation Models.
Proceedings of
the Nineteenth International Conference on Computational Linguistics
(COLING 2002),
pp. 488-494, Taipei, August 2002.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
Algorithms for Language Reconstruction.
Ph.D Thesis,
University of Toronto, July 2002.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak. Review of Brett Kessler's "The Significance of Word Lists." Computational Linguistics. 27(4), December 2001, pp. 588-591. [PDF]
Grzegorz Kondrak.
Identifying Cognates by Phonetic and Semantic Similarity.
Proceedings of the Second Meeting of
the North American Chapter of the Association for Computational Linguistics
(NAACL 2001),
pp. 103-110, Pittsburgh, June 2001.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak.
A New Algorithm for the Alignment of Phonetic Sequences.
Proceedings of the First Meeting of
the North American Chapter of the Association for Computational Linguistics
(ANLP-NAACL 2000),
pp. 288-295, Seattle, April 2000.
[Abstract (HTML)]
[PostScript]
[PDF]
Grzegorz Kondrak
and Peter van Beek.
A Theoretical Evaluation of Selected Backtracking Algorithms.
Artificial Intelligence Journal
(89)1-2 (1997) pp. 365-387.
[Abstract]
[Full
Text]
Grzegorz Kondrak
and Peter van Beek.
A Theoretical Evaluation of Selected Backtracking Algorithms.
Proceedings of the Fourteenth International Joint Conference
on Artificial Intelligence
(IJCAI),
pp. 541-547, Montreal, August, 1995.
(Best Paper Award)
[Compressed PostScript]
Grzegorz Kondrak.
A Theoretical Evaluation of Selected Backtracking Algorithms.
Master's thesis, University of Alberta, 1994.
Also published as Technical Report TR94-10,
Department of Computing Science, University of Alberta.
[Full Text] (52 pages, 176 kB)