Books Authored

  • Anna Feldman and Jirka Hana. 2010. A Resource-light Approach to Morphosyntactic Tagging. Language and Computers 70: Studies in Practical Linguistics, eds. Christian Mair, Charles F. Meyer & Nelleke Oostdijk, Rodopi Press, Amsterdam-New York. XIV, 185 pp. ISBN: 978-90-420-2768-8.

Edited Volumes 



Peer-reviewed papers


  • Lee, P., Trujillo, A. C., Plancarte, D. C., Ojo, O. E., Liu, X., Shode I., Zhao Y., J. Peng and A. Feldman. 2024. MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms. To appear in the Findings of the ACL: EACL 2024. [pdf]
  • Shode I., Adelani D.I., Peng J., Feldman A. 2023. NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification. In Proceedings of ACL 2023, pages 986-998.[pdf]
  • Lee P., Shode I., Trujillo A.C., Zhao Y., Ojo O.E., Cuevas Plancarte D., Feldman A., and J. Peng. 2023. FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM). [pdf]
  • Lee P., Gavidia M., Feldman A., Peng J. 2022. Searching for PETs: Using distributional and sentiment-based methods to find potentially euphemistic terms. In Proceedings of the Second Workshop on Understanding Implicit and Underspecified Language [pdf]
  • Gavidia M., Lee P., Feldman A., Peng J. 2022. CATs are fuzzy PETs: A corpus and analysis of potentially euphemistic terms. LREC 2022.[pdf]
  • Shode I., Adelani D.I., Feldman A. 2022. YOSM: A new Yoruba sentiment corpus for movie reviews. In 3rd Workshop on African Natural Language Processing.[pdf]
  • Lee P., Feldman A., Peng J. 2022. A Report on the Euphemisms Detection Shared Task. In Third Workshop on Processing Figurative Language (at EMNLP 2022).[pdf]
  • Feldman A. 2022. Hard Nut to Crack: Automatic Idiom Detection. In Interférences littéraires/Literaire interferenties. ISSN 2031-2970.
  • Shaar S., Alam F., Da San Martino G., Nikolov A., Zaghouani W., Nakov P., and A. Feldman. 2021. Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19 Infodemic and Censorship Detection. In Fourth NAACL 2021 Workshop on Natural Language Processing for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda, June 6, 2021. [pdf]
  • Ducret M., Kruse L., Martinez C., Feldman A., and J. Peng. 2021. You Don’t Say... Linguistic Features in Sarcasm Detection. CLIC-IT 2021: Seventh Italian Conference on Computational Linguistics, Bologna, 1-3 March 2021. [pdf]
  • Ng Kei Y., Feldman A., and J. Peng. 2020. Linguistic Fingerprints of Internet Censorship: The Case of Sina Weibo. Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20). New York, NY. February, 2020 [pdf][appendix]
  • Bhagat P., A. Varde, and A. Feldman. 2019. WordPrep: Word-based Preposition Prediction Tool. IEEE International Conference on Big Data. 4th Special Session on Intelligent Data Mining. December, Los Angeles, CA.
  • Ng Kei Y., Feldman A., Peng J., and C. Leberknight 2019. Neural Network Prediction of Censorable Language. In Proceedings of the 3rd Workshop on NLP and Computational Social Science (NLP+CSS) held in conjunction with NAACL 2019 [pdf]
  • C. Leberknight and Feldman A. 2019. Leveraging NLP and Social Network Analytic Techniques to Detect Censored Keywords: System Design and Experiments. In Proceedings of the 52nd Hawaii International Conference on System Sciences. [pdf]
  • E. Joyce, Goldeck M., Leberknight C., and Feldman A. 2018. Apollo: A System for Tracking Internet Censorship. In Proceedings of the 13th Pre-ICIS Workshop on Information Security and Privacy. [pdf]
  • Ng Kei Y., Feldman A., Peng J. and C. Leberknight. 2018. Linguistic Characteristics of Censorable Language on Sina Weibo. In Proceedings of the COLING 1st Natural Language Processing for Information Freedom workshop. [pdf] [dataset]
  • Ng Kei Y., Feldman A., and C. Leberknight. 2018. Detecting Censorable Content in Social Media: A Pilot Study. In Proceedings of Natural Language Processing for Social Media Analysis (NLP4SMA) 2018. 10th Hellenic Conference on Artificial Intelligence (SETN-2018). [pdf]
  • Kateryna Kaplun, Christopher Leberknight, Anna Feldman. 2018. Controversy and Sentiment: An Exploratory Study. In Proceedings of Natural Language Processing for Social Media Analysis (NLP4SMA) 2018. 10th Hellenic Conference on Artificial Intelligence (SETN-2018). [pdf]
  • Kateryna Kaplun, Christopher Leberknight, Anna Feldman. 2018. A Comparison of Lexicons for Detecting Controversy. In Proceedings of the LREC 2018 Workshop: Natural Language Processing meets Journalism III, Miyazaki (Japan). [pdf]
  • Katsiaryna Aharodnik, Anna Feldman, Jing Peng. 2018. Designing a Russian Idiom-Annotated Corpus. In Proceedings of the 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan). [pdf]
  • Jing Peng, Katsiaryna Aharodnik, Anna Feldman. 2018. A Distributional Semantics Model for Idiom Detection: The Case of English and Russian. In Proceedings of the 10th International Conference on Agents and Artificial Intelligence (Special Session on Natural Language Processing in Artificial Intelligence - NLPinAI 2018). [pdf]
  • Manali Pradhan, Jing Peng, Anna Feldman, Bianca Wright. 2017. Idioms: Humans or machines, it’s all about context. In Proceedings of the 18th International Conference on Computational Linguistics and Intelligent Text Processing. Budapest, Hungary. [preprint] Best paper award, 2nd place; best presentation award, 1st place.
  • Jing Peng, Anna Feldman. 2016. Experiments in Idiom Recognition. In Proceedings of the 26th International Conference on Computational Linguistics (COLING). Osaka, Japan. [pdf]
  • Jing Peng and Anna Feldman. 2016.  “Automatic Idiom Recognition with Word Embeddings.” Communications in Computer and Information Science, Vol. 656. Springer.
  • Jing Peng, Anna Feldman. 2016. In God We Trust. All Others Must Bring Data. — W. Edwards Deming — Using word embeddings to recognize idioms. In Proceedings of the 3rd Annual International Symposium on Information Management and Big Data — SIMBig, Cusco, Peru. [pdf]
  • Jing Peng, Anna Feldman, and Hamza Jazmati. 2015.  Classifying Idiomatic and Literal Expressions Using Vector Space Representations. In Proceedings of the Recent Advances in Natural Language Processing (RANLP) conference 2015, Hissar, Bulgaria, September 2015. [pdf]
  • Jing Peng, Anna Feldman, and Ekaterina Vylomova. 2014. Classifying Idiomatic and Literal Expressions Using Topic Models and Intensity of Emotions. In Proceedings of the 2014 Empirical Methods for Natural Language Processing Conference (EMNLP). [pdf]
  • Rosen A., J. Hana, B. Stindlova, A. Feldman. 2013. Evaluating and automating the annotation of a learner corpus. In Language Resources and Evaluation 47 (April). Springer. DOI: 10.1007/s10579-013-9226-3
  • Katsiaryna Aharodnik, Marco Chang, Anna Feldman and Jirka Hana. 2013. Automatic identification of learners' language background based on their writing in Czech. In Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP) 2013.[pdf]
  • Anna Feldman and Jing Peng. 2013. Automatic Detection of Idiomatic Clauses. In Proceedings of Computational Linguistics and Intelligent Text Processing (CICLing 2013), Part I, LNCS 7816, pp. 435-446, Springer Heidelberg. [preprint]. Best paper award, 1st place.
  • Jirka Hana and Anna Feldman. 2012. Resource-light approaches to computational morphology. Part I: Monolingual Approaches. In Language and Linguistics Compass Journal (Computational Linguistics Section). Vol. 6, Issue 10, pp. 622-634, Blackwell. [preprint]
  • Jirka Hana, Boris Lehecka, Anna Feldman, Alena Cerna, Karel Oliva. 2012. Building a corpus of Old Czech. In Adaptation of Language Resources and Tools for Processing Cultural Heritage Objects Workshop associated with the LREC 2012 Conference (21-27 May 2012). [pdf]
  • Anna Feldman. 2012. Review of Roark & Sproat's Computational approaches to morphology and syntax. In Word Structure, 5:2, Edinburgh University Press.
  • Jirka Hana, Anna Feldman, and Katsiaryna Aharodnik. 2011. A low-budget tagger for Old Czech. In Proceedings of the 5th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2011) held in conjunction with the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL/HLT 2011).[pdf]
  • Jing Peng, Anna Feldman, and Laura Street. 2010. Computing Linear Discriminants for Idiomatic Sentence Detection. In Research in Computing Science, Special issue: Natural Language Processing and its Applications, Vol. 46, pp. 17-28, Instituto Politécnico Nacional, Centro de Investigación en Computación, Mexico 2010, ISSN 1870-4069. [pdf]
  • Jirka Hana and Anna Feldman. 2010. Challenges of Cheap Resource Creation. In Proceedings of the 4th Linguistic Annotation Workshop held in conjunction with ACL 2010.[pdf]
  • Amal Kaluarachchi, Aparna Varde, Srikanta Bedathur, Gerhard Weikum, Jing Peng and Anna Feldman 2010. Incorporating Terminology Evolution for Query Translation in Text Retrieval with Association Rules. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM). [pdf]
  • Amal C. Kaluarachchige, Aparna Varde, Jing Peng, and Anna Feldman. 2010. Intelligent Time-Aware Query Translation for Text Sources. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI-10). [pdf]
  • Jirka Hana and Anna Feldman. 2010. A New Positional Tagset System for Russian.  In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010). [pdf]
  • Laura Street, Rachel Silverstein, Nathan Michalov, Felicia Flowers, Angela Talucci, Michael Reynolds, Priscilla Pereira, Gabriella Morgon, Samantha Siegel, Marci Barousse, Lurdes Ruela, Antequa Anderson, Tashom Carroll, and Anna Feldman. 2010. Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010). [BA/MA Fall 09 Class project]. [pdf]
  • Hiroki Yamakawa, Jing Peng, and Anna Feldman. 2010. Semantic Enrichment of Text Representation with Wikipedia for Text Classification. In Proceedings of the 2010 IEEE Conference on Systems, Man and Cybernetics (SMC2010), Istanbul, Turkey. [pdf]
  • Anna Feldman. 2010. Book Notice of Rasinger, Sebastian M.'s Quantitative Research in Linguistics. In Studies in Language, 24:1, pp. 212-214, John Benjamins.
  • Ghazi Abuhakema, Anna Feldman, and Eileen Fitzpatrick. 2009. ARIDA: An Arabic Interlanguage Database and Its Applications: A Pilot Study. Journal of the National Council of Less Commonly Taught Languages (NCOLCTL). Vol. 7, pp. 161-184.  
  • Anna Feldman and Jing Peng. 2009. An Approach to Automatic Figurative Language Detection: A Pilot Study. In Proceedings of the Corpus-Based Approaches for Figurative Language Colloquium held in conjunction with the Corpus Linguistic 2009 endorsed by Researching and Applying Metaphor (RaAM), Liverpool, UK. ISSN 1368-9223 [pdf]
  • Anna Feldman. 2008. Tagset Design, Inflected Languages, and N-gram Tagging. The Linguistics Journal, 3(1), pp. 155-177, Time Taylor International, ISSN: 1718-2298, ISSN Print: 1718-2301.
  • Serge Sharoff, Mikhail Kopotev, Tomaž Erjavec, Anna Feldman and Dagmar Divjak. 2008. Designing and Evaluating a Russian Tagset. In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), Marrakech (Morocco). [pdf]
  • Ghazi Abuhakema, Reem Faraj, Anna Feldman, and Eileen Fitzpatrick. 2008. Annotating an Arabic Learner Corpus for Error. In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), Marrakech (Morocco). [pdf]
  • Anna Feldman, Ghazi Abuhakema, and Eileen Fitzpatrick. 2008. ARIDA: An Arabic Interlanguage Database and Its Applications: A Pilot Study. In Proceedings of the 21st International Florida Artificial Intelligence Research Society Conference (FLAIRS-08). Coconut Grove, FL: AAAI Press. [pdf]
  • Stefan Dyła and Anna Feldman. 2008. "On Comitative Constructions in Polish and Russian." In Zybatow, Gerhild et al. (eds.). Formal Description of Slavic Languages: The Fifth Conference, Leipzig 2003. Peter Lang: Frankfurt am Main. [pdf]
  • Anna Feldman and Katya Arshavskaya. 2007. "Russian and English Event Annotation: A Pilot Study". Annotating Variation and Change. Studies in Variation, Contacts and Change in English, Vol. 1, Meurman-Solin, Anneli & Arja Nurmi (eds.) 
  • Anna Feldman, Jirka Hana, and Chris Brew. 2006. "A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources". In Proceedings of the fifth international conference on Language Resources and Evaluation (LREC 2006). Genoa, Italy. [pdf]
  • Jirka Hana, Anna Feldman, Luiz Amaral, and Chris Brew. 2006. "Tagging Portuguese with a Spanish Tagger Using Cognates". In Proceedings of the Workshop on Cross-language Knowledge Induction hosted in conjunction with the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2006). Trento, Italy. pp. 33-40. [pdf]
  • Anna Feldman, Jirka Hana, and Chris Brew. 2006. "Experiments in Morphological Annotation Transfer". In Proceedings of Computational Linguistics and Intelligent Text Processing, CICLing, A. Gelbukh (editor), Lecture Notes in Computer Science, Springer-Verlag, 2006. pp. 41-50. [pdf]
  • Anna Feldman. 2006. Book Review of Bolshakov, Igor A. and Alexander Gelbukh's Computational Linguistics: Models, Resources, Applications. In Computational Linguistics, 32(3), pp. 443-444, MIT Press.
  • Anna Feldman, Jirka Hana, and Chris Brew. 2005. "Buy One, Get One Free or What to Do When Your Linguistic Resources are Limited". In Proceedings of the third international seminar on Computer Treatment of Slavic and East-European Languages (Slovko 2005). Bratislava, Slovakia.
  • Jirka Hana, Anna Feldman, and Chris Brew. 2004. "A Resource-light Approach to Russian Morphology: Tagging Russian using Czech resources". In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP). pp. 222-229. [pdf]
  • Jirka Hana and Anna Feldman. 2004. "Portable Language Technology: Russian via Czech." In Proceedings of the 2004 Midwest Computational Linguistics Colloquium. Bloomington, Indiana. [pdf]
  • Anna Feldman. 2003. "Kim and Sandy, Kim with Sandy, Just Me or Both of Us?" In Proceedings of European Summer School of Logic, Language, and Information (ESSLLI 2002). Trento, Italy. [pdf]
  • Anna Feldman. 2003. "Comitative and Plural Pronoun Constructions". In Proceedings of the 17th Annual Meeting of the Israel Association of Theoretical Linguistics (IATL). Jerusalem, Israel. 2001. http://atar.mscc.huji.ac.il/~english/IATL/17/TOC.html
  • Anna Feldman. 2003. "On S-Coordination and Plural Pronoun Constructions". In Balkan and Slavic Linguistics, vol. 2, ed. Daniel E. Collins and Andrea D. Sims, The Ohio State University. pp. 49-75.
  • Anna Feldman. 2002. "On NP-coordination." The UiL OTS 2002 Yearbook. Utrecht, the Netherlands. pp. 39-66. [pdf]
  • Anna Feldman. Book review, 2001. Bresnan, Joan. (2001) Lexical Functional Syntax, Oxford: Blackwell Publishers. Linguist List. http://linguistlist.org/issues/12/12-2230.html#1
  • Anna Feldman. 2000. "Discourse Markers -- Accessing 'Hearer-Old' Information: The Case of Russian 'Zhe'". In Proceedings of the 27th Linguistic Association of Canada and the United States (LACUS) Forum. Houston, Texas. pp. 187-202.


Other Publications

  • Jirka Hana and Anna Feldman. 2013. European Summer School for Logic, Language, and Information (ESSLLI 2013). Course: Computational Morphology. August 5-9, 2013. Course package. 74pp.
  • Anna Feldman and Jirka Hana. 2010. European Summer School for Logic, Language, and Information (ESSLLI 2010). Course: Resource-light Morphological Analysis of Highly Inflected Languages. August 9-20, 2010. Course package. 80pp.
  • Jirka Hana and Anna Feldman. 2008. Manual for Morphological Annotation with Positional Tags. Published online.