Prof. Dr. Stephanie Evert
Department of German Language and Literature
Chair of Computational Corpus Linguistics
FAU MoD Steering Committee / Vice-Spokesperson
Friedrich-Alexander-Universität Erlangen-Nürnberg
stephanie.evert@fau.de
See my profile at FAU
Computational Linguistics, Corpus Linguistics, Digital Humanities, Data Sciences, Machine Learning, Deep Learning, Qualitative-Quantitative Methods
• Measuring productivity and fixedness in lexico-syntactic constructions
• Measuring Keyness
LinkedIn
X (Twitter)
Publications
2024
Auslegung des KI-VO-E zur Evaluation von Verfahren der Künstlichen Intelligenz am Beispiel der automatischen Anonymisierung von Gerichtsentscheidungen
In: Erich Schweighofer / Stefan Eder / Federico Costantini / Felix Schmautzer / Jonas Pfister (ed.): Sprachmodelle: Juristische Papageien oder mehr? – Tagungsband des 27. Internationalen Rechtsinformatik Symposions IRIS 2024, 2024, p. 205 - 215 , , , :
2023
AUTOMATISCHE ANONYMISIERUNG VON GERICHTSURTEILEN – EINE VISION SCHEINT REALISIERBAR
In: Jusletter IT (2023), p. 211-220
ISSN: 1664-848X
DOI: 10.38023/14A32D75-E299-40D4-9523-3AF8BD445F95 , , , , :
Automatische Anonymisierung von Gerichtsurteilen – Eine Vision scheint realisierbar
In: Erich Schweighofer / Jakob Zanol / Stefan Eder (ed.): Rechtsinformatik als Methodenwissenschaft des Rechts – Tagungsband des 26. Internationalen Rechtsinformatik Symposions IRIS 2023, Editions Weblaw, 2023, p. 211 - 220
ISBN: 978-3-98595-714-9 , , , , :
A reference constructicon as a database
In: Yearbook of the German Cognitive Linguistics Association 11 (2023), p. 175-202
ISSN: 2197-2796
DOI: 10.1515/gcla-2023-0009 , , , :
2022
Entwicklung und Evaluation automatischer Verfahren zur Anonymisierung von Gerichtsentscheidungen
In: LegalTech (2022), p. 233-238
ISSN: 2750-4603
URL: https://beck-online.beck.de/Bcid/Y-300-Z-LTZ-B-2022-S-233-N-1 , , , , :
Manuelle und automatische Anonymisierung von Urteilen
In: Adrian, Axel/Kohlhase, Michael/Evert, Stephanie/Zwickel, Martin (ed.): Digitalisierung von Zivilprozess und Rechtsdurchsetzung, 2022, p. 173-197
ISBN: 978-3-428-18644-0 , , , , , :
Exploring Lexical Diversities
Digital Humanities 2022 (Tokyo, July 25, 2022 - July 29, 2022)
In: Digital Humanities 2022. Conference Abstracts 2022
URL: https://dh2022.dhii.asia/dh2022bookofabsts.pdf , , , , , :
Retrieving Twitter argumentation with corpus queries and discourse analysis
In: Susanne Flach, Martin Hilpert (ed.): Broadening the Spectrum of Corpus Linguistics: New approaches to variability and change, John Benjamins Publishing Company, 2022, p. 229-256 (Studies in Corpus Linguistics, Vol.105)
ISBN: 9789027212665
DOI: 10.1075/scl.105.08dyk , , :
2021
Argument parsing via corpus queries
In: it - Information Technology 63 (2021), p. 31-44
ISSN: 1611-2776
DOI: 10.1515/itit-2020-0051 , , , , :
Anonymisierung von Gerichtsurteilen – Eine wesentliche Voraussetzung für E-Justice –
In: Schweighofer E, Eder S, Hanke P, Kummer F, Saarenpää A (ed.): Cybergovernance - Tagungsband des 24. Internationalen Rechtsinformatik Symposions IRIS 2021, Editions Weblaw, 2021, p. 137 - 149
ISBN: 978-3-96966-452-0
DOI: 10.38023/8a6f3e93-06e9-4655-84ec-ecf2c55db3e1 , , , , :
How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies
IEEE 15th International Conference on Semantic Computing (ICSC) (Laguna Hills, CA, January 27, 2021 - January 29, 2021)
In: IEEE (ed.): 2021 IEEE 15th International Conference on Semantic Computing (ICSC) 2021
DOI: 10.1109/ICSC50631.2021.00068
URL: https://ieeexplore.ieee.org/document/9364527 , , , , , , , :
2020
A new German Reddit corpus
15th Conference on Natural Language Processing, KONVENS 2019 (Erlangen-Nurnberg, October 9, 2019 - October 11, 2019)
In: Proceedings of the 15th Conference on Natural Language Processing, KONVENS 2019 2020 , , , , , :
Reconstructing Arguments from Noisy Text
In: Datenbank-Spektrum 20 (2020), p. 123-129
ISSN: 1618-2162
DOI: 10.1007/s13222-020-00342-y
URL: https://link.springer.com/article/10.1007/s13222-020-00342-y , , , , :
Corpus query lingua franca part II: Ontology
12th International Conference on Language Resources and Evaluation, LREC 2020 (Marseille, May 11, 2020 - May 16, 2020)
In: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis (ed.): LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings 2020 , , , :
Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom
London: 2020 , , (ed.):
Possibilities and Challenges of Corpus-Assisted Discourse Analyses of Austerity in the United Kingdom
In: Griebel T, Evert S, Heinrich P (ed.): Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom, London: Routledge, 2020, p. 1 - 10
DOI: 10.4324/9780367332907-1 , , :
EmpiriST Corpus 2.0: Adding Manual Normalization, Lemmatization and Semantic Tagging to a German Web and CMC Corpus
12th International Conference on Language Resources and Evaluation, LREC 2020 (Marseille, May 11, 2020 - May 16, 2020)
In: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis (ed.): LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings 2020
Open Access: https://www.aclweb.org/anthology/2020.lrec-1.754
URL: https://www.aclweb.org/anthology/2020.lrec-1.754 , , , , , :
2019
Arguing Brexit on Twitter. A corpus linguistic study
European Conference on Argumentation 2019 (Groningen, June 24, 2019 - June 27, 2019) , , :
Reconstructing Twitter arguments with corpus linguistics
ICAME40: Language in Time, Time in Language (Neuchâtel, June 1, 2019 - June 5, 2019) , , :
Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures
In: Journal of Logic, Language and Information (2019), p. 309-330
ISSN: 0925-8531
DOI: 10.1007/s10849-019-09283-6 , , , , , , :
2018
A quantitative evaluation of keyword measures for corpus-based discourse analysis
URL: http://www.stefan-evert.de/PUB/EvertEtc2018_CAD_slides.pdf , , :
A Transnational Analysis of News and Tweets about Nuclear Phase-Out in the Aftermath of the Fukushima Incident
Workshop on Computational Impact Detection from Text Data (Miyazaki, May 8, 2018 - May 8, 2018)
In: Andreas Witt, Jana Diesner, Georg Rehm (ed.): Proceedings of the LREC 2018 “Workshop on Computational Impact Detection from Text Data”, Paris: 2018 , , , , :
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods
11th Language Resources and Evaluation Conference (Miyazaki, May 7, 2018 - May 12, 2018)
In: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (ed.): Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki: 2018
Open Access: http://www.lrec-conf.org/proceedings/lrec2018/pdf/835.pdf
URL: http://www.lrec-conf.org/proceedings/lrec2018/pdf/835.pdf , , , , , :
EmotiKLUE at IEST 2018: Topic-Informed Classification of Implicit Emotions
9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (Brüssel, October 31, 2018 - October 31, 2018)
In: Balahur A, Mohammad SM, Hoste V, Klinger R (ed.): Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Brussels: 2018
DOI: 10.18653/v1/w18-6234
URL: http://aclweb.org/anthology/W18-6234 , , , :
Collocation Candidate Extraction from Dependency-Annotated Corpora: Exploring Differences across Parsers and Dependency Annotation Schemes
In: Cantos-Gómez P, Almela-Sánchez M (ed.): Lexical Collocation Analysis: Advances and Applications, Cham: Springer International Publishing, 2018, p. 111–140
ISBN: 978-3-319-92582-0
DOI: 10.1007/978-3-319-92582-0_6 , , :
2017
»Delta« in der stilometrischen Autorschaftsattribution
In: Zeitschrift für digitale Geisteswissenschaften (2017)
ISSN: 2510-1358
DOI: 10.17175/2017_006
URL: http://www.zfdg.de/2017_006 , , , , , , , , :
Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures
Logic and Algorithms in Computational Linguistics 2017 (LACompLing2017) (Stockholm, August 16, 2017 - August 19, 2017)
In: Loukanova R, Liefke K (ed.): Proceedings of the Workshop on Logic and Algorithms in Computational Linguistics 2017 (LACompLing2017), Stockholm: 2017
URL: http://su.diva-portal.org/smash/get/diva2:1140018/FULLTEXT03.pdf , , , , , :
The impact of translation direction on characteristics of translated texts. A multivariate analysis for English and German
In: De Sutter G, Lefer M, Delaere I (ed.): Empirical Translation Studies. New Theoretical and Methodological Traditions, Berlin: Mouton de Gruyter, 2017, p. 47-80 (Trends in Linguistics. Studies and Monographs (TiLSM), Vol.300)
ISBN: 978-3-11-045958-6
URL: http://www.stefan-evert.de/PUB/EvertNeumann2017/ , :
Understanding and explaining Delta measures for authorship attribution
In: Digital Scholarship in the Humanities 32 (2017), p. ii4–ii16
ISSN: 2055-7671
DOI: 10.1093/llc/fqx023 , , , , , , :
E-VIEW-Alation – a Large-Scale Evaluation Study of Association Measures for Collocation Identification
eLex 2017 (Leiden, September 19, 2017 - September 21, 2017)
In: Iztok K, Carole T, Miloš J, Jelena K, Simon K, and Vít B (ed.): Electronic Lexicography in the 21st Century. Proceedings of the eLex 2017 Conference, Brno: 2017
Open Access: https://elex.link/elex2017/wp-content/uploads/2017/09/paper32.pdf
URL: https://elex.link/elex2017/wp-content/uploads/2017/09/paper32.pdf , , , :
Reliable measures of syntactic and lexical complexity: The case of Iris Murdoch
In: Proceedings of the Corpus Linguistics 2017 Conference, Birmingham, UK: 2017
URL: http://purl.org/stefan.evert/PUB/EvertWankerlNoeth2017.pdf , , :
Large-scale evaluation of dependency-based DSMs: Are they worth the effort?
In: Proceedings of the 15th Annual Meeting of the European Association for Computational Linguistics (EACL 2017): Volume 2, Short Papers, Valencia, Spain: 2017
Open Access: https://www.aclweb.org/anthology/E/E17/E17-2063.pdf
URL: http://www.linguistik.fau.de/dsmeval/ , :
Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics
Shared Task on Translation Inference Across Dictionaries (Galway, June 18, 2017 - June 18, 2017)
In: McCrae J, Bond F, Buitelaar P, Cimiano P, Declerck T, Gracia J, Kernerman I, Ponsoda E, Ordan N, Piasecki M (ed.): Proceedings of the LDK 2017 Workshops: 1st Workshop on the OntoLex Model (OntoLex-2017), Shared Task on Translation Inference Across Dictionaries & Challenges for Wordnets 2017
Open Access: http://ceur-ws.org/Vol-1899/TIAD17_paper_1.pdf
URL: http://ceur-ws.org/Vol-1899/TIAD17_paper_1.pdf , , , :
Japan's 2014 General Election: Political Bots, Right-Wing Internet Activism and PM Abe Shinzō’s Hidden Nationalist Agenda
In: Big Data 5 (2017), p. 1 - 16
ISSN: 2167-6461 , , :
2016
CogALex-V Shared Task: Mach5 – A traditional DSM approach to semantic relatedness
In: Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V), Osaka, Japan: 2016
Open Access: http://aclweb.org/anthology/W16-5312
URL: http://www.collocations.de/data/#mach5 :
EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora
54th Annual Meeting of the Association for Computational Linguistics (ACL 2016) (Berlin)
In: Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task, Berlin, Germany: 2016
Open Access: https://aclweb.org/anthology/W/W16/W16-2606.pdf
URL: https://sites.google.com/site/empirist2015/ , , , :
A Distributional Approach to Open Questions in Market Research
In: Computers in Industry 78 (2016), p. 16-28
ISSN: 0166-3615
DOI: 10.1016/j.compind.2015.10.008 , , , :
„Delta“ in der stilometrischen Autorschaftsattribution
DHd 2016 (Leipzig, March 7, 2016 - March 12, 2016)
In: DHd 2016. Konferenzabstracts, Leipzig: 2016
Open Access: http://www.dhd2016.de/abstracts/sektionen-002.html
URL: http://www.dhd2016.de/abstracts/sektionen-002.html , , , , , , , , :
The CogALex-V Shared Task on the Corpus-Based Identification of Semantic Relations
In: Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V), Osaka, Japan: 2016
Open Access: http://aclweb.org/anthology/W16-5309
URL: https://sites.google.com/site/cogalex2016/home/shared-task , , , :
An Analysis of Perplexity to Reveal the Effects of Alzheimer's Disease on Language
In: ITG-Fachbericht 267: Speech Communication, Paderborn, Germany: 2016 , , :
2015
Some theoretical and experimental observations on naïve discriminative learning
In: Proceedings of the 6th Conference on Quantitative Investigations in Theoretical Linguistics (QITL-6), Tübingen, Germany: 2015 , :
Ziggurat: A new data model and indexing format for large annotated text corpora
In: Proceedings of the 3rd Workshop on the Challenges in the Management of Large Corpora (CMLC-3), Lancaster, UK: 2015
Open Access: http://ids-pub.bsz-bw.de/files/3826/Evert_Hardie_Ziggurat_A_new_data_model_and_indexing_format_2015.pdf , :
Towards a better understanding of Burrows's Delta in literary authorship attribution
In: Proceedings of the Fourth Workshop on Computational Linguistics for Literature, Denver, CO: 2015
Open Access: http://www.aclweb.org/anthology/W15-0709
URL: http://www.aclweb.org/anthology/W15-0709 , , , , , :
KLUEless: Polarity Classification and Association
In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado: 2015
URL: http://www.aclweb.org/anthology/S15-2103 , , , , , , :
SemantiKLUE: Semantic Textual Similarity with Maximum Weight Matching
In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado: 2015
URL: http://www.aclweb.org/anthology/S15-2020 , , , :
2014
Towards a Firthian Notion of Collocation
In: Abel A, Lemnitzer L (ed.): Vernetzungsstrategien, Zugriffsstrukturen und automatisch ermittelte Angaben in Internetwörterbüchern, Mannheim: Institut für Deutsche Sprache, 2014, p. 48–61 (OPAL – Online publizierte Arbeiten zur Linguistik)
Open Access: http://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2402 , :
A weakly supervised multivariate approach to the study of language variation
In: Szmrecsanyi B, Wälchli B (ed.): Aggregating Dialectology, Typology, and Register Analysis. Linguistic Variation in Text and Speech, Berlin, Boston: De Gruyter, 2014, p. 174–204 (Linguae et Litterae: Publications of the School of Language and Literature, Freiburg Institute for Advanced Studies)
URL: http://www.degruyter.com/viewbooktoc/product/207699 , , :
Distributional Semantics in R with the wordspace Package
In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: System Demonstrations, Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/C14-2024
URL: http://wordspace.r-forge.r-project.org :
SentiKLUE: Updating a polarity classifier in 48 hours
In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval-2014), Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/S14-2096
URL: http://www.aclweb.org/anthology/S14-2096 , , , :
A Large Scale Evaluation of Distributional Semantic Models: Parameters, Interactions and Model Selection
In: Transactions of the Association for Computational Linguistics 2 (2014), p. 531–545
ISSN: 2307-387X
Open Access: https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/457
URL: http://www.linguistik.fau.de/dsmeval/ , :
NaDiR: Naive Distributional Response Generation
In: Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex), Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/W14-4707
URL: http://www.aclweb.org/anthology/W14-4707 , :
Contrasting Syntagmatic and Paradigmatic Relations: Insights from Distributional Semantic Models
In: Proceedings of the Third Joint Conference on Lexical and Computational Semantics (*SEM 2014), Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/S14-1020
URL: http://www.aclweb.org/anthology/S14-1020 , , :
SemantiKLUE: Robust semantic similarity at multiple levels using maximum weight matching
In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval-2014), Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/S14-2093
URL: http://www.aclweb.org/anthology/S14-2093 , , , :
SNAP: A Multi-Stage XML-Pipeline for Aspect Based Sentiment Analysis
In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), Dublin, Ireland: 2014
Open Access: http://www.aclweb.org/anthology/S14-2101
URL: http://www.aclweb.org/anthology/S14-2101 , , , , , , , , , , :
2013
Conditional automaticity in subliminal morphosyntactic priming
In: Psychological research 77 (2013), p. 399–421
ISSN: 0340-0727 , , , , :
Scalable Construction of High-Quality Web Corpora
In: Journal for language technology and computational linguistics 28 (2013), p. 23–59
ISSN: 0175-1336
Open Access: http://www.jlcl.org/2013_Heft2/2Biemann.pdf , , , , , , , , :
Tools for the acquisition of lexical combinatorics
In: Gouws RH, Heid U, Schweickard W, Wiegand HE (ed.): Dictionaries. An International Encyclopedia of Lexicography. Supplementary volume: Recent Developments with Focus on Electronic and Computational Lexicography (HSK 5.4), Berlin, New York: Mouton de Gruyter, 2013, p. 1415–1432 :
KLUE-CORE: A regression model of semantic textual similarity
In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, Atlanta, Georgia, USA: 2013
Open Access: http://www.aclweb.org/anthology/S13-1026
URL: http://aclweb.org/anthology/S13-1026 , , , :
Evaluating Neighbor Rank and Distance Measures as Predictors of Semantic Priming
In: Proceedings of the ACL Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2013), Sofia, Bulgaria: 2013
Open Access: https://www.aclweb.org/anthology/W/W13/W13-2608.pdf , :
KLUE: Simple and robust methods for polarity classification
In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013), Atlanta, GA: 2013
Open Access: http://www.aclweb.org/anthology/S13-2065
URL: http://aclweb.org/anthology/S13-2065 , , , :
2012
Adjectives as Saturators vs. Modifiers: Statistical Evidence
In: Aloni M, Kimmelman V, Roelofsen F, Sassoon GW, Schulz K, Westera M (ed.): Logic, Language and Meaning. Proceedings of the 18th Amsterdam Colloquium, Berlin, Heidelberg: Springer, 2012, p. 112–121 (Lecture Notes in Computer Science, Vol.7218)
ISBN: 978-3-642-31481-0
DOI: 10.1007/978-3-642-31482-7_12 , , , :
2011
Focus Marking via Gestures
In: Reich I (ed.): Proceedings of Sinn & Bedeutung 15, Saarbrücken, Germany: 2011 , , :
Twenty-first century Corpus Workbench: Updating a query architecture for the new millennium
In: Proceedings of the Corpus Linguistics 2011 Conference, Birmingham, UK: 2011
Open Access: http://www.birmingham.ac.uk/documents/college-artslaw/corpus/conference-archives/2011/Paper-153.pdf , :
Asymmetry in Corpus-Derived and Human Word Associations
In: Corpus linguistics and linguistic theory 7 (2011), p. 245–276
ISSN: 1613-7027 , , :
2010
Google Web 1T5 N-Grams Made Easy (but not for the computer)
In: Proceedings of the 6th Web as Corpus Workshop (WAC-6), Los Angeles, CA: 2010
Open Access: http://aclweb.org/anthology/W/W10/W10-1505.pdf :
Special issue on multiword expressions: hard going or plain sailing?
In: Language Resources and Evaluation 44 (2010)
ISSN: 1574-020X , , , , :
2009
Semantik
In: Carstensen K, Ebert C, Ebert C, Jekat S, Klabunde R, Langer H (ed.): Computerlinguistik und Sprachtechnologie: Eine Einführung, Heidelberg: Spektrum Akademischer Verlag, 2009, p. 330-393 , , , :
Statistische Grundlagen
In: Carstensen K, Ebert C, Ebert C, Jekat S, Klabunde R, Langer H (ed.): Computerlinguistik und Sprachtechnologie: Eine Einführung, Heidelberg: Spektrum Akademischer Verlag, 2009, p. 114-158
URL: http://www.cl.uzh.ch/CL/CLBuch/ , , :
Part-of-Speech Tagging – A Solved Task? An evaluation of POS taggers for the Web as corpus
In: Alegria I, Leturia I, Sharoff S (ed.): Proceedings of the 5th Web as Corpus Workshop (WAC5), San Sebastian, Spain: 2009
Open Access: https://www.sigwac.org.uk/attachment/wiki/WAC5/WAC5_proceedings.pdf
URL: http://purl.org/stefan.evert/PUB/GiesbrechtEvert2009_Tagging.pdf , :
2008
Statistical methods for corpus exploitation
In: Lüdeling A, Kytö M (ed.): Corpus Linguistics. An International Handbook, Berlin, New York: Mouton de Gruyter, 2008, p. 777-803 , :
A lightweight and efficient tool for cleaning Web pages
In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco: 2008
URL: http://purl.org/stefan.evert/PUB/Evert2008_NCleaner.pdf :
Corpora and collocations
In: Lüdeling A, Kytö M (ed.): Corpus Linguistics. An International Handbook, Berlin, New York: Mouton de Gruyter, 2008, p. 1212-1248 :
Corpus Linguistics with BNCweb – a Practical Guide
Frankfurt am Main: Peter Lang, 2008
(English Corpus Linguistics, Vol.6)
ISBN: 978-3-631-56315-1
URL: http://corpora.lancs.ac.uk/BNCweb/ , , , , :
2007
Words and Echoes: Assessing and Mitigating the Non-Randomness Problem in Word Frequency Distribution Modeling
In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic: 2007
Open Access: http://aclweb.org/anthology/P/P07/P07-1114.pdf , :
zipfR: Word Frequency Distributions in R
In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Posters and Demonstrations Sessions, Prague, Czech Republic: 2007
Open Access: http://aclweb.org/anthology/P/P07/P07-2008.pdf , :
2006
How Random is a Corpus? The Library Metaphor
In: Zeitschrift für Anglistik und Amerikanistik 54 (2006), p. 177-190
ISSN: 0044-2305
URL: http://purl.org/stefan.evert/PUB/Evert2006.pdf :
2005
The NITE XML Toolkit: data model and query language
In: Language Resources and Evaluation 39 (2005), p. 313-334
ISSN: 1574-020X , , , , :
Using Small Random Samples for the Manual Evaluation of Statistical Association Measures
In: Computer Speech and Language 19 (2005), p. 450-466
ISSN: 0885-2308
URL: http://purl.org/stefan.evert/PUB/EvertKrenn2005.pdf , :
The emergence of productive non-medical -itis. Corpus evidence and qualitative analysis
In: Kepser S, Reis M (ed.): Linguistic Evidence. Empirical, Theoretical, and Computational Perspectives, Berlin: Mouton de Gruyter, 2005, p. 351-370
URL: http://purl.org/stefan.evert/PUB/LuedelingEvert2005.pdf , :
2004
A Simple LNRE Model for Random Character Sequences
In: Proceedings of the 7èmes Journées Internationales d'Analyse Statistique des Données Textuelles (JADT 2004), Louvain-la-Neuve, Belgium: 2004
URL: http://purl.org/stefan.evert/PUB/Evert2004a.pdf :
Significance tests for the evaluation of ranking methods
In: Proceedings of the 20th International Conference on Computational Linguistics (Coling 2004), Geneva, Switzerland: 2004
Open Access: http://aclweb.org/anthology/C/C04/C04-1136.pdf :
The Statistics of Word Cooccurrences: Word Pairs and Collocations (Dissertation, 2004)
URL: http://www.collocations.de/phd.html :
Determining Intercoder Agreement for a Collocation Identification Task
In: Proceedings of KONVENS 2004, Vienna, Austria: 2004
URL: http://purl.org/stefan.evert/PUB/KrennEvertZinsmeister2004.pdf , , :
2000
Searchable Metaspaces
In: Proceedings of the EAGLES/ISLE Workshop on Metadata, Athens, Greece: 2000 , , :
Proceedings of the 9th EURALEX International Congress
Stuttgart, Germany: 2000 , , , (ed.):
Methoden zum Vergleich von Signifikanzmaßen zur Kollokationsidentifikation
In: Zühlke W, Schukat-Talamazzini EG (ed.): KONVENS-2000 Sprachkommunikation, Ilmenau, Germany: 2000 , , :
On Measuring Morphological Productivity
In: Zühlke W, Schukat-Talamazzini EG (ed.): KONVENS-2000 Sprachkommunikation, Ilmenau, Germany: 2000 , , :
A data collection for semi-automatic corpus-based updating of dictionaries
In: Heid U, Evert S, Lehmann E, Rohrer C (ed.): Proceedings of the 9th EURALEX International Congress, Stuttgart, Germany: 2000 , , , , :
You might like!
• Our FAU MoD Community
• About us
• FAU MoD Lecture Series
• Upcoming events
Trends in Mathematical Sciences
Next June 10 - 14, 2024, our FAU MoD, Researcher Center for Mathematics of Data at FAU, Friedrich-Alexander-Universität Erlangen-Nürnberg is ...
FAU MoD Lecture: Using system knowledge for improved sample efficiency in data-driven modeling and control of complex technical systems
Date: Wed. May 15, 2024 Event: FAU MoD Lecture Organized by: FAU MoD, the Research Center for Mathematics of Data ...
CIN-PDE 2024 Workshop on Control, Inversion and Numerics for PDEs
Next October, the Fudan University is hosting the 2nd. edition of the Workshop on Control, Inversion and Numerics for PDEs ...