This page offers a selection of literature on distinctiveness measures that is particularly well suited as an introduction to the topic. The project bibliography (still under construction) is available on Zotero.

The list of recommended literature can be found here.

Search the bibliography:

Achananuparp, Palakorn, Xiaohua Hu, and Xiajiong Shen, ‘The Evaluation of Sentence Similarity Measures’, in Data Warehousing and Knowledge Discovery, ed. by Il-Yeol Song, Johann Eder, and Tho Manh Nguyen (Berlin, Heidelberg: Springer Berlin Heidelberg, 2008), 5182, 305–16 <https://doi.org/10.1007/978-3-540-85836-2_29>
Albitar, Shereen, Sébastien Fournier, and Bernard Espinasse, ‘An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification’, in Web Information Systems Engineering – WISE 2014, ed. by Boualem Benatallah, Azer Bestavros, Yannis Manolopoulos, Athena Vakali, and Yanchun Zhang (Cham: Springer International Publishing, 2014), 105–14 <https://doi.org/10.1007/978-3-319-11749-2_8>
Altmann, Eduardo G., Janet B. Pierrehumbert, and Adilson E. Motter, ‘Beyond Word Frequency: Bursts, Lulls, and Scaling in the Temporal Distributions of Words’, ed. by Enrico Scalas, PLoS ONE, 4.11 (2009), e7678 <https://doi.org/10.1371/journal.pone.0007678>
Salem, André, and Ludovic Lebart, ‘Statistique Textuelle’, ResearchGate, 1994 <https://www.researchgate.net/publication/44832136_Statistique_textuelle> [accessed 7 September 2019]
Angenot, Marc, Le roman populaire: recherches en paralittérature (Montreal: Presses de l’université du Québec, 1975) <http://digitale-objekte.hbz-nrw.de/storage/2009/11/07/file_18/3292944.pdf> [accessed 31 October 2018]
Anthony, Laurence, ‘AntConc: Design and Development of a Freeware Corpus Analysis Toolkit for the Technical Writing Classroom’, 2005, pp. 729–37 <https://doi.org/10.1109/IPCC.2005.1494244>
Auer, P., ‘Anmerkungen zum Salienzbegriff in der Soziolinguistik’, Linguistik Online, 66.4 (2014) <https://doi.org/10.13092/lo.66.1569>
Auerbach, Erich, Mimesis: dargestellte Wirklichkeit in der abendländischen Literatur, Sammlung Dalp, 11th edn (Tübingen: A. Francke Verlag, 2015)
Baayen, Harald, ‘Statistical Models for Word Frequency Distributions: A Linguistic Evaluation’, Computers and the Humanities, 26.5–6 (1992), 347–63 <https://doi.org/10.1007/BF00136980>
Baayen, R. H., Analyzing Linguistic Data: A Practical Introduction to Statistics Using R (Cambridge: Cambridge University Press, 2008) <https://doi.org/10.1017/CBO9780511801686>
Baeza-Yates, Ricardo, and Berthier Ribeiro-Neto, Modern Information Retrieval (Harlow, 1999)
Baker, L. Douglas, and Andrew Kachites McCallum, ‘Distributional Clustering of Words for Text Classification’, in Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR ’98 (presented at the 21st annual international ACM SIGIR conference, Melbourne, Australia: ACM Press, 1998), pp. 96–103 <https://doi.org/10.1145/290941.290970>
Baron, Alistair, Paul Rayson, and Dawn Archer, ‘Word Frequency and Key Word Statistics in Historical Corpus Linguistics’, Anglistik, 20.1 (2009)
Bassi, Erica, ‘A Contrastive Analysis of Keywords in Newspaper Articles on the “Kyoto Protocol”’, in Keyness in Texts, 2010 <https://benjamins.com/catalog/scl.41.15bas> [accessed 23 July 2020]
Beauvisage, Thomas, ‘Exploiter des données morphosyntaxiques pour l’étude statistique des genres. Application au roman policier’, Texto!, 2001 <http://www.revue-texto.net/index.php?id=629>
Bertels, Ann, and Dirk Speelman, ‘“Keywords Method” versus “Calcul Des Spécificités”: A Comparison of Tools and Methods’, International Journal of Corpus Linguistics, 18.4 (2013), 536–60 <https://doi.org/10.1075/ijcl.18.4.04ber>
Bestgen, Yves, ‘Evaluating the Frequency Threshold for Selecting Lexical Bundles by Means of an Extension of the Fisher’s Exact Test’, Corpora, 13.2 (2018), 205–28 <https://doi.org/10.3366/cor.2018.0144>
Bestgen, Yves, ‘Inadequacy of the Chi-Squared Test to Examine Vocabulary Differences between Corpora’, Literary and Linguistic Computing, 29.2 (2014), 164–70 <https://doi.org/10.1093/llc/fqt020>
Biber, Douglas, ‘Methodological Issues Regarding Corpus-Based Analyses of Linguistic Variation’, Literary and Linguistic Computing, 5.4 (1990), 257–69 <https://doi.org/10.1093/llc/5.4.257>
Biber, Douglas, Susan Conrad, and Randi Reppen, Corpus Linguistics: Investigating Language Structure and Use (Cambridge University Press, 1998)
Biemann, Chris, and Alexander Mehler, Text Mining: From Ontology Learning to Automated Text Processing Applications (Springer, 2014)
Blumenthal-Dramé, Alice, Adriana Hanulíková, and Bernd Kortmann, ‘Editorial: Perceptual Linguistic Salience: Modeling Causes and Consequences’, Frontiers in Psychology, 8 (2017) <https://doi.org/10.3389/fpsyg.2017.00411>
Boileau, Pierre, and Thomas Narcejac, Le roman policier [1975] (Paris: PUF, 1994)
Bondi, Marina, ‘Perspectives on Keywords and Keyness’, in Keyness in Texts, 2010 <https://benjamins.com/catalog/scl.41.01bon> [accessed 23 July 2020]
Bonin, Emmanuel, and Alain Dallo, ‘Hyperbase et Lexico 3, outils lexicométriques pour l’historien’, Histoire & mesure, XVIII.3/4 (2003), 389–402 <https://doi.org/10.4000/histoiremesure.840>
Bordet, Geneviève, ‘Marina Bondi (Dir.), Mike Scott (Dir.), Keyness in Texts. Amsterdam/Philadelphia: John Benjamins Publishing Company, 2010’, ASp. La Revue Du GERAS, 71, 2017, 179–88 <http://journals.openedition.org/asp/4932> [accessed 23 July 2020]
Brezina, Vaclav, and Miriam Meyerhoff, ‘Significant or Random?: A Critical Review of Sociolinguistic Generalisations Based on Large Corpora’, International Journal of Corpus Linguistics, 19.1 (2014), 1–28 <https://doi.org/10.1075/ijcl.19.1.01bre>
Brinegar, Claude S., ‘Mark Twain and the Quintus Curtius Snodgrass Letters: A Statistical Test of Authorship’, Journal of the American Statistical Association, 58.301 (1963), 85–96 <https://doi.org/10.1080/01621459.1963.10500834>
Bruza, P. D., D. W. Song, and K. F. Wong, ‘Aboutness from a Commonsense Perspective’, Journal of the American Society for Information Science, 51.12 (2000), 1090–1105 <https://doi.org/10.1002/1097-4571(2000)9999:9999<::AID-ASI1026>3.0.CO;2-Y>
Burrows, J. F., ‘Not Unless You Ask Nicely: The Interpretative Nexus Between Analysis and Information’, Literary and Linguistic Computing, 7.2 (1992), 91–109 <https://doi.org/10.1093/llc/7.2.91>
Burrows, John, ‘Who Wrote Shamela? Verifying the Authorship of a Parodic Text’, Literary and Linguistic Computing, 20.4 (2005), 437–50 <https://doi.org/10.1093/llc/fqi049>
Burrows, John, ‘All the Way Through: Testing for Authorship in Different Frequency Strata’, Literary and Linguistic Computing, 22.1 (2007), 27–47 <https://doi.org/10.1093/llc/fqi067>
Burrows, John, and Hugh Craig, ‘Lucy Hutchinson and the Authorship of Two Seventeenth-Century Poems: A Computational Approach’, The Seventeenth Century, 16.2 (2001), 259–82 <https://doi.org/10.1080/0268117X.2001.10555493>
Chen, Francine R., Thorsten H. Brants, and Annie E. Zaenen, ‘Systems and Methods for Sentence Based Interactive Topic-Based Text Summarization’, 2008 <https://patents.google.com/patent/US7376893B2/en> [accessed 17 September 2019]
Chen, Kewen, Zuping Zhang, Jun Long, and Hao Zhang, ‘Turning from TF-IDF to TF-IGM for Term Weighting in Text Classification’, Expert Systems with Applications, 66 (2016), 245–60 <https://doi.org/10.1016/j.eswa.2016.09.009>
Church, Kenneth, and William Gale, ‘Inverse Document Frequency (IDF): A Measure of Deviations from Poisson’, in Third Workshop on Very Large Corpora, 1995 <https://www.aclweb.org/anthology/W95-0110> [accessed 12 June 2020]
Clement, R., ‘Ngram and Bayesian Classification of Documents for Topic and Authorship’, Literary and Linguistic Computing, 18.4 (2003), 423–47 <https://doi.org/10.1093/llc/18.4.423>
Conference on Artificial Intelligence, Innovative Applications of Artificial Intelligence Conference, and Association for the Advancement of Artificial Intelligence, eds., Single Document Keyphrase Extraction Using Neighborhood Knowledge (Menlo Park, Calif: AAAI Press, 2008)
Constans, Ellen, Parlez-moi d’amour: le roman sentimental ; des romans grecs aux collections de l’an 2000 (Limoges: PULIM, 1999)
Cormack, Gordon V., and Thomas R. Lynam, ‘Validity and Power of T-Test for Comparing MAP and GMAP’, in Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’07 (New York, NY, USA: ACM, 2007), pp. 753–54 <https://doi.org/10.1145/1277741.1277892>
Craig, Hugh, and Arthur F. Kinney, eds., Shakespeare, Computers, and the Mystery of Authorship, 1st edn (Cambridge University Press, 2009)
Cressie, Noel A. C., and Timothy R. C. Read, ‘Pearson’s X² and the Loglikelihood Ratio Statistic G²: A Comparative Review’, 1989 <https://doi.org/10.2307/1403582>
Culpeper, Jonathan, ‘Keyness’, International Journal of Corpus Linguistics, 14.1 (2009) <https://benjamins.com/catalog/ijcl.14.1.03cul> [accessed 12 June 2020]
Damian-Gaillard, Béatrice, ‘Les romans sentimentaux des collections Harlequin : quelle(s) figure(s) de l’amoureux ? Quel(s) modèle(s) de relation(s) amoureuse(s) ?’, Questions de communication, 20, 2011, 317–36 <https://doi.org/10.4000/questionsdecommunication.2130>
Danilevsky, Marina, Chi Wang, Nihit Desai, Xiang Ren, Jingyi Guo, and Jiawei Han, ‘Automatic Construction and Ranking of Topical Keyphrases on Collections of Short Documents’, in Proceedings of the 2014 SIAM International Conference on Data Mining (Society for Industrial and Applied Mathematics, 2014), pp. 398–406 <https://doi.org/10.1137/1.9781611973440.46>
Hoover, David L., ‘Using the Zeta and Iota Spreadsheet’, 2017 <https://wp.nyu.edu/exceltextanalysis/zetaiotawidespectrum/usingzetaiota/> [accessed 17 September 2019]
Deleuze, Gilles, Differenz und Wiederholung, trans. by Joseph Vogl, 3rd edn (Paderborn: Wilhelm Fink Verlag, 2007)
Deng, Xuelian, Yuqing Li, Jian Weng, and Jilian Zhang, ‘Feature Selection for Text Classification: A Review’, Multimedia Tools and Applications, 78.3 (2019), 3797–3816 <https://doi.org/10.1007/s11042-018-6083-5>
Drouin, Patrick, ‘Term Extraction Using Non-Technical Corpora as a Point of Leverage’, Terminology, 9.1 (2003), 99–115 <https://doi.org/10.1075/term.9.1.06dro>
Dubois, Jacques, Le roman policier, ou la modernité, Le texte à l’œuvre (Paris: Colin, 2005)