Group items tagged corpus linguistics

marrti

Corpus Linguistics - 0 views

  •  
    Corpus Linguistics is a site designed as a supplement to the book Corpus Linguistics, although it can be used on its own. The project is funded by IHE (Innovation in Higher Education). The site consists of four major sections: Early Corpus Linguistics and the Chomskyan Revolution; What is a Corpus and What is in it?; Quantitative Data; and The Use of Corpora in Language Studies.
marrti

Research & Development Unit for English Studies (RDUES) - 0 views

  •  
    The Research and Development Unit for English Studies (RDUES), based in the School of English at Birmingham City University, is a small team of corpus linguists, software engineers and statisticians. We carry out fundamental and applied research in corpus linguistics, developing new descriptions of the language in use and tools for the extraction and management of knowledge in databases. The Unit's linguistic background is broad: corpus-based linguistics, lexicography, applied linguistics, the study of modern English language, modern languages, TEFL. Activities include:
marrti

Corpora and corpus linguistics resources for Language Learning and Teaching - 0 views

  •  
    This website provides links and references for the use of corpora, corpus linguistics and corpus analysis in the context of language learning and teaching. It also links to two ongoing research and development projects in which I am involved, the EU MINERVA project SACODEYL (development and pedagogical exploitation of youth language corpora) and the ELISA corpus, which I started to develop at the University of Tübingen (Applied English Linguistics) as a resource for language learning and teaching.
marrti

Corpus Linguistics SiBol Group - 0 views

  •  
    The Bologna group of corpus linguists has a particular interest in Corpus-assisted Discourse Studies (CADS) in which such statistical analyses are married to the more traditional kind of analyses employed in discourse studies; there is typically a "shunting" (Halliday) between statistical analyses and close textual reading. The aim of the CADS approach is the uncovering, in the discourse type under study, of what we might call non-obvious meaning, that is, meaning which might not be readily available to naked-eye perusal. Together with researchers at the Universities of Siena and Portsmouth the Bologna group has also devised a new form of CADS, denominated Modern-Diachronic Corpus-assisted Discourse Studies (MD-CADS) where large corpora of a parallel structure and content from different moments of contemporary time are employed in order to track changes in modern language usage but also social, cultural and political changes over modern times, as reflected in language.
marrti

International Corpus of English (ICE) Homepage - 0 views

  •  
    The International Corpus of English (ICE) website presents a corpus compilation project that aims to provide comparable corpora of English from different English-speaking regions around the world. Each corpus will contain one million words of spoken and written language, taken from a wide range of sources and situations.
marrti

JRC-Acquis Multilingual Parallel Corpus - 0 views

  •  
    JRC-Acquis Multilingual Parallel Corpus is a collection of European Union legal texts in 22 of the member states' languages that has been aligned and coded in XML, providing an invaluable tool for linguistic research and a resource for computational linguistic applications. The corpus consists of a selection of texts from the Acquis Communautaire (AC), the total body of European Union (EU) law applicable in the EU member states, and contains some 636 million word tokens. The languages included are Bulgarian, Czech, Swedish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene and Danish.
marrti

EUSTACE home page - 0 views

  •  
    EUSTACE is the Edinburgh University Speech Timing Archive and Corpus of English. The EUSTACE corpus comprises 4608 spoken sentences recorded at the department of Theoretical and Applied Linguistics of Edinburgh University. These sentences, spoken by six speakers of British English, were designed to examine a number of durational effects in speech and are controlled for length and phonetic content. Subconstituents of key words in each sentence have been identified by labels in xlabel (ESPS) format and notes have been made about the prosodic realisation of the sentences. It is hoped that these recordings will be useful for phonetics researchers and speech technologists working on synthesis and recognition.
marrti

The IViE Corpus - 0 views

  •  
    The IViE corpus: English Intonation in the British Isles website provides information about the Intonational Variation in English (IViE) project and access to the IViE corpus.
marrti

Corpus Encoding Standard - 0 views

  •  
    Corpus Encoding Standard (CES) is an online set of guidelines developed by the Expert Advisory Group on Language Engineering Standards (EAGLES). It is designed to be optimally suited for use in language engineering research and applications, and to serve as a set of encoding standards for corpus-based work in natural language processing.
marrti

WebCorp: The Web as Corpus - 0 views

  •  
    An interface that lets you analyse the web using corpus linguistic tools.
marrti

French oral narrative - 0 views

  •  
    Welcome to the French Oral Narrative Corpus, the first digitised corpus of oral storytelling in French. The project was funded by the Arts and Humanities Research Council and the British Academy. It was created in partnership with the Conservatoire contemporain de Littérature Orale in Vendôme (one of France's leading centres for oral storytelling) and the Oxford Text Archive, a digitised collection of thousands of texts in over twenty-five languages, including a range of linguistic corpora.
marrti

Clavier - 0 views

  •  
    The Corpus and LAnguage Variation In English Research Group (CLAVIER) is a Research Centre recently founded by the Universities of Bergamo, Firenze, Modena and Reggio Emilia, Roma La Sapienza and Siena, and currently based in Modena. The shared methodological framework combines two complementary strands of linguistic investigation - corpus analysis and discourse analysis - to research on language variation in English, both in quantitative and qualitative terms. Thus, one of the main aims of CLAVIER is to contribute to an integration of quantitative and qualitative methods in research on language variation in terms of diachronic, geographic and socio-cultural dimensions through the methodological tools offered by corpora. Particular attention is also given to the analysis of different genres and methodological registers.
marrti

Coventry University - British Academic Written English Corpus (BAWE) - 0 views

  •  
    British Academic Written English Corpus (BAWE)
marrti

Real Academia Española - CREA - 0 views

  •  
    Real Academia Española - Corpus de Referencia del Español Actual (CREA)
marrti

Real Academia Española - CORDE - 0 views

  •  
    Real Academia Española - Corpus Diacrónico del Español (CORDE)
marrti

Corpus of Contemporary American English (COCA) - 0 views

  •  
    The Corpus of Contemporary American English (COCA)