Skip to main content

Home/ MHC Languages/ Group items tagged corpora

Rss Feed Group items tagged

Daryl Beres

Using corpora and concordances in the classroom « IH Journal - 0 views

  •  
    "I was unaware of exactly what could be gained from using corpora, having never done so, nor had I seen it mentioned in the numerous coursebooks I had used. In fact, the extent to which corpora have infiltrated teaching is not non-existent but remains marginal. A DELTA assignment later and the week before I was to give an input session on using concordances in the classroom, various colleagues sidled up to me muttering, "So, er, what's a concordance?" This is perhaps the most sensible place to begin."
LRC MHC

French Learner Language Oral Corpora (FLLOC) - 1 views

  •  
    Our long term goal is to promote research relating to the acquisition of French as a second/foreign language, by providing access to a growing database of French Learner Language Oral Corpora. The contents of the database are being made freely available to the research community, in the form of digital sound files and related transcripts formatted using CHILDES software. A search facility is also provided which enables researchers to select the sound files and transcripts they wish to download from the site. The database currently contains over 4000 files (sound files, transcripts and morphosyntactically tagged transcripts). The website also aims to include an inventory of other French interlanguage corpora not available within FLLOC but held by other researchers and institutions.
LRC MHC

Michigan Corpus Linguistics Home >> ELI Corpora & UM ACL - 0 views

  •  
    The Michigan Corpus Linguistics team consists of researchers and students at the University of Michigan (U-M) English Language Institute (ELI). We create corpora of spoken and written academic English, provide corpus-based materials for EAP (English for Academic Purposes) teaching, and carry out research in different areas of corpus linguistics. On our website you find information about the corpora we make available, the projects we work on, and the training we provide to ELI visiting scholars and University of Michigan students.
LRC MHC

Corpora for Language Learning and Teaching, Tools & Websites - 1 views

  •  
    This page offers information about some common corpus tools and links to resources on the web: * Online search in corpora * Online full-text search in books * Text and media archives * Online text/corpus analysis tools * Offline text/corpus analysis tools (concordancers) * Further resources * Corpus linguistics websites
LRC MHC

Sketch Engine - 0 views

  •  
    "The Sketch Engine (SkE, also known as Word Sketch Engine) is a Corpus Query System incorporating word sketches, grammatical relations, and a distributional thesaurus. A word sketch is a one-page, automatic, corpus-derived summary of a word's grammatical and collocational behaviour. A Sketch Engine account gives you * Pre-loaded corpora (60M-2B words) for o Chinese, English, French, German, Italian, Japanese, Portuguese, Spanish, Slovene o Other languages to follow * WebBootCaT o Build your own instant corpus o Extract keywords o Specialist terminology, any language * CorpusBuilder o Upload and install your own corpora Web service using standard browsers. No software installation required. "
LRC MHC

Mannheim Corpus: Cyril Belica: Kookkurrenzdatenbank CCDB. Eine korpuslinguist... - 0 views

  •  
    Mannheim Corpus: A very big - and free - corpus of German texts maintained by the Institut für Deutsche Sprache, including a choice of corpora and a lot of search facilities:
Daryl Beres

SACODEYL European Youth Language - 0 views

  •  
    SACODEYL presents an innovative ICT-based solution for the compilation and pedagogical, language learning-oriented exploitation of linguistic teen talk oral corpora. SACODEYL is an integration initiative that takes into account areas as: 1. multilingual corpus compilation and innovation support, 2. pedagogical adaptation of existing ICT tools for the integration of corpora and corpus-based methods of Data Driven Learning (DDL) into mainstream language teaching, and 3. know-how building which will contribute to the future drafting of specifications that can take both DDL and the use of oral corpora in language education a step forward. SACODEYL sees itself as a pedagogical mediator in the language learning process of young European, exploiting ICT resources to deliver learning experiences based on data driven, constructivist approaches to language acquisition. SACODEYL will demand young learners of languages across Europe to personally explore, and engage in learning tasks of different nature.
  •  
    SACODEYL is a web based system for the assisted compilation and open distribution of European teen talk in the context of language education. The project includes the collection and distribution of English, French, German, Italian, Lithuanian, Romanian, and Spanish teen talk. SACODEYL sees itself as a pedagogical mediator in the language learning process of young Europeans, exploiting web multimedia resources to deliver learning experiences based on data driven, constructivist approaches to language acquisition.
LRC MHC

UAM CorpusTool Homepage - 0 views

  •  
    "The UAM CorpusTool is a state-of-the-art environment for annotation of text corpora. So, whether you are annotating a corpus as part of a linguistic study, or building a training set for use in statistical language processing, this is the tool for you."
Daryl Beres

Google as a Quick 'n Dirty Corpus Tool - 0 views

  •  
    "Until recently it was assumed that specialized software was required to do concordancing, but it turns out that a search engine such as Google can generate queries into almost limitless corpora (using the Advanced Search feature from the main portal page, for example). This paper by Tom Robb addresses more refined issues regarding the integrity of the data thus derived, and how we might improve on the integrity of that data through more defined searches, as explained here. "
Daryl Beres

The PolyU Language Bank - 0 views

  •  
    The PolyU Language Bank, developed in the Department of English at Hong Kong PolyU, is a large archive of language corpora made up of a wide range of written and spoken texts totalling over 12 million words. Corpus searches can be performed using the Bank's built-in Web-based concordancer, enabling the easy use of corpus resources for language teaching and research.
LRC MHC

Web Concordancer - 1 views

  •  
    You can search for concordances and concgrams with these programs, find whole sentences with the Sentence Concordancer, browse the Indexed Corpus Files or you can upload and search your own Personal Corpus. Includes English, Chinese, French and Japanese corpora
Daryl Beres

Language Learning & Technology Vol5Num3 - 0 views

  •  
    Special Issue: Using Corpora in Language Teaching and Learning
LRC MHC

Moteur de recherche - Corpus Lexicaux Québécois - 0 views

  •  
    Corpus Lexicaux Québécois: Canadian French corpora with search facilities
LRC MHC

غلطاوي:التصحيح التلقائي العربي Ghalatawi:Arabic AutoCorrect - 0 views

  •  
    Tools for Arabic study, including spell-checker, autocorrect, corpora, word tagger, morphological analyzers, dictionaries, games, etc.
Daryl Beres

Transcriber - 1 views

  •  
    Transcriber is a tool for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface for segmenting long duration speech recordings, transcribing them, and labeling speech turns, topic changes and acoustic conditions. It is more specifically designed for the annotation of broadcast news recordings, for creating corpora used in the development of automatic broadcast news transcription systems, but its features might be found useful in other areas of speech research.
LRC MHC

The Generation 1.5 Project >> ELI Corpora & UM ACL - 0 views

  •  
    The Generation 1.5 project was launched in 2006 by a small group of researchers at the University of Michigan (U-M) English Language Institute (ELI). We are collecting texts from writers who are just starting their academic careers at the university, specifically 1st and 2nd year undergraduates, who speak a variety of Korean or Chinese at home. Ultimately, the aim of this data collection is to provide an empirical basis for developing recommendations for writing/language instruction specialized for the Generation 1.5 group.
Daryl Beres

Alfred Lord Tennyson: Mariana - 0 views

  •  
    Sample activity for literature analysis use concordance of a poem
Daryl Beres

Tennyson: MARIANA, The concordance. - 0 views

  •  
    Sample activity using concordance for literature analysis with a poem by Alfred Lord Tennyson.
Daryl Beres

VISL Corpuseye - 0 views

  •  
    Corpus and treebank search including Danish, English French, German, Italian, Norwegian, Portuguese, Spanish, Esperanto, Romanian, Swedish, Icelandic, Faroese
Daryl Beres

WebCorp: The Web as Corpus - 0 views

  •  
    An improved version of the original WebCorp, designed to search the web for concordances in real time. Can limit by country code or to specific domains.
1 - 20 of 51 Next › Last »
Showing 20 items per page