Source: http://www.perezparedes.es/big-data-and-corpus-linguistics [accessed: 14/05/2015]
Excellent news! A couple of days ago, Andrew Hardie released a virtual machine with a preconfigured version of CQPweb installed:
> From: a.hardie(*at*)lancaster.ac.uk
> To: cwb(*at*)sslmit.unibo.it
> Date: Thu, 2 Apr 2015 05:20:33 +0000
> Subject: [CWB] Announcing CQPwebInABox
> Hi everybody,
> This is just a quick note to announce the availability of CQPwebInABox
> – a virtual machine image containing a pre-installed copy of CQPweb.
> This is designed to get beginners past the hump of having to install
> all the different components.
> The image (1.6GB) can be downloaded here:
> To run it, you will need to install VirtualBox (although I believe
> other virtualisation tools can also use the same file format, I haven’t
> yet tested this).
> You can get VirtualBox here:
> Then “import appliance” from the .ova download.
> The virtual machine runs Linux – however, I have set it up in such a
> way as to make the interface as similar to Windows as possible. So
> don’t fear the Linux!
> I will create some video tutorials & put them on YouTube as soon as I can.
> Feedback welcome.
CQPweb is a web-based graphical user interface (GUI) for some elements of the CWB – and in particular, the CQP query processor.
CQPweb is designed to replicate the user-interface of the popular BNCweb tool, which also (in its most recent versions) uses CQP as a back-end. Like BNCweb, CQPweb uses a database alongside the CWB to provide extra functions beyond those built into CWB/CQP. However, unlike BNCweb, CQPweb can be used with any corpus.
CQPweb is especially suitable for students, non-linguists, and others for whom a Unix-like command-line is a terrifying prospect. […]
[More] screenshots of CQPweb can be downloaded from this link.
Source: http://cwb.sourceforge.net/cqpweb.php [accessed: 14/01/2014]
Related posts on langui.ch:
Instant colourization of grammar structures, instant learning activities from any page on the web. Supported languages: English, German, Spanish.
The SketchEngine development team has just released a new open-source version of their tools (
finlib – Download-Links), including the following highlights:
- extended support for parallel corpora
- support for virtual corpora
- asynchronous query processing showing partial results as they are computed
- corpus info page providing an overall overview of the corpus stats
- lots of smaller enhancements in the functionality and usability of the user interface
- lots of speed enhancements, both for run time (query evaluation) and compile time (corpus indexing)
- lots of bugfixes
Source: http://nlp.fi.muni.cz/trac/noske/wiki/Downloads [accessed: 13/06/2014]
The package provides a simple, yet effective interface for a parallel corpus
using OpenCWB (http://cwb.sourceforge.net). It should work on any linux machine
with only minimal changes in the INI files to reflect paths, and possibly
adjustments concerning language codes. All settings are found in the settings
This web interface to CWB was initially written by Roland Meyer for use with the
ParaSol corpus (then Regensburg Parallel Corpus) in 2006 and has since been in
development by its three authors. The java script based functionality was mainly
added by Andreas Zeman, XSLT-support in the new modular interface mainly by
Ruprecht von Waldenfels, who has supervised the publication as open source. Part
of the architecture is described in Waldenfels (2011). We thank the Center for
the Study of Language and Society, University of Berne, (http://www.csls.unibe.ch)
for granting financial support enabling the publication of ParaVoz as open source.
Source: https://bitbucket.org/rvwfels/paravoz [accessed: 26/03/2014]
Quote Tool as:
- Roland Meyer, Ruprecht von Waldenfels, Andreas Zeman (2006-2014): ParaVoz –
a simple web interface for querying parallel corpora. Bern, Regensburg, Berlin.
- Ruprecht von Waldenfels (2011): Recent Developments in ParaSol: Breadth for Depth and XSLT based web concordancing with CWB. In: Daniela Majchráková and Radovan Garabík (eds.): Natural Language Processing, Multilinguality. Proceedings of Slovko 2011, Bratislava: Tribun, 156-162. Available online.
Related posts on langui.ch:
If you try to launch AntConc on a Debian-based 64-bit system, you get the following error message (tested with versions 3.2.4u and 3.4.1u):
./antconc3.2.4u: No such file or directory or ./AntConc: No such file or directory
Important note: Please respect Laurence Anthony’s licensing terms and ask for permission before using AntConc in a server/group environment (see README section ‘LEGAL MATTER’ (p. 11) for details).
i386 architecture on 64-bit systems:
apt-get install libc6-i386 dpkg --add-architecture i386
2) Install missing 32-bit libraries:
apt-get install libx11-6:i386 libxss1:i386 libxft2:i386
Class-room friendly collocations dictionary:
[Last update: 03/06/2015]
- Kilgarriff, A. (2014, March). “Corpora in the classroom without scaring the students.” British Council – EnglishAgenda Seminar. Retrieved from http://www.youtube.com/watch?v=2APIUxE_i6M [Adam’s talk starts at 1:09:35]
- Adam Kilgarriff, Miloš Husák, Katy McAdam, Michael Rundell, Pavel Rychlý (2008). “GDEX: Automatically Finding Good Dictionary Examples in a Corpus.” In Elisenda Bernal, Janet DeCesaris (Ed.), Proceedings of the 13th EURALEX International Congress (pp. 425–432). Barcelona, Spain: Institut Universitari de Linguistica Aplicada, Universitat Pompeu Fabra. Retrieved from EURALEX 2008