Category Archives: Python
spaCy
Extremely promising new Python NLP tool: spaCy (commercial open-source software):
Unfortunately, it can only handle English input at the moment, and installation on Windows seems to be tricky. The project is under intense development, so it will be interesting to check the following links on a regular basis:
License: AGPLv3 (free for open-source projects), changed to MIT License (27 Sep 2015)
Source: http://spacy.io/index.html#detailed-speed-comparison [accessed: 24/07/2015]
Recent overview of Python NLP resources on DataScienceCentral
Link to community-edited list on Pansop
Source: http://www.datasciencecentral.com/profiles/blogs/python-nlp-tools [accessed: 24/07/2015]
New release: textblob-de 0.4.2
New release of German language extension textblob-de for the popular textblob package:
See overview of working features compared to main package.
Overview for development branch: Click here.
Installing nltk3 for Python3 in virtual environment on Linux
[Last update: 04/01/2015]
Prerequisites: Install Python and set up virtual environment & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt
- (skip, if activated) Activate the virtual environment you would like to install nltk (v3) in:
-
cd ~/venv/apy3341
source bin/activate
- Use pip to install the latest version of nltk:
-
(apy3341):~$ pip install -U nltk
- [January 2015] Workaround for users with setuptools versions >=10 (see NLTK Issue #824):
-
(apy3341):~$ pip install setuptools==9.1
(apy3341):~$ pip install -U nltk
- (skip, if nltk_data already present on system) Start Python interpreter in virtual environment and download nltk_data (corpora, packages & models):
(apy3341) mki@ubuntu:~$ python
ActivePython 3.3.4.1 (ActiveState Software Inc.) based on
Python 3.3.4 (default, Feb 25 2014, 14:59:15)
[GCC 4.0.2 20051125 (Red Hat 4.0.2-8)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import nltk
>>> nltk.download()
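To confirm that both the package and the data download are in place, a short check along the following lines may help. This is only a minimal sketch: the helper name `nltk_status` is hypothetical, and `tokenizers/punkt` is just an example resource that may not be among the packages you selected in the downloader.

```python
import importlib.util


def nltk_status(resource="tokenizers/punkt"):
    """Return an (installed, data_found) pair for the active interpreter.

    `resource` is an example nltk_data path (an assumption); adjust it
    to a package you actually downloaded.
    """
    # find_spec returns None when the package cannot be located.
    if importlib.util.find_spec("nltk") is None:
        return (False, False)

    import nltk

    try:
        # nltk.data.find raises LookupError if the resource is missing.
        nltk.data.find(resource)
    except LookupError:
        return (True, False)
    return (True, True)


if __name__ == "__main__":
    installed, data_found = nltk_status()
    print("nltk importable:", installed)
    print("example resource found:", data_found)
```

Run it with the interpreter of the activated virtual environment; a result of `(True, False)` would suggest that nltk is installed but the example resource is not on the nltk_data search path.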
pip installations of sphinx, numpy & imapclient in virtual environment on Linux work out-of-the-box
Prerequisites: Install Python and set up virtual environment & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt
[Last update: 03/07/2014]
- (skip, if activated) Activate the virtual environment you would like to install additional python packages in:
-
cd ~/venv/apy3341
source bin/activate
- Use pip to install/update additional packages (the installation of the following packages worked out-of-the-box):
-
bin/pip install -U sphinx
-
bin/pip install -U numpy
-
bin/pip install -U imapclient
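After the installs above, it can be useful to check from within Python that the interpreter really belongs to the virtual environment and which versions were installed. The following is a sketch under assumptions: it relies on `importlib.metadata`, which requires Python 3.8 or newer (newer than the ActivePython 3.3 build used in this post), and the function names are hypothetical.

```python
import sys
from importlib import metadata


def in_virtualenv():
    """Return True if the interpreter appears to run inside a virtual environment."""
    # venv sets sys.base_prefix to the base installation;
    # older virtualenv releases set sys.real_prefix instead.
    base_prefix = getattr(sys, "base_prefix", sys.prefix)
    return hasattr(sys, "real_prefix") or base_prefix != sys.prefix


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    for name, version in installed_versions(["sphinx", "numpy", "imapclient"]).items():
        print(name, version or "not installed")
```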
Installing lxml in virtual environment on Linux
Prerequisites: Install Python and set up virtual environment & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt
[Last update: 03/07/2014]
- (skip, if activated) Activate the virtual environment you would like to install lxml in:
-
cd ~/venv/apy3341
source bin/activate
- Install/check for lxml dependencies (for pip to be able to compile lxml from source code):
-
sudo apt-get install libxml2-dev libxslt1-dev zlib1g zlib1g-dev
- Use pip to install/update lxml:
-
bin/pip install -U lxml
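Since pip compiles lxml against the system libraries installed above, a quick look at the versions it was built with can confirm that the build picked them up. A minimal sketch, assuming lxml imported successfully; the helper name `lxml_build_info` is hypothetical, while the version constants are attributes of `lxml.etree`.

```python
def lxml_build_info():
    """Return lxml and library versions as tuples, or None if lxml is unavailable."""
    try:
        from lxml import etree
    except ImportError:
        # Raised when lxml is missing or its compiled extension cannot load.
        return None
    return {
        "lxml": etree.LXML_VERSION,
        "libxml2": etree.LIBXML_VERSION,
        "libxslt": etree.LIBXSLT_VERSION,
    }


if __name__ == "__main__":
    info = lxml_build_info()
    if info is None:
        print("lxml could not be imported")
    else:
        for name, version in info.items():
            print(name, ".".join(str(part) for part in version))
```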
Installing ipython in virtual environment on Linux
Prerequisites: Install Python and set up virtual environment & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt
[Last update: 03/07/2014]
- (skip, if activated) Activate the virtual environment you would like to install ipython in:
-
cd ~/venv/apy3341
source bin/activate
- Make sure that readline support is installed in your virtual environment (prerequisite for ipython’s colour prompts).
- Use pip to install/update ipython:
-
bin/pip install -U ipython
Installing readline support in ActivePython virtual environment on Linux
Prerequisites: Install Python and set up virtual environment & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt
[Last update: 03/07/2014]
- (skip, if activated) Activate the virtual environment you would like to install readline in:
-
cd ~/venv/apy3341
source bin/activate
- Install/check for readline dependencies (for pip to be able to compile readline from source code):
-
sudo apt-get install libncurses5-dev
- Use pip to install/update readline:
-
bin/pip install -U readline
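Whether readline support is actually available to the interpreter can be checked with a few lines of Python. This is a hedged sketch: the function name `readline_backend` is hypothetical, and the docstring test for libedit is a common heuristic, not a guaranteed way to identify the backend.

```python
def readline_backend():
    """Return None if readline is missing, else a short backend label."""
    try:
        import readline
    except ImportError:
        return None
    # Heuristic (assumption): libedit-based builds mention "libedit"
    # in the module docstring; anything else is assumed to be GNU readline.
    doc = readline.__doc__ or ""
    return "libedit" if "libedit" in doc else "GNU readline"


if __name__ == "__main__":
    backend = readline_backend()
    print("readline support:", backend or "not available")
```

If this prints "not available" inside the virtual environment, line editing and ipython's colour prompts are unlikely to work until the readline package has been installed as described above.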