Importing data into AntConc
Overview
Teaching: 10 min
Exercises: 0 min
Questions
How do I get data into AntConc?
Objectives
Successfully import data into AntConc
What kinds of data files can I import?
AntConc works only with plain text files, such as those with the file extension .txt. Other types of plain text file (e.g. .csv, .tsv, .xml, .html) can be imported into AntConc, though, depending on your use case, doing so may not let you use the dataset as intended. AntConc will not read common formats like .doc, .xls, or .pdf; you will need to convert these into .txt files to use them in AntConc. A common approach with catalogue data is to process it so that particular fields can be analysed in AntConc. For information on how we processed the .txt files in BM-MDG.zip for use in AntConc, see Creation of the BMSatire Descriptions corpus.
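As one illustration of processing catalogue data so that a particular field can be analysed, the sketch below copies a single column from a .csv export into a plain text file that AntConc can open. It is a minimal Python sketch under stated assumptions, not the method used to build the BM-MDG.zip files: the function name `extract_field` and any file or column names you pass to it are hypothetical.

```python
import csv


def extract_field(csv_path, field, txt_path):
    """Write one column of a CSV file to a plain text file, one record per line.

    Returns the number of non-empty records written.
    """
    written = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(txt_path, "w", encoding="utf-8") as dst:
        for row in csv.DictReader(src):
            # Collapse runs of whitespace (including line breaks inside a
            # quoted cell) so that each record stays on a single line.
            text = " ".join((row.get(field) or "").split())
            if text:
                dst.write(text + "\n")
                written += 1
    return written
```

For example, `extract_field("catalogue.csv", "description", "descriptions.txt")` would write the (hypothetical) `description` column of `catalogue.csv` to `descriptions.txt`, skipping empty cells.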
Create your first AntConc project (using provided data)
To import the data for the exercise below, follow the instructions in Setup to download the data and run AntConc.
- Once AntConc is launched, click File from the navbar and select the option for opening files.
- Navigate to where you unzipped BM-MDG.zip, and whilst holding ctrl (or cmd for Mac) click on each of the twelve .txt files (note: holding shift and hitting the down arrow also works here). Alternatively, the Open Dir option in the File dropdown can be used to open a whole directory.
- Click Open. The names of the twelve .txt files will now appear in the left-hand Corpus Files pane.
- Note that although this module asks you to import twelve .txt files, they are in fact a single corpus of around 1.2 million words separated into parts containing roughly 100,000 words each. AntConc performs better with many smaller files than it does with one large file, so if you expect to be working with a large corpus, or notice AntConc running slowly (or even crashing!), consider dividing up your corpus to improve performance.
- Files can be added to the AntConc ‘corpus’ at any time during analysis; just note that your results will change depending on the files listed under Corpus Files.
- If at any time you want to remove a file from AntConc, highlight it in the Corpus Files pane, go to the navbar, click File, and select Close Selected File(s).
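The advice above about dividing a large corpus into smaller files can be sketched in Python. This is a minimal sketch, not the procedure used to prepare BM-MDG.zip: the function name `split_corpus` and the numbered output file names are hypothetical, and the default of 100,000 words per file simply mirrors the part size mentioned above. It reads the whole source file into memory and splits on whitespace, so original line breaks are not preserved.

```python
from pathlib import Path


def split_corpus(source, out_dir, words_per_file=100_000):
    """Split one plain text file into numbered parts of about words_per_file words.

    Returns the list of paths written, in order.
    """
    source = Path(source)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # Whitespace tokenisation: a rough word count, good enough for sizing parts.
    words = source.read_text(encoding="utf-8").split()

    parts = []
    for n, start in enumerate(range(0, len(words), words_per_file), start=1):
        part = out_dir / f"{source.stem}_{n:02d}.txt"
        chunk = words[start:start + words_per_file]
        part.write_text(" ".join(chunk) + "\n", encoding="utf-8")
        parts.append(part)
    return parts
```

The resulting folder can then be loaded in one step with the Open Dir option described above.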
Use the Open option to import data
You can import individual files or a folder
AntConc works only with plain text files, for example those with the file extension .txt
AntConc will not read common formats like .doc, .xls, or .pdf. You will need to convert these into .txt files to use AntConc.