Unit 8. Batch Processing Command Line Tools
Level
Advanced
Time
This Unit should not take you more than 3 hours.
Learning Outcomes
By the end of this unit you should be able to:
- execute command line functions from a Python script
- apply the file listing script, written previously, to batch process files of a certain type.
Further Reading
- GDAL – http://www.gdal.org/
- OGR – http://www.gdal.org/ogr/
- Python Documentation – http://www.python.org/doc/
- Core Python Programming (Second Edition), W.J. Chun, Prentice Hall ISBN 0-13-226993-7 (Also available online – http://www.network-theory.co.uk/docs/pytut/)
- Learn UNIX in 10 minutes – http://freeengineer.org/learnUNIXin10minutes.html
8.1 Introduction
There are many command line tools and utilities available for the main types of platform: Windows, Linux and Mac OS X. These tools are extremely useful and range in function from simple tasks such as renaming a file to more complex tasks such as merging ESRI shapefiles. One problem with these tools is that if you have a large number of files which need to be processed in the same way, it is time consuming and error-prone to manually run the command for each file. Therefore, if we can write scripts to do this work for us, processing large numbers of individual files becomes a much simpler, error-free task.
For this unit you will need to have the command line tools that come with the GDAL/OGR (http://www.gdal.org) open source software library installed and available on your path. The installation of Python(x,y) provides the Python libraries for GDAL/OGR but not the command line utilities that can be used alongside these libraries.
To install these command line tools on Windows the FWTools (http://fwtools.maptools.org/) package is recommended. To install, download FWTools and run the installer. Following installation it is recommended that you add the tools provided by FWTools to the PATH environment variable to allow for easier access later on.
To set the PATH variable you first need to identify the directory path where FWTools has been installed.
To do this, open Windows Explorer (“My Computer”), navigate to ‘Program Files’ and look for a directory named FWTools<version>. Note that the version number will change depending on when you downloaded the software. I have version 2.3.0, therefore my path is:
C:\Program Files\FWTools2.3.0\bin
The bin directory contains the executable programs and this needs adding to the path.
The next step is to set the environment variable, PATH. To do this right-click on ‘My Computer’ and select Properties, Figure 8.1.
Figure 8.1: Right-click on ‘My Computer’ and select Properties
Once open, select the ‘Advanced’ tab and then click the ‘Environment Variables’ button. Within the list of variables scroll down the list of ‘System Variables’ until you find the ‘Path’ variable and select edit. Enter a semi-colon (;) after the last entry and copy-and-paste your path following the semi-colon. Finally, select OK on all the open dialog boxes and the variable should now be set, Figure 8.2.
Figure 8.2: Select the ‘Advanced’ tab and then click the ‘Environment Variables’ button. Scroll down the list of ‘System Variables’ until you find the ‘Path’ variable and select edit. Add in the path identified above and select OK on all open dialog boxes.
8.2 Merging ESRI Shapefiles
The first example illustrates how the `ogr2ogr` command can be used to merge shapefiles and how a Python script can be used to turn this command into a batch process where a whole directory of shapefiles can be merged.
To perform this operation two commands are required. The first makes a copy of the first shapefile in the list of files into a new file, shown below:
> ogr2ogr <inputfile> <outputfile>
The second command appends the contents of the inputted shapefile onto the end of an existing shapefile (i.e., the one just copied):
> ogr2ogr -update -append <inputfile> <outputfile> -nln <outputfilename>
For both these commands the shapefiles all need to be of the same type (point, polyline or polygon) and contain the same attributes. Therefore, your first exercise is to understand the use of the ogr2ogr command and try it from the command line with the data provided. Hint: running ogr2ogr without any options will display its help text.
The second stage is to develop a Python script to call the appropriate commands to perform the required operation. The following processes will be required:
- Get the user inputs.
- List the contents of the input directory.
- Iterate through the directory and run the required commands.
But the first step is to create the class structure in which the code will fit; this will be something similar to that shown below:
#! /usr/bin/env python
#######################################
# MergeSHPfiles.py
# A python script to merge shapefiles
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

import os

class MergeSHPfiles (object):
    # A function which controls the rest of the script
    def run(self):
        # Define the input directory
        filePath = 'C:\\PythonCourse\\unit8\\TreeCrowns\\'
        # Define the output file
        newSHPfile = 'C:\\PythonCourse\\unit8\\Merged_shapefile.shp'

# The start of the code
if __name__ == '__main__':
    # Make an instance of the class
    obj = MergeSHPfiles()
    # Call the function run()
    obj.run()
The script will have the input directory and output file hard coded (as shown) within the run function. Therefore, you need to edit these file paths to the location you have saved the files. Please note that under Windows you need to insert a double backslash (i.e., \\) within the file path, as a single backslash starts an escape sequence (e.g., \n for new line) within strings.
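The escaping behaviour can be checked directly in Python; the paths below are just examples:

```python
# A single backslash starts an escape sequence: '\n' is one
# character (a new line) and '\\' is one character (a backslash).
print(len('\n'))
print(len('\\'))
# An alternative is a 'raw' string, prefixed with r, in which
# backslashes are kept literally. A raw string cannot end with a
# backslash, hence the trailing separator is concatenated on:
path = 'C:\\PythonCourse\\unit8\\TreeCrowns\\'
rawPath = r'C:\PythonCourse\unit8\TreeCrowns' + '\\'
print(path == rawPath)
```

Both forms produce exactly the same string, so either can be used in the scripts in this unit.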
The next step is to check that the input directory exists and is a directory. To do this edit your run function as below:
    # A function which controls the rest of the script
    def run(self):
        # Define the input directory
        filePath = 'C:\\PythonCourse\\unit8\\TreeCrowns\\'
        # Define the output file
        newSHPfile = 'C:\\PythonCourse\\unit8\\Merged_shapefile.shp'
        # Check input file path exists and is a directory
        if not os.path.exists(filePath):
            print 'Filepath does not exist'
        elif not os.path.isdir(filePath):
            print 'Filepath is not a directory!'
        else:
            # Merge the shapefiles within the filePath
            self.mergeSHPfiles(filePath, newSHPfile)
Additionally, you need to add the function mergeSHPfiles, which is where the shapefiles will be merged.
    # A function to control the merging of shapefiles
    def mergeSHPfiles(self, filePath, newSHPfile):
To merge the shapefiles the first task is to get a list of all the shapefiles within a directory. To do this, use the code you developed in Unit 4 to list files within a directory and edit it such that the files are outputted to a list rather than printed to screen, as shown below.
    # A function to test the file extension of a file
    def checkFileExtension(self, filename, extension):
        # Boolean variable to be returned by the function
        foundExtension = False
        # Split the filename into two parts (name + ext)
        filenamesplit = os.path.splitext(filename)
        # Get the file extension into a variable
        fileExtension = filenamesplit[1].strip()
        # Decide whether extensions are equal
        if(fileExtension == extension):
            foundExtension = True
        # Return result
        return foundExtension

    # A function which iterates through the directory and checks file extensions
    def findFilesExt(self, directory, extension):
        # Define a list to store output list of files
        fileList = list()
        # Check whether the current directory exists
        if os.path.exists(directory):
            # Check whether the given directory is a directory
            if os.path.isdir(directory):
                # List all the files within the directory
                dirFileList = os.listdir(directory)
                # Loop through the individual files within the directory
                for filename in dirFileList:
                    # Check whether file is directory or file
                    if(os.path.isdir(os.path.join(directory,filename))):
                        print os.path.join(directory,filename) + \
                            ' is a directory and therefore ignored!'
                    elif(os.path.isfile(os.path.join(directory,filename))):
                        if(self.checkFileExtension(filename, extension)):
                            fileList.append(os.path.join(directory,filename))
                    else:
                        print filename + ' is NOT a file or directory!'
            else:
                print directory + ' is not a directory!'
        else:
            print directory + ' does not exist!'
        # Return the list of files
        return fileList
Note that you also need the function to check the file extension.
This can then be added to the mergeSHPfiles function with a list to iterate through the identified files:
    # A function to control the merging of shapefiles
    def mergeSHPfiles(self, filePath, newSHPfile):
        # Get the list of files within the directory
        # provided with the extension .shp
        fileList = self.findFilesExt(filePath, '.shp')
        # Iterate through the files.
        for file in fileList:
            print file
When iterating through the files the ogr2ogr commands that have to be executed to merge the shapefiles need to be built and executed. Therefore the following code needs to be added to your script:
    # A function to control the merging of shapefiles
    def mergeSHPfiles(self, filePath, newSHPfile):
        # Get the list of files within the directory
        # provided with the extension .shp
        fileList = self.findFilesExt(filePath, '.shp')
        # Variable used to identify the first file
        first = True
        # A string for the command to be built
        command = ''
        # Iterate through the files.
        for file in fileList:
            if first:
                # If the first file make a copy to create the output file
                command = 'ogr2ogr ' + newSHPfile + ' ' + file
                first = False
            else:
                # Otherwise append the current shapefile to the output file
                command = 'ogr2ogr -update -append ' + newSHPfile + ' ' + \
                    file + ' -nln ' + \
                    self.removeSHPExtension(self.removeFilePathWINS(newSHPfile))
            # Execute the current command
            os.system(command)
You also require two additional functions, given below, which together create the layer name by removing the shapefile extension (.shp) and the Windows file path:
    # A function to remove a .shp extension from a file name
    def removeSHPExtension(self, name):
        # The output file name
        outName = name
        # Find whether the '.shp' string is in the current file
        # name
        count = name.find('.shp', 0, len(name))
        # If there are no instances of .shp then -1 will be returned
        if not count == -1:
            # Replace all instances of .shp with an empty string.
            outName = name.replace('.shp', '', name.count('.shp'))
        # Return output file name without .shp
        return outName

    # A function to remove the file path from a file name
    # (in this case a windows file path)
    def removeFilePathWINS(self, name):
        # Remove white space (i.e., spaces, tabs)
        name = name.strip()
        # Count the number of backslashes
        # A double backslash is required because \ is a
        # string escape character.
        count = name.count('\\')
        # Split string into a list where backslashes occur
        nameSegments = name.split('\\', count)
        # Return the last item in the list
        return nameSegments[count]
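As an aside, the standard library's os.path module offers portable equivalents of these two helpers; the sketch below is not part of the course script, and the example path is just an illustration:

```python
import os

# os.path.splitext separates the extension, equivalent to
# removeSHPExtension for names containing a single '.shp'.
name = 'C:\\PythonCourse\\unit8\\Merged_shapefile.shp'
# On Windows os.path.basename strips the directory; on other
# platforms a Windows path must be split on '\\' by hand, which
# is what removeFilePathWINS does:
baseName = name.split('\\')[-1]
layerName = os.path.splitext(baseName)[0]
print(layerName)
```

This produces the layer name passed to the -nln option of ogr2ogr.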
You will find this script in the unit8.zip file which you can download from the Resources link at the top of this page.
If you wanted to use this script on UNIX (i.e., Linux or Mac OS X) you would need to change the removeFilePathWINS function such that the double backslashes were replaced with single forward slashes (i.e., /).
Your script should now be complete, so execute it on the data provided within the TreeCrowns directory. Take time to understand the lines of code which have been provided and make sure your script works.
You will find a model script file in the unit8.zip file which you can download from the Resources link at the top of this page.
8.3 Converting Images to GeoTIFF using GDAL
The next example will require you to use the script developed above as the basis for a new script to convert a directory of images to GeoTIFF using the command below:
gdal_translate -of <OutputFormat> <InputFile> <OutputFile>
A useful step is to first run the command from the command line manually to make sure you understand how this command is working.
The two main things you need to think about are:
- What file extension will the input files have? This should be user selectable alongside the file paths.
- What output file name should be provided? The script should generate this.
Download the unit8.zip file from the Resources link at the top of this page, extracting the subdirectories including the data/ENVI_Images directory. Four test images have been provided in ENVI format within the directory ENVI_Images. You can use these for testing your script. If you are struggling then an example script with a solution to this task has been provided as ConvertToGeoTiff.py.
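As a starting point, the command-building step can be sketched in isolation; the function name buildTranslateCommand and the convention of replacing the input extension with .tif are our own choices, not requirements of gdal_translate:

```python
import os

# Build the gdal_translate command for one input file. The
# output name is derived by swapping the extension for .tif;
# os.system(command) would then execute it, as in the merge script.
def buildTranslateCommand(inputFile, outputFormat='GTiff'):
    outputFile = os.path.splitext(inputFile)[0] + '.tif'
    return 'gdal_translate -of ' + outputFormat + ' ' + \
           inputFile + ' ' + outputFile

print(buildTranslateCommand('image1.img'))
```

Looping this over the list returned by findFilesExt gives the batch conversion.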
8.4 Passing Inputs from the Command Line
It is often convenient to provide the inputs the script requires (e.g., input and output file locations) as arguments to the script, rather than needing to edit the script each time a different set of parameters is required (i.e., changing the file paths in the scripts above). This is easy within Python and just requires the following changes to your run function (in this case for the merge shapefiles script).
    # A function which controls the rest of the script
    def run(self):
        # Get the number of arguments
        numArgs = len(sys.argv)
        # Check there are only 2 input arguments (i.e., the input
        # file path and output file).
        # Note that argument 0 (i.e., sys.argv[0]) is the name
        # of the script currently running.
        if numArgs == 3:
            # Retrieve the input directory
            filePath = sys.argv[1]
            # Retrieve the output file
            newSHPfile = sys.argv[2]
            # Check input file path exists and is a directory
            if not os.path.exists(filePath):
                print 'Filepath does not exist'
            elif not os.path.isdir(filePath):
                print 'Filepath is not a directory!'
            else:
                # Merge the shapefiles within the filePath
                self.mergeSHPfiles(filePath, newSHPfile)
        else:
            print "ERROR. Command should have the form:"
            print "python MergeSHPfiles_cmd.py <Input File Path> <Output File>"
In addition to these changes you need to import the system library into your script to access these arguments:
# Import the sys package from within the
# standard library
import sys
Please note that the list of user provided inputs starts at index 1 and not 0. If you call sys.argv[0] then the name of the script being executed will be returned. When retrieving values from the user in this form it is highly advisable to check whether the inputs provided are valid and that all required inputs have been provided.
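The indexing of sys.argv can be checked with a simulated argument list; the paths below are only examples:

```python
# Simulating a call such as:
#   python MergeSHPfiles_cmd.py C:\data\in C:\data\out.shp
# sys.argv would hold the script name followed by the user inputs.
argv = ['MergeSHPfiles_cmd.py', 'C:\\data\\in', 'C:\\data\\out.shp']
numArgs = len(argv)
print(numArgs)   # 3: the script name plus two user inputs
print(argv[0])   # the script name, not a user input
print(argv[1])   # the first user-supplied argument
```

This is why the run function above tests numArgs == 3 even though only two values are supplied by the user.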
Create a copy of the script you created earlier and edit the run function to be as shown above, making note of the lines that require editing.
8.5 Summary
You should now be able to perform command line functions using a Python script.
In addition you should have shown that you are able to apply scripts written previously to batch process files.
Exercises
- Using ogr2ogr, develop a script that will convert the attribute table of a shapefile to a CSV file which can be opened within Microsoft Excel. Note that the outputted CSV will be put into a separate directory.
- Create a script which calls the gdal_translate command and converts all the images within a directory to a byte data type (i.e., with a range of 0 to 255).
Unit 9. Image Processing
Level
Advanced
Time
This Unit should not take you more than 5 hours.
Learning Outcomes
By the end of this unit you should be able to:
- read spatial data from an image header
- read and write images from within Python
- calculate NDVI from an image and perform a simple rule based classification.
Further Reading
- GDAL – http://www.gdal.org/
- Python Documentation – http://www.python.org/doc/
- Core Python Programming (Second Edition), W.J. Chun, Prentice Hall ISBN 0-13-226993-7 (Also available online – http://www.network-theory.co.uk/docs/pytut/)
9.1 Introduction
GDAL (http://www.gdal.org) is an open source library for the input and output of remote sensing raster datasets. GDAL supports a wide range of data formats (http://www.gdal.org/formats_list.html), allowing both the image pixel values to be accessed and the associated spatial information to be read. Alongside GDAL, the OGR library (http://www.gdal.org/ogr) is available for the manipulation of vector datasets (details of supported formats can be seen at the following: http://www.gdal.org/ogr/ogr_formats.html). Both libraries are written in the C/C++ programming languages and are designed for use in applications created in these languages, but bindings for Python (and other programming languages) are also available, allowing the functionality to be used within Python scripts.
In this unit functions will not be explained since you are expected to look up the functions using the on-line documentation available on the Python website (http://docs.python.org/lib/lib.html) and to get used to using the documentation, which you will need to be able to do when you come to write your own scripts.
9.2 Dataset
For this unit a Landsat scene from the Landmap service is recommended for testing, as the band numbers are hard coded within the script. For the examples shown in this unit the scene over North Wales was used; path 204 and row 23 (Figure 9.1). The file in ERDAS Imagine format containing image bands 1–7 was downloaded and used.
Figure 9.1. The Landsat image (path 204 row 23) used for this Unit
9.3 Opening an Image with GDAL
The first part of the unit will provide a short demonstration of how to open a remote sensing raster dataset using GDAL and read the geo-information contained within the file. The script is outlined below. Take your time going through the script to ensure you understand it:
#! /usr/bin/env python
#######################################
# GDALTest.py
#
# A script using the GDAL Library to
# demonstrate how to open the file and
# view the geographic information
# associated with the file.
#
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

# Import the system library
import sys
# Import the GDAL Library as gdal.
import osgeo.gdal as gdal

# Define a class named GDALTest
class GDALTest (object):

    # Define a function to open the image
    # and print the spatial information
    # associated with the image.
    def openGDALFile(self, filePath):
        # Open the image as a read only image
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset has been successfully opened,
        # otherwise exit the script with an error message.
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Print the driver information being used
        print 'Driver: ', dataset.GetDriver().ShortName, '/', \
            dataset.GetDriver().LongName
        # Print the size of the image.
        print 'Size is ', dataset.RasterXSize, 'x', dataset.RasterYSize, \
            'x', dataset.RasterCount
        # Print the image projection.
        print 'Projection is ', dataset.GetProjection()
        # Get the geometric transformation
        geotransform = dataset.GetGeoTransform()
        # Check whether the image has a geometric transformation;
        # if a transformation is found print associated information.
        if not geotransform is None:
            print 'Origin = (', geotransform[0], ',', geotransform[3], ')'
            print 'Pixel Size = (', geotransform[1], ',', geotransform[5]*-1, ')'

    # A function to run the script.
    def run(self):
        filePath = "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999.img"
        self.openGDALFile(filePath)

# The starting point of the script.
if __name__ == '__main__':
    obj = GDALTest()
    obj.run()
Download the unit9.zip file from the Resources link at the top of this page, extracting the subdirectories including the data directory. A sample script GDALTest.py is included.
The script should produce a result similar to that shown below:
Driver: HFA / Erdas Imagine Images (.img)
Size is 10084 x 9364 x 6
Projection is PROJCS["Transverse Mercator",GEOGCS["Ord. Survey G. Britain 1936",
DATUM["Ord. Survey G. Britain 1936",
SPHEROID["Airy",6377563.396,299.3249753150316],TOWGS84[375,-111,431,0,0,0,0]],
PRIMEM["Greenwich",0],UNIT["degree",0.0174532925199433]],
PROJECTION["Transverse_Mercator"],PARAMETER["latitude_of_origin",49],
PARAMETER["central_meridian",-2],PARAMETER["scale_factor",0.9996012717],
PARAMETER["false_easting",400000],PARAMETER["false_northing",-100000],
UNIT["meters",1]]
Origin = ( 175100.0 , 473675.0 )
Pixel Size = ( 25.0 , 25.0 )
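The six geotransform values returned by GetGeoTransform() map pixel/line positions to map coordinates. Using the origin and pixel size printed above (the helper name pixelToMap is our own):

```python
# (origin x, pixel width, row rotation, origin y, column rotation,
# pixel height). Pixel height is negative because image lines run
# top-to-bottom while northings increase upwards.
geotransform = (175100.0, 25.0, 0.0, 473675.0, 0.0, -25.0)

def pixelToMap(pixel, line, gt):
    # The standard GDAL affine transform.
    x = gt[0] + pixel * gt[1] + line * gt[2]
    y = gt[3] + pixel * gt[4] + line * gt[5]
    return x, y

print(pixelToMap(0, 0, geotransform))     # the image origin
print(pixelToMap(100, 200, geotransform)) # 100 pixels east, 200 lines south
```

This is why the script prints geotransform[5]*-1 as the pixel size: the stored value is negative.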
9.4 Calculate NDVI using GDAL
Once you have understood the process of opening an image, the following exercise demonstrates how to use GDAL to access the pixel values within the image and use them in the calculation of a new image, in this case an image of the normalised difference vegetation index (NDVI). The first task is to provide the basic outline of the script, as shown below.
If you have not already done so, download the unit9.zip file from the Resources link at the top of this page, extracting the subdirectories including the data directory. A sample script GDALCalcNDVI.py is included. You may wish to rename this before creating your own script.
#! /usr/bin/env python
#######################################
# GDALCalcNDVI.py
#
# A script using the GDAL Library to
# create a new image containing the NDVI
# of the original image
#
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

# Import required libraries from python
import sys, os, struct
# Import gdal
import osgeo.gdal as gdal

# Define the class GDALCalcNDVI
class GDALCalcNDVI (object):

    # A function to create the output image
    def createOutputImage(self, outFilename, inDataset):
        pass # To be filled in below.

    # The function which loops through the input image and
    # calculates the output NDVI value to be outputted.
    def calcNDVI(self, filePath, outFilePath):
        # Open the inputted dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset
        outDataset = self.createOutputImage(outFilePath, dataset)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)

    # The function from which the script runs.
    def run(self):
        # Define the input and output images
        filePath = "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999.img"
        outFilePath = "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999_NDVI.tif"
        # Check the input file exists
        if os.path.exists(filePath):
            # Run calcNDVI function
            self.calcNDVI(filePath, outFilePath)
        else:
            print 'The file does not exist.'

# Start the script by calling the run function.
if __name__ == '__main__':
    obj = GDALCalcNDVI()
    obj.run()
Within the structure outlined above, the standard run function which defines the input and output images is created. The function that will calculate the NDVI is also defined. The script contains code to open the input image and call another defined function, which will create the output image.
Now the output image needs to be created using the input image to define the output image size and spatial reference. Edit the createOutputImage function so it is as shown below:
    # A function to create the output image
    def createOutputImage(self, outFilename, inDataset):
        # Define the image driver to be used
        # This defines the output file format (e.g., GeoTiff)
        driver = gdal.GetDriverByName( "GTiff" )
        # Check that this driver can create a new file.
        metadata = driver.GetMetadata()
        if metadata.has_key(gdal.DCAP_CREATE) and metadata[gdal.DCAP_CREATE] == 'YES':
            print 'Driver GTiff supports Create() method.'
        else:
            print 'Driver GTiff does not support Create()'
            sys.exit(-1)
        # Get the spatial information from the input file
        geoTransform = inDataset.GetGeoTransform()
        geoProjection = inDataset.GetProjection()
        # Create an output file of the same size as the inputted
        # image, but with only 1 output image band.
        newDataset = driver.Create(outFilename, inDataset.RasterXSize, \
            inDataset.RasterYSize, 1, gdal.GDT_Float32)
        # Define the spatial information for the new image.
        newDataset.SetGeoTransform(geoTransform)
        newDataset.SetProjection(geoProjection)
        return newDataset
The next step is to get the image bands required to calculate the NDVI. The NDVI is defined as
NDVI=(NIR-RED)/(NIR+RED)
Therefore, the first step in calculating it is to retrieve the RED and NIR bands from within the image, as shown below. The script has been hard coded with the band numbers within the image, where RED is band 3 and NIR is band 4. In the future, and to improve the script, you may wish to offer these as user defined parameters along with the filenames. Edit your script to include the extra lines shown below:
    # The function which loops through the input image and
    # calculates the output NDVI value to be outputted.
    def calcNDVI(self, filePath, outFilePath):
        # Open the inputted dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset
        outDataset = self.createOutputImage(outFilePath, dataset)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)
        # Get hold of the RED and NIR image bands from the image
        # Note that the image bands have been hard coded
        # in this case for the Landsat sensor. RED = 3
        # and NIR = 4; this might need to be changed if
        # data from another sensor were used.
        red_band = dataset.GetRasterBand(3) # RED BAND
        nir_band = dataset.GetRasterBand(4) # NIR BAND
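Before wiring the calculation into a loop, it is worth checking the NDVI formula on example values. The reflectances below are made up for illustration, and the zero-divide guard mirrors the one used later in the script:

```python
# NDVI = (NIR - RED) / (NIR + RED), guarding against a
# zero denominator as the full script does.
def calcPixelNDVI(nir, red):
    ndvi_lower = nir + red
    if ndvi_lower == 0:
        return 0
    return (nir - red) / ndvi_lower

print(calcPixelNDVI(0.5, 0.1))   # vegetation: strongly positive
print(calcPixelNDVI(0.1, 0.1))   # equal reflectance: zero
print(calcPixelNDVI(0.0, 0.0))   # the guarded zero-divide case
```

Healthy vegetation reflects strongly in the NIR and absorbs in the red, so it produces NDVI values towards +1.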
Now you have retrieved the image bands, the next step is to loop through the image and calculate the NDVI for each pixel. We do not want to load the whole image into the computer's memory in one go, but rather only load the part we are currently working on. This is because memory is a limited resource, and when dealing with very large datasets the computer may not contain enough memory to store all the data. For data to be processed they first have to be loaded into memory from the hard disk. To allow large images to be processed, and to minimise the memory load of the script, the NDVI calculation is performed on a line by line basis. This requires a loop to be set up and the number of lines within the image to be identified, as shown below:
    # The function which loops through the input image and
    # calculates the output NDVI value to be outputted.
    def calcNDVI(self, filePath, outFilePath):
        # Open the inputted dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset
        outDataset = self.createOutputImage(outFilePath, dataset)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)
        # Get hold of the RED and NIR image bands from the image
        # Note that the image bands have been hard coded
        # in this case for the Landsat sensor. RED = 3
        # and NIR = 4; this might need to be changed if
        # data from another sensor were used.
        red_band = dataset.GetRasterBand(3) # RED BAND
        nir_band = dataset.GetRasterBand(4) # NIR BAND
        # Retrieve the number of lines within the image
        numLines = red_band.YSize
        # Loop through each line in turn.
        for line in range(numLines):
The script then needs to read the pixel values for the current line for each band (NIR and RED) of the image, as shown below. The data are read as a string containing a binary representation of the image values that need to be converted to number values, in this case floating point values. Therefore, the struct package (see the Python online documentation) is used to convert the binary data of length red_band.XSize to a Python tuple:
    # The function which loops through the input image and
    # calculates the output NDVI value to be outputted.
    def calcNDVI(self, filePath, outFilePath):
        # Open the inputted dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset
        outDataset = self.createOutputImage(outFilePath, dataset)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)
        # Get hold of the RED and NIR image bands from the image
        # Note that the image bands have been hard coded
        # in this case for the Landsat sensor. RED = 3
        # and NIR = 4; this might need to be changed if
        # data from another sensor were used.
        red_band = dataset.GetRasterBand(3) # RED BAND
        nir_band = dataset.GetRasterBand(4) # NIR BAND
        # Retrieve the number of lines within the image
        numLines = red_band.YSize
        # Loop through each line in turn.
        for line in range(numLines):
            # Define variable for output line.
            outputLine = ''
            # Read in data for the current line from the
            # image band representing the red wavelength
            red_scanline = red_band.ReadRaster( 0, line, red_band.XSize, 1, \
                red_band.XSize, 1, gdal.GDT_Float32 )
            # Unpack the line of data to be read as floating point data
            red_tuple = struct.unpack('f' * red_band.XSize, red_scanline)
            # Read in data for the current line from the
            # image band representing the NIR wavelength
            nir_scanline = nir_band.ReadRaster( 0, line, nir_band.XSize, 1, \
                nir_band.XSize, 1, gdal.GDT_Float32 )
            # Unpack the line of data to be read as floating point data
            nir_tuple = struct.unpack('f' * nir_band.XSize, nir_scanline)
A tuple is similar to a list in the way the data are accessed but the data cannot be edited. The following code can be used to iterate through each line of data and calculate the NDVI value for each pixel (before printing to screen):
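The difference between a tuple and a list can be seen in a short example; the pixel values below are made up:

```python
# A tuple is indexed and measured just like a list...
pixels = (10.0, 20.0, 30.0)
print(pixels[1])
print(len(pixels))
# ...but assigning to an element raises a TypeError:
try:
    pixels[1] = 99.0
except TypeError:
    print('tuples cannot be modified')
```

This is why the unpacked scanline values are read from but never written to; the output values are built up separately.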
    # The function which loops through the input image and
    # calculates the output NDVI value to be outputted.
    def calcNDVI(self, filePath, outFilePath):
        # Open the inputted dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset
        outDataset = self.createOutputImage(outFilePath, dataset)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)
        # Get hold of the RED and NIR image bands from the image
        # Note that the image bands have been hard coded
        # in this case for the Landsat sensor. RED = 3
        # and NIR = 4; this might need to be changed if
        # data from another sensor were used.
        red_band = dataset.GetRasterBand(3) # RED BAND
        nir_band = dataset.GetRasterBand(4) # NIR BAND
        # Retrieve the number of lines within the image
        numLines = red_band.YSize
        # Loop through each line in turn.
        for line in range(numLines):
            # Define variable for output line.
            outputLine = ''
            # Read in data for the current line from the
            # image band representing the red wavelength
            red_scanline = red_band.ReadRaster( 0, line, red_band.XSize, 1, \
                red_band.XSize, 1, gdal.GDT_Float32 )
            # Unpack the line of data to be read as floating point data
            red_tuple = struct.unpack('f' * red_band.XSize, red_scanline)
            # Read in data for the current line from the
            # image band representing the NIR wavelength
            nir_scanline = nir_band.ReadRaster( 0, line, nir_band.XSize, 1, \
                nir_band.XSize, 1, gdal.GDT_Float32 )
            # Unpack the line of data to be read as floating point data
            nir_tuple = struct.unpack('f' * nir_band.XSize, nir_scanline)
            # Loop through the columns within the image
            for i in range(len(red_tuple)):
                # Calculate the NDVI for the current pixel.
                ndvi_lower = (nir_tuple[i] + red_tuple[i])
                ndvi_upper = (nir_tuple[i] - red_tuple[i])
                ndvi = 0
                # Be careful of zero divide
                if ndvi_lower == 0:
                    ndvi = 0
                else:
                    ndvi = ndvi_upper/ndvi_lower
                print ndvi
To write data to this file, the floating point NDVI value for each pixel needs to be converted to a binary string (the same form in which the data were read) using the struct.pack() method. As with reading, the data should be held in memory for as little time as possible, so the current line is written to the new file before the next line is read from the input file, meaning only three lines of image data (two input, one output) are in memory at any one time. The following code illustrates how the data are written out:
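The line-building step can be tried on its own, independent of GDAL. The NDVI values below are invented for the example; the pattern of concatenating struct.pack() results is the same as in the calcNDVI() function:

```python
import struct

# Start with an empty binary line, as in the calcNDVI() function.
outputLine = b''

# Suppose these NDVI values were calculated for one image line.
ndvi_values = [0.0, 0.25, 0.5, 0.75]

# Convert each float to its 4-byte binary form and append it,
# building the binary string that WriteRaster expects.
for ndvi in ndvi_values:
    outputLine = outputLine + struct.pack('f', ndvi)

print(len(outputLine))  # 4 values * 4 bytes each = 16 bytes
```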
# The function which loops through the input image and
# calculates the output NDVI value to be outputted.
def calcNDVI(self, filePath, outFilePath):
    # Open the input dataset
    dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
    # Check the dataset was successfully opened
    if dataset is None:
        print "The dataset could not be opened"
        sys.exit(-1)
    # Create the output dataset
    outDataset = self.createOutputImage(outFilePath, dataset)
    # Check the dataset was successfully created.
    if outDataset is None:
        print 'Could not create output image'
        sys.exit(-1)
    # Get hold of the RED and NIR image bands from the image
    # Note that the image bands have been hard coded
    # in this case for the Landsat sensor. RED = 3
    # and NIR = 4; this might need to be changed if
    # data from another sensor were used.
    red_band = dataset.GetRasterBand(3) # RED BAND
    nir_band = dataset.GetRasterBand(4) # NIR BAND
    # Retrieve the number of lines within the image
    numLines = red_band.YSize
    # Loop through each line in turn.
    for line in range(numLines):
        # Define variable for output line.
        outputLine = ''
        # Read in data for the current line from the
        # image band representing the red wavelength
        red_scanline = red_band.ReadRaster( 0, line, red_band.XSize, 1, \
            red_band.XSize, 1, gdal.GDT_Float32 )
        # Unpack the line of data to be read as floating point data
        red_tuple = struct.unpack('f' * red_band.XSize, red_scanline)
        # Read in data for the current line from the
        # image band representing the NIR wavelength
        nir_scanline = nir_band.ReadRaster( 0, line, nir_band.XSize, 1, \
            nir_band.XSize, 1, gdal.GDT_Float32 )
        # Unpack the line of data to be read as floating point data
        nir_tuple = struct.unpack('f' * nir_band.XSize, nir_scanline)
        # Loop through the columns within the image
        for i in range(len(red_tuple)):
            # Calculate the NDVI for the current pixel.
            ndvi_lower = (nir_tuple[i] + red_tuple[i])
            ndvi_upper = (nir_tuple[i] - red_tuple[i])
            ndvi = 0
            # Be careful of divide by zero
            if ndvi_lower == 0:
                ndvi = 0
            else:
                ndvi = ndvi_upper/ndvi_lower
            # Add the current pixel to the output line
            outputLine = outputLine + struct.pack('f', ndvi)
        # Write the completed line to the output image
        outDataset.GetRasterBand(1).WriteRaster(0, line, red_band.XSize, 1, \
            outputLine, buf_xsize=red_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Float32)
        # Delete the output line following write
        del outputLine
    print 'NDVI calculated and written to file'
This script should now be complete, so execute it on the input image and open the output image in ITT ENVI (or your preferred data viewer) to view the result (e.g., Figure 9.2). To check the result, calculate an NDVI within the software you are using and compare the pixel values. As you work through this code make sure you refer to the Python and GDAL documentation, since both provide information useful for understanding this example.
Figure 9.2. The calculated NDVI image
9.5 Create a Simple Rule Based Classifier
Once you have calculated the NDVI using the previous script you can use this layer as the basis of a simple land use classification. In this exercise a simple rule based classifier will be developed. By using a series of if-else statements the input data can be classified based on the NDVI value. The script will output two images. The first contains an integer value associated with each class while the second contains an image coloured according to the output class that can be used for visualisation.
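The rule base amounts to mapping an NDVI value to a class integer and an RGB colour through threshold tests. A simplified three-class sketch of that idea is shown below; the class labels in the comments are illustrative only, and the full script later in this section defines more classes and thresholds:

```python
def classify_ndvi(ndvi):
    """Return (class_id, (r, g, b)) for a single NDVI pixel value.

    A simplified three-class version of the rule base developed in
    this section; the full script uses more classes and thresholds.
    """
    if ndvi < 0.0:
        return 0, (200, 200, 200)   # e.g., non-vegetated / water
    elif ndvi < 0.5:
        return 1, (127, 255, 212)   # e.g., sparse vegetation
    else:
        return 2, (0, 139, 0)       # e.g., dense vegetation

# Classify a few sample pixel values.
for value in [-0.2, 0.3, 0.8]:
    print(value, classify_ndvi(value))
```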
To start the script you need the same outline as the previous script with some minor modifications. Firstly, the number of output images has been increased to two, and the function which creates the output datasets now takes a parameter for the number of output bands. The code outline is shown below:
#! /usr/bin/env python
#######################################
# GDALRuleClassifier.py
#
# A script using the GDAL Library to
# undertake a simple rule based
# classification on a single band
# image. The script outputs two images:
# the first in colour for visualisation
# and the second with the output classes.
#
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

# Import required libraries from python
import sys, os, struct
# Import gdal
import osgeo.gdal as gdal

# Define the GDALRuleClassifier class
class GDALRuleClassifier (object):

    # A function to create the output image with a given number of image bands.
    def createOutputImage(self, outFilename, inDataset, numOutBands):
        # Define the image driver to be used
        # This defines the output file format (e.g., GeoTiff)
        driver = gdal.GetDriverByName( "GTiff" )
        # Check that this driver can create a new file.
        metadata = driver.GetMetadata()
        if metadata.has_key(gdal.DCAP_CREATE) and metadata[gdal.DCAP_CREATE] == 'YES':
            print 'Driver GTiff supports Create() method.'
        else:
            print 'Driver GTiff does not support Create()'
            sys.exit(-1)
        # Get the spatial information from the input file
        geoTransform = inDataset.GetGeoTransform()
        geoProjection = inDataset.GetProjection()
        # Create an output file of the same size as the input
        # image but with numOutBands output image bands.
        newDataset = driver.Create(outFilename, inDataset.RasterXSize, \
            inDataset.RasterYSize, numOutBands, gdal.GDT_Float32)
        # Define the spatial information for the new image.
        newDataset.SetGeoTransform(geoTransform)
        newDataset.SetProjection(geoProjection)
        return newDataset

    # The function which runs the classification.
    def classifyImage(self, filePath, outFilePathQKL, outFilePathSpatial):
        # Open the input dataset
        dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
        # Check the dataset was successfully opened
        if dataset is None:
            print "The dataset could not be opened"
            sys.exit(-1)
        # Create the output dataset (Coloured Image)
        outDatasetQKL = self.createOutputImage(outFilePathQKL, dataset, 3)
        # Check the dataset was successfully created.
        if outDatasetQKL is None:
            print 'Could not create quicklook output image'
            sys.exit(-1)
        # Create the output dataset (Single band Image)
        outDataset = self.createOutputImage(outFilePathSpatial, dataset, 1)
        # Check the dataset was successfully created.
        if outDataset is None:
            print 'Could not create output image'
            sys.exit(-1)

    # The function from which the script runs.
    def run(self):
        # Define the input and output images
        filePath = "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999_NDVI.tif"
        outFilePathQKL = \
            "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999_NDVI_classQK.tif"
        outFilePathSpatial = \
            "C:\\PythonCourse\\unit9\\data\\orthol7_20423xs100999_NDVI_class.tif"
        # Check the input file exists
        if os.path.exists(filePath):
            # Run the classify image function
            self.classifyImage(filePath, outFilePathQKL, outFilePathSpatial)
        else:
            print 'The file does not exist.'

# Start the script by calling the run function.
if __name__ == '__main__':
    obj = GDALRuleClassifier()
    obj.run()
The next stage is to loop through the input image (a line at a time) and write data to the output images. Edit your code as shown below:
# The function which runs the classification.
def classifyImage(self, filePath, outFilePathQKL, outFilePathSpatial):
    # Open the input dataset
    dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
    # Check the dataset was successfully opened
    if dataset is None:
        print "The dataset could not be opened"
        sys.exit(-1)
    # Create the output dataset (Coloured Image)
    outDatasetQKL = self.createOutputImage(outFilePathQKL, dataset, 3)
    # Check the dataset was successfully created.
    if outDatasetQKL is None:
        print 'Could not create quicklook output image'
        sys.exit(-1)
    # Create the output dataset (Single band Image)
    outDataset = self.createOutputImage(outFilePathSpatial, dataset, 1)
    # Check the dataset was successfully created.
    if outDataset is None:
        print 'Could not create output image'
        sys.exit(-1)
    # Open the NDVI image band
    ndvi_band = dataset.GetRasterBand(1) # NDVI BAND
    numLines = ndvi_band.YSize
    # Define variables for pixel output
    outClass = 0
    red = 0
    green = 0
    blue = 0
    # Loop through the image lines
    for line in range(numLines):
        outputLine = ''
        outputLineR = ''
        outputLineG = ''
        outputLineB = ''
        # Read in data for the current line from the
        # image band representing the NDVI
        ndvi_scanline = ndvi_band.ReadRaster( 0, line, ndvi_band.XSize, 1, \
            ndvi_band.XSize, 1, gdal.GDT_Float32 )
        # Unpack the line of data to be read as floating point data
        ndvi_tuple = struct.unpack('f' * ndvi_band.XSize, ndvi_scanline)
        # Loop through the row and assess each pixel.
        for i in range(len(ndvi_tuple)):
            # Write default class and colour to output images
            outClass = 0 # Output class
            red = 0 # Quantity of Red
            green = 0 # Quantity of Green
            blue = 0 # Quantity of Blue
            # Add the current pixel values to the output lines
            outputLine = outputLine + struct.pack('f', outClass)
            outputLineR = outputLineR + struct.pack('B', red)
            outputLineG = outputLineG + struct.pack('B', green)
            outputLineB = outputLineB + struct.pack('B', blue)
        # Write the completed lines to the output images
        outDataset.GetRasterBand(1).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLine, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Float32)
        outDatasetQKL.GetRasterBand(1).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineR, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        outDatasetQKL.GetRasterBand(2).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineG, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        outDatasetQKL.GetRasterBand(3).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineB, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        # Delete the output lines following write
        del outputLine
        del outputLineR
        del outputLineG
        del outputLineB
    print 'Classification completed and written to file'
The final section is to add the decision rules (i.e., a series of if-else statements) to the loop such that each pixel is classified. Edit your code such that it is as shown below:
# The function which runs the classification.
def classifyImage(self, filePath, outFilePathQKL, outFilePathSpatial):
    # Open the input dataset
    dataset = gdal.Open( filePath, gdal.GA_ReadOnly )
    # Check the dataset was successfully opened
    if dataset is None:
        print "The dataset could not be opened"
        sys.exit(-1)
    # Create the output dataset (Coloured Image)
    outDatasetQKL = self.createOutputImage(outFilePathQKL, dataset, 3)
    # Check the dataset was successfully created.
    if outDatasetQKL is None:
        print 'Could not create quicklook output image'
        sys.exit(-1)
    # Create the output dataset (Single band Image)
    outDataset = self.createOutputImage(outFilePathSpatial, dataset, 1)
    # Check the dataset was successfully created.
    if outDataset is None:
        print 'Could not create output image'
        sys.exit(-1)
    # Open the NDVI image band
    ndvi_band = dataset.GetRasterBand(1) # NDVI BAND
    numLines = ndvi_band.YSize
    # Define variables for pixel output
    outClass = 0
    red = 0
    green = 0
    blue = 0
    # Loop through the image lines
    for line in range(numLines):
        outputLine = ''
        outputLineR = ''
        outputLineG = ''
        outputLineB = ''
        # Read in data for the current line from the
        # image band representing the NDVI
        ndvi_scanline = ndvi_band.ReadRaster( 0, line, ndvi_band.XSize, 1, \
            ndvi_band.XSize, 1, gdal.GDT_Float32 )
        # Unpack the line of data to be read as floating point data
        ndvi_tuple = struct.unpack('f' * ndvi_band.XSize, ndvi_scanline)
        # Loop through the row and assess each pixel.
        for i in range(len(ndvi_tuple)):
            # If statements are used to encode the rules.
            if ndvi_tuple[i] < 0:
                outClass = 0 # Output class
                red = 200 # Quantity of Red
                green = 200 # Quantity of Green
                blue = 200 # Quantity of Blue
            elif ndvi_tuple[i] >= 0.0 and ndvi_tuple[i] < 0.3:
                outClass = 1
                red = 127
                green = 255
                blue = 212
            elif ndvi_tuple[i] >= 0.3 and ndvi_tuple[i] < 0.4:
                outClass = 2
                red = 0
                green = 145
                blue = 255
            elif ndvi_tuple[i] >= 0.4 and ndvi_tuple[i] < 0.44:
                outClass = 3
                red = 62
                green = 174
                blue = 141
            elif ndvi_tuple[i] >= 0.44 and ndvi_tuple[i] < 0.5:
                outClass = 4
                red = 129
                green = 139
                blue = 21
            elif ndvi_tuple[i] >= 0.5 and ndvi_tuple[i] < 0.6:
                outClass = 5
                red = 0
                green = 139
                blue = 0
            elif ndvi_tuple[i] >= 0.6:
                outClass = 6
                red = 255
                green = 165
                blue = 0
            else:
                outClass = 7
                red = 0
                green = 0
                blue = 0
            # Add the current pixel values to the output lines
            outputLine = outputLine + struct.pack('f', outClass)
            outputLineR = outputLineR + struct.pack('B', red)
            outputLineG = outputLineG + struct.pack('B', green)
            outputLineB = outputLineB + struct.pack('B', blue)
        # Write the completed lines to the output images
        outDataset.GetRasterBand(1).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLine, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Float32)
        outDatasetQKL.GetRasterBand(1).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineR, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        outDatasetQKL.GetRasterBand(2).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineG, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        outDatasetQKL.GetRasterBand(3).WriteRaster(0, line, ndvi_band.XSize, 1, \
            outputLineB, buf_xsize=ndvi_band.XSize, \
            buf_ysize=1, buf_type=gdal.GDT_Byte)
        # Delete the output lines following write
        del outputLine
        del outputLineR
        del outputLineG
        del outputLineB
    print 'Classification completed and written to file'
By running this code you should now be able to produce the images shown in Figures 9.3 and 9.4.
Figure 9.3. The class image from the rule based classification of the NDVI
Figure 9.4. The coloured image from the rule based classification of the NDVI
9.6 Summary
Having undertaken this Unit you should be able to read spatial data from an image header and know how to read and write images using Python.
You should also be able to calculate NDVI from image data and perform a simple rule based classification.
Exercises
- Write a script which adds image bands from two separate images together to create a single output image. As part of this process you may want to check that the images overlap exactly (i.e., they cover the same geographic area).
- Use the original Landsat image and produce a simple rule based classification of the scene using more than one image band. Note, if you have already used Definiens eCognition you can use the same techniques for finding thresholds.
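For the first exercise, one way to test that two images cover exactly the same geographic area is to compare their geotransforms and pixel dimensions. The sketch below works on plain six-element tuples, since that is exactly what a GDAL dataset's GetGeoTransform() returns; the sample values are invented for illustration:

```python
def same_extent(geo1, size1, geo2, size2, tol=1e-6):
    """Check two images cover the same geographic area.

    geo1/geo2 are six-element geotransform tuples (as returned by
    GDAL's GetGeoTransform()); size1/size2 are (XSize, YSize) pairs.
    """
    if size1 != size2:
        return False
    # All six geotransform coefficients must match within tolerance.
    return all(abs(a - b) < tol for a, b in zip(geo1, geo2))

# Two images with identical origin, pixel size and dimensions overlap exactly.
gt = (350000.0, 30.0, 0.0, 250000.0, 0.0, -30.0)
print(same_extent(gt, (1000, 1000), gt, (1000, 1000)))  # True
print(same_extent(gt, (1000, 1000), gt, (1000, 999)))   # False
```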
Unit 10. LiDAR
Level
Advanced
Time
This Unit should not take you more than 6 hours.
Prerequisites
In addition to the general requirement for this course of a good working knowledge of remote sensing and image analysis, you should have some understanding of LiDAR remote sensing. This can be gained by undertaking the Landmap course on LiDAR.
Learning Outcomes
By the end of this unit you should be able to:
- use Python to read LiDAR data from an ASCII text file
- grid LiDAR data, forming the basis for further processing
- visualise the LiDAR data using matplotlib.
Further Reading
- Python Documentation – http://www.python.org/doc/
- Core Python Programming (Second Edition), W.J. Chun, Prentice Hall ISBN 0-13-226993-7 (Also available online – http://www.network-theory.co.uk/docs/pytut/)
- MatPlotLib – http://matplotlib.sourceforge.net/
10.1 Basics
LiDAR is a 3D optical remote sensing system that records either discrete returns or a profile within a volume. This tutorial is concerned with discrete return data consisting of a first and last return. The sample data was supplied by NERC as a text file with the following format, where each entry is space separated:
Time | First (Eastings, Northings, Height, Intensity) | Last (Eastings, Northings, Height, Intensity)
Reading LiDAR data: splitting first and last returns
The first part of this exercise is to write the sample data into two lists (first and last returns) where each point is represented by the following LiDARPoint class (save the file as LiDARPoint.py). You will find a model file of the same name when you extract the contents of the unit10.zip material downloaded from the Resources link at the top of this page; rename one of these files if necessary to avoid overwriting.
#######################################
# A python class to represent a 3D
# LiDAR point.
#
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

# Import the sqrt function from the math module
# (part of the python standard library)
from math import sqrt

# Define a class to represent a LiDAR point
class LiDARPoint(object):
    # Define the attributes for the class
    time = 0
    eastings = 0
    northings = 0
    height = 0
    intensity = 0

    # Initialise the attribute starting values.
    def __init__(self):
        self.time = 0
        self.eastings = 0
        self.northings = 0
        self.height = 0
        self.intensity = 0

    # Set the time attribute value
    def setTime(self, inTime):
        self.time = inTime

    # Get the value of the time attribute
    def getTime(self):
        return self.time

    # Set the eastings attribute value
    def setEastings(self, inEastings):
        self.eastings = inEastings

    # Get the value of the eastings attribute
    def getEastings(self):
        return self.eastings

    # Set the northings attribute value
    def setNorthings(self, inNorthings):
        self.northings = inNorthings

    # Get the value of the northings attribute
    def getNorthings(self):
        return self.northings

    # Set the height attribute value
    def setHeight(self, inHeight):
        self.height = inHeight

    # Get the value of the height attribute
    def getHeight(self):
        return self.height

    # Set the intensity attribute value
    def setIntensity(self, inIntensity):
        self.intensity = inIntensity

    # Get the value of the intensity attribute
    def getIntensity(self):
        return self.intensity

    # Output a formatted string of this point
    def toString(self):
        outString = str(self.time) + "," \
            + str(self.eastings) + "," \
            + str(self.northings) + "," \
            + str(self.height) + "," \
            + str(self.intensity)
        return outString

    # Return the euclidean distance between
    # LiDARPoints in 3D space
    def getDistance3D(self, lidarPt):
        diffX = self.eastings - lidarPt.eastings
        diffY = self.northings - lidarPt.northings
        diffZ = self.height - lidarPt.height
        diffXSq = diffX * diffX
        diffYSq = diffY * diffY
        diffZSq = diffZ * diffZ
        diffSq = diffXSq + diffYSq + diffZSq
        dist = sqrt(diffSq)
        return dist

    # Return the euclidean distance between
    # LiDARPoints in 2D space (X and Y)
    def getDistance2D(self, lidarPt):
        diffX = self.eastings - lidarPt.eastings
        diffY = self.northings - lidarPt.northings
        diffXSq = diffX * diffX
        diffYSq = diffY * diffY
        diffSq = diffXSq + diffYSq
        dist = sqrt(diffSq)
        return dist
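Each line of the text file maps onto these attributes in order. The stand-alone snippet below shows how one line splits into the time field plus the four fields for each return; the sample line and its values are invented for illustration, and plain dictionaries are used so the snippet runs on its own:

```python
# One made-up line in the format described above: time, then
# eastings/northings/height/intensity for the first return,
# then the same four fields for the last return.
line = "123456.7 395000.1 565000.2 45.3 120 395000.1 565000.2 12.8 80"

fields = line.split()          # split on whitespace
time = float(fields[0])
first = {"eastings": float(fields[1]), "northings": float(fields[2]),
         "height": float(fields[3]), "intensity": float(fields[4])}
last = {"eastings": float(fields[5]), "northings": float(fields[6]),
        "height": float(fields[7]), "intensity": float(fields[8])}
print(time, first["height"], last["height"])
```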
The class to parse the text file should have the following structure and be saved as LiDARProcessor.py where you are required to write the code for the parseLiDARData() function:
#######################################
# A python class to process LiDAR data
#
# Author: <YOUR NAME>
# Email: <YOUR EMAIL>
# Date: DD/MM/YYYY
# Version: 1.0
#######################################

# Import the LiDARPoint class from the
# LiDARPoint file
from LiDARPoint import LiDARPoint
# Import the path module from the os package
# within the standard library
import os.path
# Import the sys package from within the
# standard library
import sys

# Define a class to process LiDAR data
class LiDARProcessor (object):

    # A string tokenizer - this is a piece of general code
    # that allows the parsing of any line of text with
    # a given delimiter.
    def stringTokenizer(self, line, delimiter):
        tokens = list()
        token = str()
        for i in range(len(line)):
            if line[i] == delimiter and len(token) > 0:
                tokens.append(token)
                token = str()
            elif line[i] != delimiter:
                token = token + line[i]
        if len(token) > 0:
            tokens.append(token)
        return tokens

    # A function to parse the input LiDAR file.
    def parseLiDARData(self, dataFile, firstReturns, lastReturns):
        # Iterate through the lines of the file
        #   Use the tokeniser to split the line into individual tokens
        #   If 9 tokens then create first and last returns
        #   Else if 5 tokens create a first return only
        #   Else print an error message
        pass

    # Execute the program
    def run(self):
        # Specify the input file name
        filename = sys.argv[1]
        # Check that the file path exists
        if os.path.exists(filename):
            # Create lists for the point data
            firstReturns = list()
            lastReturns = list()
            try:
                # Open the data file
                dataFile = open(filename, 'r')
            except IOError, e:
                print '\nCould not open file:\n', e
                return
            self.parseLiDARData(dataFile, firstReturns, lastReturns)
            dataFile.close()
            print "There are " + str(len(firstReturns)) + " first " \
                + "returns and " + str(len(lastReturns)) + " last " \
                + "returns."
        else:
            print 'File \'' + filename + '\' does not exist.'

# Executed if the class is executed from the command line.
if __name__ == '__main__':
    obj = LiDARProcessor()
    obj.run()
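If you want to check your parseLiDARData() logic, one possible sketch is shown below. It is deliberately self-contained: a minimal stand-in Point class replaces LiDARPoint, the sample lines are invented, and line.split() is used for tokenising (called with no arguments it behaves like the stringTokenizer() above for space-delimited text):

```python
class Point(object):
    # Minimal stand-in for the LiDARPoint class used in the course.
    def __init__(self, time, e, n, h, i):
        self.time, self.eastings, self.northings = time, e, n
        self.height, self.intensity = h, i

def parse_lidar_data(lines, first_returns, last_returns):
    for line in lines:
        tokens = line.split()
        if len(tokens) == 9:
            # Time plus two full returns: first and last.
            t = float(tokens[0])
            first_returns.append(Point(t, float(tokens[1]), float(tokens[2]),
                                       float(tokens[3]), float(tokens[4])))
            last_returns.append(Point(t, float(tokens[5]), float(tokens[6]),
                                      float(tokens[7]), float(tokens[8])))
        elif len(tokens) == 5:
            # Time plus a single (first) return only.
            t = float(tokens[0])
            first_returns.append(Point(t, float(tokens[1]), float(tokens[2]),
                                       float(tokens[3]), float(tokens[4])))
        elif len(tokens) > 0:
            print("Unrecognised line with", len(tokens), "tokens")

first, last = [], []
parse_lidar_data(["1.0 10 20 5.5 100 10 20 1.1 80", "2.0 11 21 6.0 90"],
                 first, last)
print(len(first), len(last))  # 2 1
```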
Please note that when you run this code the entire point cloud is read into the memory (RAM) of the computer, so you need sufficient memory to hold the point cloud. For the sample data you need at least 500 MB of free memory on the machine you are using.
Following successful implementation you should have been presented with the following output:
> python LiDARProcessor.py Str_395_subset.all
There are 99822 first returns and 99242 last returns.
Now write and call an additional function to export the first and last return data to separate comma separated files, where the base path for the files is input from the command line:
> python LiDARProcessor.py Str_395_subset.all Str_395_output
The output of this command should be two comma separated text files containing the first and last returns, respectively. The function skeleton is shown below:
# Export the first and last returns as separate files
def exportFirstLastPoints(self, firstReturns, lastReturns, path):
    # Create file names for the output files (first and last)
    # Open the output file for first returns
    # Iterate through all points and write 1 line per point
    # Close the file.
    # Open the output file for last returns
    # Iterate through all points and write 1 line per point
    # Close the file.
    pass
Download the unit10.zip file from the Resources link at the top of this page, extracting the subdirectories including the data directory. Sample scripts are included, so take care not to overwrite your own files.
10.2 Gridding LiDAR data
The first step in many LiDAR processing algorithms is to grid the LiDAR data such that each item within the dataset is associated with a grid cell; an image is a form of gridded data.
The first steps in any gridding application are to:
- Identify extent of the dataset to be gridded (i.e., maximum and minimum eastings and northings)
- Define an appropriate grid cell size (to be added as a command line option)
- Create the memory data structure within which the gridded data are to be stored
- Sort the data into the grid data structure.
Now copy the following functions into your script and attempt to implement the missing parts of calcBBox() and createGrid() yourself before moving on to the answers further down:
# A function that calculates the bounding box for the data
# Output: [minX, maxX, minY, maxY]
def calcBBox(self, ptData):
    # Initialise variables
    minX = 0
    maxX = 0
    minY = 0
    maxY = 0
    firstIter = True
    # Loop through the data
    #   If the first point, initialise to the first point
    #   Otherwise, update the min or max if the bbox is increased.
    # Create the list to be outputted and set the values.
    bbox = list()
    bbox.append(minX)
    bbox.append(maxX)
    bbox.append(minY)
    bbox.append(maxY)
    return bbox

# Create grid data structure
# The grid variable passed into the function
# will contain the grid data structure
# Output: [numXCells, numYCells]
def createGrid(self, bbox, gridSize, grid):
    # Calculate the grid width and height
    # and the number of cells (i.e.,
    # divide by the grid cell size).
    # Note: you need to add one to take
    # rounding errors into account.
    # Iterate through the number of Y cells
    for i in range(numYCells):
        # Append an empty list to the grid
        for j in range(numXCells):
            # Append an empty list to grid[i]
            pass
    # Create a list with the elements:
    # 0: Number of X cells
    # 1: Number of Y cells
    # Return the list

# A function which populates a grid with the
# point data from the ptData list.
def populateGrid(self, bbox, grid, cells, gridSize, ptData):
    # Define the bounding box for the first cell
    # in the output grid
    # NOTE: processing starts at the Top Left and
    # moves down the scene to the Bottom Right
    eastingsStart = bbox[0] # Left
    eastingsEnd = bbox[0] + gridSize # Right
    northingsStart = bbox[3] # Top
    northingsEnd = bbox[3] - gridSize # Bottom
    ptCount = 0
    # Iterate through the grid starting with the Y axis
    for i in range(cells[1]):
        # Reset eastings for the start of the row
        eastingsStart = bbox[0]
        eastingsEnd = bbox[0] + gridSize
        # Give the user some feedback as processing may take a while
        print i, " of ", cells[1]
        # Iterate along the row in the X axis
        for j in range(cells[0]):
            # The current point's index
            ptCount = 0
            # Iterate through the remaining points (a while loop is
            # used because points are removed from the list as they
            # are assigned to cells)
            while ptCount < len(ptData):
                pt = ptData[ptCount]
                # Check whether the current point is within
                # the current grid cell
                if ((pt.getEastings() >= eastingsStart) and \
                    (pt.getEastings() < eastingsEnd) and \
                    (pt.getNorthings() <= northingsStart) and \
                    (pt.getNorthings() > northingsEnd)):
                    # If yes then append it to the current cell
                    grid[i][j].append(pt)
                    # and remove it from the list of all points
                    ptData.pop(ptCount)
                else:
                    ptCount = ptCount + 1
            # Increment the bbox of the cell to the next in the row
            eastingsStart = eastingsStart + gridSize
            eastingsEnd = eastingsStart + gridSize
        # Increment the bbox of the row to the next row
        northingsStart = northingsStart - gridSize
        northingsEnd = northingsEnd - gridSize
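The populateGrid() approach scans the remaining point list once per cell, which can be slow for large point clouds. An alternative, offered here as an optional optimisation rather than as part of the course code, computes each point's cell indices directly from its coordinates so that every point is visited exactly once. It is written against simple attributes; with the LiDARPoint class you would substitute pt.getEastings() and pt.getNorthings():

```python
def populate_grid_fast(bbox, grid, cells, grid_size, pt_data):
    # bbox = [minX, maxX, minY, maxY]; grid[row][col] lists are appended to.
    for pt in pt_data:
        col = int((pt.eastings - bbox[0]) / grid_size)
        # Row 0 is the top of the scene, so measure down from maxY.
        row = int((bbox[3] - pt.northings) / grid_size)
        # Clamp points on the extreme edge into the last cell.
        col = min(col, cells[0] - 1)
        row = min(row, cells[1] - 1)
        grid[row][col].append(pt)

class Pt(object):
    # Minimal point for demonstration only.
    def __init__(self, e, n):
        self.eastings, self.northings = e, n

bbox = [0.0, 20.0, 0.0, 20.0]          # minX, maxX, minY, maxY
cells = [2, 2]                          # a 2 x 2 grid of 10 m cells
grid = [[[] for _ in range(cells[0])] for _ in range(cells[1])]
populate_grid_fast(bbox, grid, cells, 10.0, [Pt(5, 15), Pt(15, 5)])
print(len(grid[0][0]), len(grid[1][1]))  # 1 1
```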
Check your implementations of calcBBox() and createGrid() against those shown below. If your answer is very different observe where the differences are and check whether your code produces the same result – just because it is different does not mean it is wrong. If you were unable to create your own implementations of these functions try to understand how these functions, shown below, are working.
# A function that calculates the bounding box for the data
# Output: [minX, maxX, minY, maxY]
def calcBBox(self, ptData):
    # Initialise variables
    minX = 0
    maxX = 0
    minY = 0
    maxY = 0
    firstIter = True
    # Loop through the data
    for pt in ptData:
        # If the first point, initialise to the first point
        if firstIter:
            minX = pt.getEastings()
            maxX = pt.getEastings()
            minY = pt.getNorthings()
            maxY = pt.getNorthings()
            firstIter = False
        else:
            # Set min or max if the bbox is increased.
            if pt.getEastings() < minX:
                minX = pt.getEastings()
            elif pt.getEastings() > maxX:
                maxX = pt.getEastings()
            if pt.getNorthings() < minY:
                minY = pt.getNorthings()
            elif pt.getNorthings() > maxY:
                maxY = pt.getNorthings()
    # Create the list to be outputted and set the values.
    bbox = list()
    bbox.append(minX)
    bbox.append(maxX)
    bbox.append(minY)
    bbox.append(maxY)
    return bbox

# Create grid data structure
# The grid variable passed into the function
# will contain the grid data structure
# Output: [numXCells, numYCells]
def createGrid(self, bbox, gridSize, grid):
    width = int((bbox[1] - bbox[0])+1)
    height = int((bbox[3] - bbox[2])+1)
    numXCells = int((width/gridSize)+1)
    numYCells = int((height/gridSize)+1)
    for i in range(numYCells):
        grid.append(list())
        for j in range(numXCells):
            grid[i].append(list())
    cells = list()
    cells.append(numXCells)
    cells.append(numYCells)
    return cells
Now that you have gridded the data, metrics (such as the mean, minimum and maximum height) can be calculated for each grid cell, and these values can form the basis of a number of LiDAR data processing algorithms.
10.3 Visualisation
Images
Now you have loaded and gridded the LiDAR data you will visualise the data using the matplotlib library, used earlier. Initially, you will be visualising the mean height and intensity using a 10 m grid for testing (faster to compute) before producing the final result using a 5 m grid. To undertake this you need to create the following functions and add them to your LiDARProcessor class:
# Calculate the mean height for each grid cell
# Output: grid[y][x] of values
def calcMeanHeight(self, gridPts, cells):
    # Create the list structure to be returned
    meanHeightGrid = list()
    # Iterate down the rows (Y axis)
    #   Append a list to create the row
    #   Iterate along the row
    #     Iterate through the points in
    #     the cell, calculate the mean
    #     value and append it to the grid
    #     to be returned
    return meanHeightGrid

# Calculate the mean intensity for each grid cell
# Output: grid[y][x] of values
def calcMeanIntensity(self, gridPts, cells):
    # Create the list structure to be returned
    meanIntensityGrid = list()
    # Iterate down the rows (Y axis)
    #   Append a list to create the row
    #   Iterate along the row
    #     Iterate through the points in
    #     the cell, calculate the mean
    #     value and append it to the grid
    #     to be returned
    return meanIntensityGrid
Again, check your implementations against those shown below and if you could not work out your own implementations study those provided in detail.
# Calculate the mean height for each grid cell
# Output: grid[y][x] of values
def calcMeanHeight(self, gridPts, cells):
    meanGrid = list()
    meanSum = 0
    for i in range(cells[1]):
        meanGrid.append(list())
        for j in range(cells[0]):
            meanSum = 0
            for pt in gridPts[i][j]:
                meanSum = meanSum + pt.getHeight()
            # Guard against empty cells (avoid divide by zero)
            if len(gridPts[i][j]) == 0:
                meanGrid[i].append(0)
            else:
                meanGrid[i].append(meanSum/len(gridPts[i][j]))
    return meanGrid

# Calculate the mean intensity for each grid cell
# Output: grid[y][x] of values
def calcMeanIntensity(self, gridPts, cells):
    meanGrid = list()
    meanSum = 0
    for i in range(cells[1]):
        meanGrid.append(list())
        for j in range(cells[0]):
            meanSum = 0
            for pt in gridPts[i][j]:
                meanSum = meanSum + pt.getIntensity()
            # Guard against empty cells (avoid divide by zero)
            if len(gridPts[i][j]) == 0:
                meanGrid[i].append(0)
            else:
                meanGrid[i].append(meanSum/len(gridPts[i][j]))
    return meanGrid
Once you have created these functions you can add the following function to your class which when called will visualise your LiDAR data and save it to file.
# This is a function which plots a grid of data grid[Y][X] as
# an image.
# Output: A PNG image saved to disk.
def plotImageGrid(self, grid, outFilename, titletext):
    title(titletext)
    imshow(grid, cmap=cm.jet)
    axis('off')
    savefig(outFilename, dpi=300, format='PNG')
Note. You need to import matplotlib to use this function:
# Import all from the pylab library (plotting)
from pylab import *
Using a 5 m grid the following images should be created for the first return data (resized here to fit the web page):
And the last return data:
10.4 Scatter Plots
The next exercise takes you through the process of visualising a region of the point cloud using two of its axes (e.g., X and Z or Y and Z). The function below provides the code to visualise this data using matplotlib:
# This is a function that takes two datasets
# (i.e., first returns and last returns) and
# plots them onto the same axes.
def plotScatterPlot(self, dataXFirst, dataZFirst, dataXLast, dataZLast, outFilename, titletext):
    fig = figure()
    title(titletext)
    scatter(dataXFirst, dataZFirst, color='black', marker='o')
    scatter(dataXLast, dataZLast, color='red', marker='o')
    grid()
    fig.savefig(outFilename, dpi=300, format='PNG')
To use this function you need to extract a region of the point cloud, ignoring one of the axes (e.g., Y) and plotting only the other two (e.g., X, Z). Therefore, another function needs to be written to extract these data. Try your own implementation before looking at the answer below.
# Extract a slice through the point cloud in the X axis,
# ignoring the Y axis, where the slice refers to a row.
def getSlice(self, grid, slice, dataX, dataZ):
    numRows = len(grid)
    if ((slice >= 0) and (slice < numRows)):
        # Iterate through row of grid
        # Add X and Z values for point to
        # appropriate lists
    else:
        print "The slice is not within the scene."
One possible implementation of this function is shown below:
# Extract a slice through the point cloud in the X axis,
# ignoring the Y axis, where the slice refers to a row.
def getSlice(self, grid, slice, dataX, dataZ):
    numRows = len(grid)
    if ((slice >= 0) and (slice < numRows)):
        for i in range(len(grid[slice])):
            for pt in grid[slice][i]:
                dataX.append(pt.getEastings())
                dataZ.append(pt.getHeight())
    else:
        print "The slice is not within the scene."
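To see exactly what this slicing logic gathers, it can be tried in isolation. The sketch below uses a minimal stand-in point class (`DemoPoint` is hypothetical, invented for illustration; it only mimics the `getEastings` and `getHeight` methods the course's point class provides) and a plain-function copy of the same logic:

```python
# Minimal stand-in for the course's point class (hypothetical).
class DemoPoint:
    def __init__(self, eastings, height):
        self.eastings = eastings
        self.height = height
    def getEastings(self):
        return self.eastings
    def getHeight(self):
        return self.height

# The same slicing logic as getSlice, written as a plain function.
def getSliceDemo(grid, row, dataX, dataZ):
    if (row >= 0) and (row < len(grid)):
        for cell in grid[row]:
            for pt in cell:
                dataX.append(pt.getEastings())
                dataZ.append(pt.getHeight())
    else:
        print("The slice is not within the scene.")

# Build a tiny 2 x 2 grid of cells and extract row 1.
grid = [[[DemoPoint(100.0, 5.0)], []],
        [[DemoPoint(105.0, 7.0), DemoPoint(110.0, 2.0)], []]]
dataX = list()
dataZ = list()
getSliceDemo(grid, 1, dataX, dataZ)
print(dataX)  # [105.0, 110.0]
print(dataZ)  # [7.0, 2.0]
```

Note that the output lists are passed in and appended to, matching how the course code collects results, rather than being returned.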
Using a 5 m grid the following plots were produced for slices 2 and 10, where the last return is represented in red and the first in black. Slice 2 illustrates the difference between the returns over the trees, where the first return comes from the top of the canopy while the last return penetrates further into the canopy volume.
Once complete, your run function should look similar to the example shown below; note that the file paths have been hardcoded rather than retrieved from the user at run time:
# Execute the program
def run(self):
    # Specify input file name
    filename = "C:\\PythonCourse\\unit10\\data\\Str_395_subset.all"
    # Specify output base path
    outputBase = "C:\\PythonCourse\\unit10\\"
    # Specify grid size
    gridSize = 10
    # Check that the file path exists
    if os.path.exists(filename):
        # Create lists for point data
        firstReturns = list()
        lastReturns = list()
        try:
            # Open the data file
            dataFile = open(filename, 'r')
        except IOError, e:
            print '\nERROR: Could not open file:\n', e
            return
        self.parseLiDARData(dataFile, firstReturns, lastReturns)
        dataFile.close()
        print "There are " + str(len(firstReturns)) + " first " \
            + "returns and " + str(len(lastReturns)) + " last " \
            + "returns."
        self.exportFirstLastPoints(firstReturns, lastReturns, \
                                   outputBase)
        bboxFirst = self.calcBBox(firstReturns)
        bboxLast = self.calcBBox(lastReturns)
        print "First BBOX: " + str(bboxFirst)
        print "Width = " + str(bboxFirst[1] - bboxFirst[0])
        print "Height = " + str(bboxFirst[3] - bboxFirst[2])
        print "Last BBOX: " + str(bboxLast)
        print "Width = " + str(bboxLast[1] - bboxLast[0])
        print "Height = " + str(bboxLast[3] - bboxLast[2])
        print "Creating Grid"
        gridFirst = list()
        cellsFirst = self.createGrid(bboxFirst, gridSize, gridFirst)
        print cellsFirst
        gridLast = list()
        cellsLast = self.createGrid(bboxLast, gridSize, gridLast)
        print cellsLast
        print "Populating Grid"
        print "First:"
        self.populateGrid(bboxFirst, gridFirst, cellsFirst, gridSize, \
                          firstReturns)
        print "Last:"
        self.populateGrid(bboxLast, gridLast, cellsLast, gridSize, \
                          lastReturns)
        print "Calculating Mean Height and Intensity grids"
        meanHeightGridFirst = self.calcMeanHeight(gridFirst, cellsFirst)
        meanIntensityGridFirst = self.calcMeanIntensity(gridFirst, \
                                                        cellsFirst)
        meanHeightGridLast = self.calcMeanHeight(gridLast, cellsLast)
        meanIntensityGridLast = self.calcMeanIntensity(gridLast, \
                                                       cellsLast)
        print "Plot mean height and intensity grids"
        self.plotImageGrid(meanHeightGridFirst, "meanHeightFirst.png", \
                           "Mean Height (First Returns)")
        self.plotImageGrid(meanIntensityGridFirst, \
                           "meanIntensityFirst.png", "Mean Intensity (First Returns)")
        self.plotImageGrid(meanHeightGridLast, "meanHeightLast.png", \
                           "Mean Height (Last Returns)")
        self.plotImageGrid(meanIntensityGridLast, \
                           "meanIntensityLast.png", "Mean Intensity (Last Returns)")
        print "Plot scatter plots"
        dataXFirstS2 = list()
        dataZFirstS2 = list()
        dataXLastS2 = list()
        dataZLastS2 = list()
        self.getSlice(gridFirst, 2, dataXFirstS2, dataZFirstS2)
        self.getSlice(gridLast, 2, dataXLastS2, dataZLastS2)
        self.plotScatterPlot(dataXFirstS2, dataZFirstS2, dataXLastS2, \
                             dataZLastS2, "slice2scatter.png", "Slice 2 Scatter")
        dataXFirstS10 = list()
        dataZFirstS10 = list()
        dataXLastS10 = list()
        dataZLastS10 = list()
        self.getSlice(gridFirst, 10, dataXFirstS10, dataZFirstS10)
        self.getSlice(gridLast, 10, dataXLastS10, dataZLastS10)
        self.plotScatterPlot(dataXFirstS10, dataZFirstS10, \
                             dataXLastS10, dataZLastS10, "slice10scatter.png", \
                             "Slice 10 Scatter")
    else:
        print 'File \'' + filename + '\' does not exist.'
10.5 Summary
You have used Python to read LiDAR data from an ASCII text file and to grid LiDAR data, forming the basis for further processing.
You are able to visualise the LiDAR data using matplotlib.
Exercises
- Tidy up the plots that are created to improve the labelling and formatting.
- Extend the code to iterate through the rows and export a plot for each row in the grid.