MOVIE_DATA_REFORMAT
Reformat Movie Data to CSV Format


MOVIE_DATA_REFORMAT is a FORTRAN90 program which reformats movie data, copied from THE NUMBERS movie data web page, into a CSV format suitable for input to a statistical analysis package such as "R".

A browser can copy the tabular movie data for one year but stores the copied data as a list, one item per line.

This program puts all the data for a movie on one line, separating data items by commas, quoting string data, removing dollar signs and commas from numbers.

Empty string data is set to "?".

It also removes control characters.

Usage:

movie_data_reformat movie_data_file
reads data items from movie_data_file, and creates consecutive records of 9 data items per line, each associated with a particular movie. The output file has the same "first name" as the input file, but now has the extension ".csv", for "Comma Separated Values".

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Languages:

MOVIE_DATA_REFORMAT is available in a FORTRAN90 version.

Related Data and Programs:

CSV, a data directory which contains examples of Comma Separated Value files;

CSV_IO, a FORTRAN90 library which reads and writes files that use the CSV, or comma separated value, format.

CSV_IO, FORTRAN90 programs which illustrate how to read or write data in the CSV (comma separated value) format.

FILUM, a FORTRAN90 library which can work with information in text files.

REFORMAT, a FORTRAN90 program which makes a copy of a file with a given number of "words" per line.

STATS, a dataset directory which contains datasets for computational statistics, including biometric measurements, movie attendance and sales, taxicab fares and distances;

Reference:

  1. http://www.the-numbers.com, THE NUMBERS movie data web site.

Source Code:

Examples and Tests:

List of Routines:

You can go up one level to the FORTRAN90 source codes.


Last revised on Wed Jan 4 14:58:46 2012