ExcelGuide.pdf

CONOP9 EXCEL USERS’ GUIDE
1
CONOP9 QUICKSTART GUIDE FOR EXCEL USERS
WHAT CONOP WILL DO
CONOP9 is primarily used to correlate stratigraphic sections which contain fossils. The
correlation may be an end in itself, or one step in the construction of a diversity curve or a
calibrated time scale. CONOP9 makes a best estimate of the overall sequence of first and last
appearances of fossil taxa, based on the ranges of taxa preserved in each section. It can handle
large numbers of taxa from many sections. It constructs composite range charts, fence charts,
species richness curves, taxon longevity curves, etc.
This guide assumes you are familiar with Excel and will use it to enter your data. The important
steps in the process are
1) installing CONOP9
2) gathering the data
3) entering the data
4) adjusting settings in CONOP’s configuration file
5) running the program
6) looking at the results,
7) adding non-taxonomic data like marker beds and age dates.
INSTALLING CONOP
1. Copy the distribution files onto your hard drive. The following narrative assumes that all the
folders are stored within one named CONOP. In most distributed formats there will be a
‘DOCS’ folder that contains the complete user’s manual, (Manual9.doc/pdf may be split into
two files Guide.doc/pdf and Reference.doc/pdf), a NOVICE folder for inexperienced users,
and an EXCEL USERS folder with this manual. An example data set may be nested within the
Excel folder or must be sought in the general suite of data folders.
2. On some computers, the files copied from a CD source retain “read only” properties. Make
sure the files you will need to use are changed to ‘archive’. If you don’t know how to do
this, see INSTALLING CONOP in the NoviceGuide.doc file.
3. Make a new folder within the CONOP folder for your own project.
4. Move copies of the CONOP9.EXE and CONOP9.CFG files into your folder.
THE DATA YOU NEED
1. Two or more measured stratigraphic sections.
2. Taxon identifications with at least two taxa shared between sections.
3. The levels at which each taxon found in each section first appears (FAD = first appearance
datum) and last appears (LAD = last appearance datum).
ENTERING THE DATA
You next have to put your data into input files, within your folder, in a format that the CONOP
programs can read. The CONOP program will need these in text file format but you can build
them in Excel and then save them as “TEXT (tab delimited)” files (space delimited should also
work).
2
CONOP9 EXCEL USERS’ GUIDE
1. The .SCT file labels your sections, assigns them numbers, and a place in the final fence
diagram. You can put in a header row while you are entering data if it helps you keep track of
which data goes into which column but it must be removed before you do a final save_as.
There are five columns (fields) of information and one row (record, line) for each section.
Inverted commas (single quotes) are necessary in the text file when there are blank spaces within
the entry meant to be in one column. Excel uses the leading inverted comma for something else
so you will have to enter a space before the inverted comma OR add them later, after converting
the file to .txt format.
.SCT file format
SECTION#
ABBREVIATION
consecutive
integers starting
with 1 in first
row
Nickname for each
section, three letter
maximum, in single
quotes.
A
1
2
1
2
FENCE
ORDER
consecutive integers 1
through number of
sections in any order
B
‘DL1’
‘KT2’
C
1
2
NAME
INCLUDE
Your identifier or name for each
section, in single quotes, 20 space
maximum.
Just put 1 for
all rows
D
‘Denmark section DL4’
‘Seymour Island KT2’
E
1
1
Column A on each line is the section number which you will be using when you enter your data
in the .DAT file so make a record of it in your notes for that section. Number the sections
consecutively, starting with 1, omit no numbers, and list them in the .SCT file in order, 1 first.
Column B is a three-letter abbreviation used to identify the column in output files. It must be
preceded and followed by single quote marks if it includes a blank space. It can be an
abbreviation of the name or any code that refers to other databases
Column C is a number that tells the program what order to draw the sections in for fence
diagrams. They can be in any order but must be 1 through however many sections you have; i.e.
the same numbers as column A, but in custom order.
Column D is a longer description of the section for your own identification purposes. it cannot
be more than 20 letters. Set this entry off with single quote marks in Excel (space first!).
Don’t worry about the final column for now, just enter a 1 for each section.
If you made a header row, save the .xls file then erase the header before saving the file as a TEXT
(tab delimited)(*.txt) file. Using .SCT as the extension might help you find it again more easily
than .txt. Close the file in Excel and then open it in a simple text editor like WordPad to make
sure the inverted commas are where they should be and the data are properly separated. It is not
necessary that the columns align vertically, or that the white spaces are of the same length in all
sections. For proof-reading it may be easier to align the columns. Achieve this with tabs and/or
a non-proportionally spaced font.
3
CONOP9 EXCEL USERS’ GUIDE
2. The .EVT file labels your taxa (and other types of events) and assigns them a number and
nickname. Again, you may want to label a header row initially but it will have to be removed
before the final save.
.EVT file format
TAXON#
NICKNAME
NAME
consecutive integers starting
with 1 in the first row
Shorthand code or
abbreviation for the name
Taxon name
A
1
2
3
B
1
2
3
C
‘Bispathodus aculeatus aculeatus’
‘Bispathodus anteposicornis’
‘Bispathodus costatus’
‘Baa’
‘Ban’
‘Bco’
Column A is the event number and must be 1 in the first row, 2 in the second, etc.
Column B is for the taxon nickname. Start with a space and single quote first, ‘, then a shorthand
code for that taxon. It might just be a shortened version of the name, like ‘P.col’ for Polygnathus
collinsoni, or a code that refers to another database. Follow it with another single quote.
Start Column C with another space and single quote and then the taxon name, followed by
another single quote, like ‘Polygnathus collinsoni’.
Enter a line like that for each taxon in your sections. Erase the header row if you made one and
save the file as TEXT (tab delimited)(*.txt) with your own file name and the extension .EVT.
3. The .DAT file will contain all of the information on FAD and LAD levels for all taxa in all
sections. There will be one row (line, record) You will be using the section numbers you
assigned in the .SCT file and Taxon numbers you assigned in the .EVT file.
.DAT file format.
TAXON#
From the
.EVT file
EVENT
TYPE
1=FAD
2=LAD
A
1
2
3
4
1
1
2
2
SECTION#
From the
.SCT file
B
1
2
1
2
STRAT LEVEL
LEVEL #
in meters or
whatever from
your data (not 0)
see below
C
1
1
1
1
D
0.62
9.3
8.4
8.4
ALLOWED
MOVES
1 if FAD
2 if LAD
E
1
3
2
2
WEIGHT
UP
WEIGHT
DOWN
just put
1
just put
1
F
1
2
1
2
G
1
1
1
1
H
1
1
1
1
You can enter the data in any order and then do a “sort” to get it ready for CONOP to use. I have
found it convenient and quick to enter one section at a time in the following manner. For each
taxon in the first section, note the taxon number from the .EVT file. In Column A enter each of
those taxon numbers twice, i.e. in two consecutive rows. You should have no rows for taxa that
were not found in this first section. The first row for each taxon is for the FAD and the second
row for the LAD; so go down Column B entering 1,2,1,2,1,2 and so on. Now enter the section
CONOP9 EXCEL USERS’ GUIDE
4
number, the one you assigned in the .SCT file, in each row in Column C. Now you should have a
row for every first and every last appearance of every taxon found in the first section.
In Column D enter the stratigraphic level at which the event occurred, either the lowest level at
which it was found for the FAD or the highest level for the LAD. If the taxon was found at only
one level, enter the same number in both the FAD and LAD rows of column D. These numbers
may be entered with or without decimal places. It is safest to avoid using “0” in this column -early versions of CONOP used zero for taxa not seen and it is possible that some little-used
algorithms retain this feature. If one of your levels is 0, either change it to 0.01 or add an
arbitrary constant, like 100 or 1000.00, to all of the levels in the section. You can enter negative
numbers for well depth; this ensures that the values for both well depths and measured sections
will increase up-section from older to younger.
Filling in Column E is where Excel comes in handy. Highlight all rows of data for the section
you are working on. Do an ascending sort on column D. Your entries should now be in reverse
stratigraphic order with events that occur at the same level grouped together in column D. Now
number the levels in column E consecutively, use 1 for the lowest level. All events that occur at
the same level should have the same number in column E.
In column F, enter a 1 if column B is 1 or a 2 if column B is 2. You can also just highlight
column B and copy and paste it into column F.
Enter a 1 or 1.00 in both column G and H for each row. Don’t worry about what they mean at
this point.
Now you are ready to enter the next section. If you keep it separate from the first section you
entered you can highlight just that section when you sort and assign level numbers for that
section in column E. Each section has its own level numbers starting with one for the lowest.
Once you have entered all of your sections, go back and remove any blank rows you left between
sections and the header row if you made one.
You need to sort the data one last time for CONOP. Highlight the entire worksheet and do an
ascending sort on columns A, B, and C, in that order. Now the first rows of the spreadsheet
should group together all of the first occurrences of the first taxon in order by section. Next will
be the last appearances for that same taxon, in the same order by section.
You may prepare an unsorted file, with or without column E, and use CONSORT9.EXE to complete
and sort it. Nevertheless, it is more comforting to use the “hands-on,” and “what you do is what
you get” control offered by Excel.
Save the file now and be sure to do so in text file format with any name you choose but with the
extension .DAT. Close the file and Excel. There are usually some errors in data entry and you
can find some of those by trying to run the data and reading the error messages in CONOP.
CONOP9 EXCEL USERS’ GUIDE
5
SETTING UP THE CONOP9.CFG FILE
You have to tell the CONOP algorithms a few things about your particular data before you run the
program. You do that in the configuration file, CONOP9.CFG. Double click on the CONOP9.CFG
file in Windows Explorer to open it in WordPad. Scroll down to 1.1 Input Data. Replace the
words or numbers in the following lines with your own, in single quotes where shown. You
don’t need to use italics, they are just used here to show what parts you enter.
PROJECT = ‘your own title’
SECTIONS = the number of sections in your .SCT file
TAXA = the number of taxa in your .EVT file
EVENTS = 0
MAX_LEVELS=100 (change only if you have more than 100 levels in a single section)
MAX_LABELS=30
LOADFILE = ‘the name of your .DAT file’
PREPFILE = ‘OFF.dis’
SECTFILE = ‘the name of your .SCT file’
LABELFILE = ‘OFF.lbl’
EVENTFILE = ‘the name of your .EVT file’
BESTKNOWN=0.0
In the next section,
1.2, you may not
have to make any
changes. The
settings should be:
PENALTY='LEVEL'
LETCONTRACT='OFF'
WEIGHTING='on'
USENEGATIVE='OFF'
NEARENOUGH=5.00
EXCLUSIVES='yes'
FORCECOEX='ss'
HOMERANGE='sl'
SMOOTHER=0.00
SQUEEZER=0.00
SHRINKER=0.00
TEASER=1.0
STACKER='COEX'
Section 1.3 will look
something like this. You
will be coming back later to
change the STEPS,
TRIALS, and STARTEMP
settings. Leave the other
settings as they are except
for the last two. You can
either enter ‘OFF’ in front
of the file name there or
replace the file name with
your own, ‘KT3x15.grd’
for example.
SOLVER='anneal'
STEPS=500
TRIALS=100
STARTEMP=200
RATIO=0.98
HOODSIZE='big'
STARTYPE='rand'
STARTSECT=1
STARTEVENT=0
SHOWMOVIES='CHT'
TRAJECTORY='ALL'
VIDEOMODE='SVGA'
PAUSES='ON'
CURVFILE='OFFfilename.grd'
CRV2FILE='OFFfilename.gr2'
CONOP9 EXCEL USERS’ GUIDE
Section 1.4 should
look something like
this and you
probably won’t
have to make any
changes here.
6
COLUMNS=7
UNLOADMAIN='outmain.txt'
FITS_OUT='ON'
CNFG_OUT='ON'
SEQN_OUT='ON'
INCR_OUT='ON'
LOC_OUT='ON'
OBS_OUT='ON'
COMP_OUT='ON'
UNLOADSECT='outsect.txt'
SECT_OUT='OFF'
UNLOADEVNT='outevnt.txt'
EVNT_OUT='ON'
COEX_OUT='COUNT'
RUNLOGFILE='runlog.txt'
CULLFILE='cull.txt'
SOLNLIST='OFFfilename.sln'
STARTFILE='soln.dat'
STEPFILE='OFFstepsoln.dat'
BESTARTFILE='bestsoln.dat'
COMPOSFILE='cmpst.dat'
COMPOSNMBR=1
COMPOSTYPE='ZST'
OBSDFILE='ab.dat'
PLCDFILE='albet.dat'
EXTNFILE='delta.dat'
COEXISTFILE='coex.dat'
ORDERFILE='ordr.dat'
ERROR MESSAGES AND CORRECTIONS
Now save the file and close it. You are now ready to try a run of CONOP9. In Microsoft Explorer
double click on CONOP9.EXE in your folder. You will probably get some errors in the data and
some error messages -- these will result from incompatible or inconsistent values entered into
the excel spreadsheet. Errors in the .DAT file are usually identified well enough in the error
message to find and correct them. Omitting a single quote mark in the .SCT or .EVT files will also
bring up error messages, usually in a blue box. Correct the errors, save the corrected file, and
close the CONOP program, and try again by double clicking on the .EXE file.
CONOP9 EXCEL USERS’ GUIDE
7
RUN THE PROGRAM
Repeat trying to run the program and making necessary error corrections until the program
accepts the data and builds an array to use, then press enter when prompted to start the run. You
should see and animated range chart twinkling on the screen. Ranges will extend and contract
through numerous trials until the program reaches a solution.
The “answer” you are looking for is the “lowest penalty,” or smallest misfit between the model
sequence of events and all the local stratigraphic sections. The misfit penalty is shown in green
in small numbers to the left, near the bottom of the animated range chart diagram. As the
program runs, it adds a green box whenever it finds a model sequence (trial solution) with a
lower misfit penalty. The lowest penalty is the solution that extends the ranges as little as
possible to fit the data in all the sections to a single sequence. There may be more than one
sequence of events that gives the lowest penalty. Especially when there are few sections, several
events may fall at the same level in the “answer” although they can appear in any order amongst
themselves and not change the penalty. Not every run will necessarily yield the lowest penalty –
see remarks about the “cooling schedule.”
When the program stops click Enter to see the final range chart it came up with and then Enter
again for a screen with a menu bar that includes pull-down menus for various graphical and text
outputs. Close this window and run the program again. Repeat several runs to see if you
consistently get the lowest penalty. If not, you can adjust the settings in the .CFG file to try to
improve the penalty and consistency. This is called “adjusting the cooling schedule”.
ADJUSTING THE COOLING SCHEDULE
This is basically a matter of trial and error but with practice gained from watching the animation
you get a feel for what changes might be needed. Reopen the CONOP9.CFG file, scroll down to
section 1.3 and find the lines STEPS, TRIALS, and STARTEMP. Experiment changing the values,
saving the file, and then doing more runs to try to get the lowest penalty more consistently and
faster. The larger the data set the less likely it is you will get the same low penalty each time.
Length of time it takes to do a run will be a factor in larger data sets.
Some Example Settings:
PROBLEM SIZE
STEPS
TRIALS
STARTEMP
7 sections, 62 taxa
27 sections, 131 taxa
198 sections, 1170 taxa
500
500
50
100
500
10000
200
100
100
The number of steps is how many times the program lowers the temperature and moves on to
another set of trials. If the green boxes were still appearing near the end of the run then add steps
by increasing the STEPS = number. Try increasing it by intervals of 50. Decrease steps if the
latter part of the run showed no improvement (green boxes).
The number of trials is the number of trial sequences the program examines at each temperature
step, trying to find a better solution. Well behaved data sets, where there is generally good
agreement in the order of events from section to section, can reach a good solution with fewer
CONOP9 EXCEL USERS’ GUIDE
8
trials. You might need just 1 trial or you might need 10000. You might be able to lower the
number of steps if you increase the trials. But beware, in combination with the cooling ratio, the
number of steps ensures that the final temperature is suitably low to freeze-in a good solution. In
general, increase trials rather than steps.
The STARTEMP is too high if the program runs for awhile with no green boxes appearing. If it is
too low and the data is poorly constrained you risk getting stuck in a bad solution -- during the
animation, the line of green boxes dives rapidly down the screen and stops abruptly without the
characteristic negative exponential curve of diminishing returns. If the starting temperature is
raised substantially, it may be necessary to increase the number of steps in order to achieve a low
enough final temperature.
THE RESULTS
Once you have found the parameters that run the data through CONOP to a consistent result, you
will want to look more carefully at the results. You may have noticed already that once you got
the program to run more files appeared in the folder. Which files these are depends on the ‘ON’
‘OFF’ settings in the .CFG file. You can also choose to create some more once you have a good
run through with the lowest penalty. Perform another run and when it stops, click ‘Enter’ once.
This is your range chart, with the ranges shown from top to bottom in order of the taxon number
assigned in the .EVT file. One of the ranges is light blue instead of white. The name of the taxon
for that range appears in blue at the bottom of the left side of the screen. You can examine each
range in turn and identify the taxon by moving through the ranges using the “+” and “-“ keys.
When you have exhausted your interest in this amusement, click on the ‘enter’ key once more.
This is a blank page but at the top, on the menu bar, are some output options to examine. Click
‘TEXT-FILE OUTPUT’ and a pull down menu will appear. Under ‘RANGE CHARTS’ click on ‘In FAD
order’ and then click ‘enter’ when prompted. This will create an output file for you that will list
the taxon ranges in order of FADs for each section. Now create one for the composite ranges by
clicking on the TEXT-FILE OUTPUT again and selecting ‘best sequence’ under the COMPOSITE
RANGES heading. Create the cpcht.txt file by pressing ‘enter’ when prompted. (Alternatively,
select ‘best sequence’ under ‘RANGE SEQUENCE’.) Those two files should have the output data
you need.
Correlation of sections.
To find the levels at which CONOP has placed events in each section you will want to look in the
FADchart.txt file. The file lists, for each section in order, the projected levels for each taxon’s
FAD and LAD. This includes FADs that might be projected to occur lower and LADs higher in the
section than they were actually observed as well as the levels at which taxa not observed (‘not
seen’) in the section but which are believed to be contemporaneous would appear or disappear if
they did occur there. The taxa whose ranges do not overlap with that section will not be listed.
Note that there will be a plethora of FADs listed at the bottom and LADs at the top which represent
taxa which originated before the base of the section or disappeared after the top collection level
respectively.
Composite Range Chart
If you are, instead, interested in building a range chart for the taxa, establishing a record of the
order of their appearances and disappearances, you then can look in the cpcht.txt file. The FAD
and LAD levels are listed for each taxon in a calculated “master section” where the spacing of
CONOP9 EXCEL USERS’ GUIDE
9
the levels is based on the correlation of individual sections you entered. 1000 was added to the
levels along the way in the program so you could subtract 1000 if you wanted. The sqcht.txt
option is similar but the levels are numbered sequentially and not spaced to any scale.
GRAPHICAL OUTPUT
You can create your own range charts and fence diagrams from data in the output files using
Excel or a drafting program. Some are available in simple form in the “GRAPHICAL OUTPUT”
pull-down menu after you have done a CONOP run. Most of these require more explanation than
we can go into here but try the “TAXON RANGE CHARTS”, “FENCE DIAGRAM” and “SECTION RANGE
CHARTS”. Note that the section range charts give each FAD and LAD a separate line even though
several might actually occur at the same level. You navigate through a variety of styles with
simultaneous ‘x’ plus ‘+’ or ‘-‘ keys and through events with the + and – keys. This may take
some experimentation to master but also see the full Manual9.doc for more detailed explanations
and navigation directions for each type. Return to the CONOP program by clicking the mouse and
then pressing the escape key.
ADDING MARKER BEDS AND HORIZONS
There may be additional types of information with measured stratigraphic sections that can be
used by CONOP in conjunction with the taxon ranges. Unique beds or horizons can force
correlations and absolute ages can add absolute time to the scale. The .SCT input file should not
be affected by including these types of events unless you are adding a section which is only tied
to the others by marker beds and dated horizons.
“Acme Horizons” = type 3 events
An acme horizon is a layer in which a taxon, for which you also have FAD and LAD information,
is unusually abundant. The taxon involved should already be defined in the .EVT file. However,
add a row to the .DAT file for each section in which the layer appears. The first column contains
the same taxon number as you assigned to the FAD and LAD, the second column should be ‘3’.
The ‘allowed moves’, column F, should also be ‘3’. There must also be an FAD and LAD for the
same taxon in the same section. In the .CFG file you must count this as an event so go to section
1.1 and set EVENTS = (number of type 3, 4, and/or 5 events). This event type has only had
limited testing so examine results carefully.
Unique Chronostratigraphical Marker Horizon = type 4 events
A distinctive ash bed would be an example of this type of event. You will have to add an entry
in the .EVT file to define each marker bed and assign it a number. In the .DAT file you will have
to add one row for each section in which it appears. The event type in column B is ‘4’ and the
allowed moves in column F is ‘0’. In the .CFG file you must go to section 1.1 and set EVENTS =
(number of type 3,4, and/or 5 events).
Age Dates = type 5 events
Age dates must also be given their own event numbers in the .EVT file. In the .DAT file they
are entered as event type ‘5’ in column B and allowed moves in column F should be ‘0’. Since
the last two columns in the .DAT file have no meaning with regard to weight in this instance
they are used to record the age range on the date (+ or – 2 sigma), youngest in column G. In the
.CFG file you must go to section 1.1 and set EVENTS = (number of type 3,4, and/or 5 events).
CONOP9 EXCEL USERS’ GUIDE
10
THERE’S MORE
For a more detailed treatment of all options, input, output and aspects of CONOP9 see the file
MANUAL9.DOC or MANUAL9.PDF in the ‘Docs’ folder within the CONOP folder. In some versions,
MANUAL9 has been split into two parts: GUIDE9.PDF and REFERENCE9.PDF. You can also find
more information in text at the end of the CONOP.CFG file. There are directions on and examples
of a couple of ways of making range charts in Excel in the .xls files. ExcelHRangeCht.xls
illustrates a chart with horizontal ranges. ExcelVRangeCht.xls illustrates a chart with vertical
ranges.
Marilyn Ann Kooser
Geology Museum
University of California, Riverside
Summer 2002 – Winter 2003