Locating America's Hidden Psychic Population

GitHub repo for this project

Contents

  1. Introduction
  2. Collecting Data
    1. FM Radio
    2. Colleges and Universities
    3. Vortexes
    4. Ley Lines
    5. Assemble Data
  3. Analysis
    1. FM Radio Stations
    2. Colleges and Universities
    3. Radio & Colleges/Universities
    4. Vortexes
    5. Ley Lines
    6. Vortexes & Ley Lines
  4. Conclusion

Introduction

The goal of this project is to identify which areas in the contiguous United States are most likely to be secretly home to a proportionately large psychic population. The method I am using requires that I make a few assumptions:

In order to measure any effect radio waves have on cognitive ability spatially, I will find those areas where there is the strongest negative correlation between:

  1. The measured performance in colleges/universities, and
  2. The number of FM signals in range of that college/university

This analysis will result in a map of the US symbolizing the spots most likely to be home to psychics according to the above methodology. The next step will be to compare this to an aggregate map of the conventional ley lines and vortexes, or other superstitious energies.

I will need the following data for this project:

Initial Setup

I made a "Project" folder in my removable drive, within which I made a "Data" folder to save intermediate data and my file geodatabase into, and a "Maps" folder to hold my ArcMap project. I also made a "Reporting" folder to hold screenshots, notes, and eventually my PowerPoint presentations. I will write this guide assuming the same setup.

Collecting Data

To Top

FM Radio

Unfortunately, this data is not delimited by tabs or given a way to easily download it, so we'll have to turn it into something we can work with. There were 26829 entries when I did this. Another problem was that the location data was given in a string format, with the compass direction, degrees, minutes, and seconds separated by spaces. We don't want to go to the trouble of finding a geocoder that can parse that.

Figure 1: CustomDelimiter.java

Figure 2: Tab-delimited FM radio data in Excel

Figure 3: Use of the Text to Column tool

Now when we import this table into ArcGIS we can use these new columns to calculate the latitude and longitude as decimal degrees. Then we will be able to geocode the radio stations.

To Top

Colleges and Universities

Figure 4: The HIFLD Colleges and Universities data

This shapefile is thorough and contains the locations of every college/university in the US, but doesn't contain any data that we could use to measure academic success or cognitive ability. So we'll get a table that does have that information but doesn't have location.

This table will also need some grooming before it is ready to bring into the project. Both of our colleges/universities data have an IPEDS ID, which is what we'll use to join them, but we need the columns to be of the same type. As well, the % signs in the percentage fields will cause those columns to be imported as text, so that needs to be fixed.

Figure 5: Correctly groomed IPEDS data

To Top

Vortexes (Vortices?)

A vortex is apparently a word for a site of high spiritual energy, or where the Earth's energy rises up like a vortex. It's not a well-defined convention, but a convention nonetheless. There is a map of "known vortexes" (I will refrain from using quotation marks from here on, as it would get exhausting) at www.vortexhunters.com/vortex-map.html which I almost used, but I realized that it would skew my analysis; These are reported vortexes, and so naturally they occur most in areas of high population. I don't want that skew in my final comparison. This seems to be the most complete list of vortexes, and relies on report, so I opted instead to focus on a smaller number of famous vortexes, such as at Sedona, and consider them to represent very substantial concentrations of superstitious energy.

Figure 6: Google search for prominent US vortexes

Figure 7: Getting vortex coordinates

Figure 8: Vortexes data filled in from scratch

To Top

Ley Lines

Another convention of spirituality or superstition is that of ley lines. It is an old concept; The idea is that there are invisible lines across the surface of the Earth that represent the flow of energies (or magnetism, depends on who you ask). Part of the concept is that these lines naturally cross through areas of power or spirituality. The intersections of these lines are especially powerful, (and as such I will stack their energies in my aggregate energy layer). These intersections, or "nodes" are also often vortex sites. I would think, given the number of purported areas of energy, it would be trivial to draw lines that intersect them. Yet, there at least seems to be almost a consensus as to where these lines are located over the US. One of the mappings of the lines has turned up more than once.

Figure 9: Examples of the ley line convention chosen

To Top

Assemble Data

Analysis

To Top

FM Radio Stations

We want to turn the FM Radio table into a shapefile that shows us the range of each station. First we need to turn each row into a point feature. We must geocode the table using decimal degrees.

Figure 10: Latitude Field Calculation

Now we can geocode the FM station data, because we have the coordinates in a form that the geocoder can understand. We're going to make a new point shapefile in the geodatabase.

The FCC provides very little in the way of metadata for this dataset. The coordinate system is not provided. However, a standard for such data as this that covers the entire USA is NAD 1983. I tries this as the geographic coordinate system when geocoding, and it matched up nicely with the rest of my data across the whole USA, so it seems to be the right choice. For this map, we will use the USA Contiguous Albers Equal Area Conic USGS projection.

Figure 11: All US FM radio stations

Another thing the FCC does not provide is the algorithm they use for calculating signal reach from a radio station at a given dBu (decibel unit). The standard dBu at which it can still be considered commercial grade signal strength is 60. There is a tool on the FCC site that gives the reach radius from any FM station, but it would be enormously slow to put each one through that page, since we have thousands of stations. Instead I attempted to reverse engineer the algorithm on the page. Given a station's antenna height above average terrain (HAAT) in meters, and effective radiated power (ERP) in kiloWatts, and some target strength in dBu, it is possible to find a radius of reach. My quick look through some old documents leads me to suspect that the FCC doesn't use a pure math algorithm, but instead a reference table, along with interpolation for in-between values. My attempt at spoofing their algorithm lead to:

Radius (km) @60dBu = ERP (kW) ^(0.394) * HAAT (m) ^(0.444)

This is not a total analog but is a close enough approximation, and is at least proportionate. Let's remember we're here for the GIS. Before we start calculating for every station however, notice that some have faulty data, or data that indicates the station is not actually in use (or otherwise cannot produce a signal). This includes any rows with an ERP or HAAT that is 0 or below, or null. ERP >= 0 means that no signal is generated. HAAT >= 0 means the signal cannot appreciably propagate.

Figure 12: All FM radio signals over the contiguous USA

To Top

Colleges and Universities

At this point we have a point shapefile of all US colleges and universities, and a table of graduation and transfer-out statistics for those institutions. We can join these data to correspond each spatial point feature to its statistics.

But how do we calculate student cognition or academic ability from this data? One fairly fair way is to consider a high graduation rate as a measure of success, a high drop-out rate as a measure of failure, and a high transfer-out rate as something in-between. Transfer-out students may be performing very well or very badly, there's not enough information to generalize. Of course, this method may seem a bit ignorant, as many students drop out for many reasons, including geniuses, but it seems the most fair method, and less likely to require normalizing for some other spatial attribute. My exact equation was:

Successful_Rate (%) = Graduation_Rate (%) + (Transfer-out_Rate (%) / 2)

This is equivalent to saying graduating is success, transferring out is half-success, and dropping-out (or otherwise not graduating or transferring after 150% of the time it should take) is no success.

We know have a quantified measure of academic ability (or cognition) for each spatially referenced college/university. As you can see in the figure below, this data tells us very little so far, except that there are people graduating, transferring out, and dropping out all over the country. Remember that what we are looking for is not a pattern of low to high cognition across the US, but a pattern of strong negative correlation between cognition and number of radio stations in range for each institution.

Figure 13: Colleges/Universities symbolized by "SuccessfulRate", from red (low) to blue (high)

Now let's see both of these layers at once:

Figure 14: Radio ranges and schools by success

There's not much to see except a general indication of population. But it is not yet clear which spots have the most radio stations, as the buffers overlap each other.

To Top

Radio & Colleges/Universities

We are not done processing the FM radio data. We want to know, for a given point (namely a college/university) how many stations are in range. There may be a few ways to figure this out, but I've only found one after a long time searching, dabbling in many different geoprocessing tools.

Figure 15: Intersecting FM buffers with college data

You'll see you now have a feature class with ~74000 rows in its attribute table. That is one entry for every intersection of a college and a radio buffer. That means that for every college, there are N rows with that college's data and a radio station's data, and N stations in range of that college. Importantly, each intersect row has an entry of 1 for "NumSignals".

Figure 16: Dissolving the radio/college intersections and summing the number of signals

The resulting dataset has the same number of rows as the earlier colleges data, and a field "SUM_NumSignals" which is the number of signals in range of that college. This is a breakthrough for us ( -for me at least- it took a while to figure out how to get this and none of theideas for similar problems online seemed to work here).

Figure 17: Beautiful range and distribution

What might immediately strike you is that the range of signals in range is approximately 0-100. This makes an analysis on the correlation between this number and the percentage of success field a bit easier. Since each field can be considered a percentage (percentage of successful students, and percentage of possible FM signals in range), we can measure a negative correlation as the difference between the two.

Figure 18: Getting the correlation between number of frequencies in range and academic performance, for each institution

Figure 19: Radio effect on cognition finally mapped

Figure 20: Alternate symbolization. Cohort size determines point size

The next step is the pièce de résistance. It requires raster tools - spatial analysis. We'll make sure we're equipped for it. To define the area over which we will calculate each raster cell, we first need a shapefile of the contiguous USA. I won't get into details, as how you do this is somewhat arbitrary and there are many ways to do it. I created a polygon feature roughly outlining the US, similar to how the I draw ley lines as described further down.

Figure 21: Preparing the Point Statistics tool

The output is a raster feature class, symbolized by the average RadioCognitionEffect in its radius. There are blank spots in the West where no colleges/universities exist for long stretches. I symbolized this raster Blue to Yellow for Low to High effect on cognition.

Figure 22: Negative correlation between number of FM frequencies in range and academic performance, rasterized

To Top

Vortexes

We will do a similar sequence of steps with this dataset as we did already with the much larger FM radio dataset from the FCC.

Now we have point features for some major vortexes. You'll notice now, if you hadn't before, that many of them are in the Sedona, Arizona area. It's supposed to be about the most powerful area in the US. We want areas though, not points. The energies are supposed to be present within some radius of the sites, and we need a way to represent higher power at Sedona where the sites are all together. I could try to do something similar to above, trying to find the number of buffers overlapping, but it would be easier to use a partly transparent symbology to see the summing up of energies.

To Top

Ley Lines

We need to georeference the ley lines maps. We will use just the lines that appear in both maps, and position them as close to both references as possible, to create a georeferenced line shapefile.

Figure 23: Beginning to georeference a Ley Line map

Now we add control points to each of these images. These points are in pairs, one on each image, and are the points that we assert are in the same location. Because we do not have a basemap, let's select some radio points and colleges in our current map, one at a time, and look online for where they are located in relation to the border of the US. That will give us a more accurate idea of how to place the control points.

After making a second control point, the image will become correctly sized, but will still not be projected correctly, so more control points are necessary. It helps to make them at opposite coasts, like one in Florida, one in Washington, one in Maine, one in California, etc. The curve of the lines of latitude can be achieved using points on the Canadian and Mexican borders. Once you've changed the transformation to Spline, you may see a few places where things are still off and need more control points. And don't forget to add a couple control points way inland.

Figure 24: The resulting curvature of using Spline transformation to map to our projection

Figure 25: Second tiff georeferenced

Now that both images are georeferenced we can confirm that they mostly agree with each other, but we see that they have little coherence with the vortexes we chose. That's okay. Now is time to create our ley lines line shapefile.

Figure 26: Display showing the ley lines created using the tiffs as reference

To Top

Vortexes & Ley Lines

Now we want to do similar to what we did with the vortexes, and make several buffers from each ley line.

Now, you should have six buffer energy polygons: Vortex_Buffer50km, Vortex_Buffer100km, Vortex_Buffer200km, Ley_Lines_Buffer50km, Ley_Lines_Buffer100km, Ley_LinesBuffer200km. The names aren't that important. I chose 50km as the base buffer because one of the many sources I had to review to get data on Ley lines, etc., reported that their energies are 100km wide.

Figure 27: Importing symbology so you only have to configure it once

Figure 28: Distribution of Ley and Vortex energies over the US

To Top

Conclusion

The resulting map is aggregated spiritual energy achieved without raster tools. The only step left in the analysis is to compare the radio-cognition findings with the vortex-ley findings. There does not seem to be any agreement between the two, overall, that cannot be attributed to coincidence. Of course, I do not believe in superstitious energies, but whether they might have any correlation with the presence of psychics is another question.

The resulting map is aggregated spiritual energy achieved without raster tools. The only step left in the analysis is to compare the radio-cognition findings with the vortex-ley findings. There does not seem to be any agreement between the two, overall, that cannot be attributed to coincidence. Of course, I do not believe in superstitious energies, but whether they might have any correlation with the presence of psychics is another question.

The real purpose of this project was to demonstrate some ArcGIS analysis. I don't hold any beliefs in hiding psychics or ley lines.

See TIF Map/Report Visual

Scott Waechter

Oh it didn't work...