![]() |
![]() |
Spectrographic Analysis for the EVP Researcher What is a spectrogram? Sound – invisible, always in motion and intangible. Our ears perceive sound as a series of waves with changes in amplitude, frequency and harmonics moving through the air. The intangible properties and constant motion of sound are the exact reason why it is so hard to interpret sound patterns. Spectrograms change all of that. Spectrograms give us the power to take a sound or series of sounds, plot them in real time and make an interpretable data formation out of them. Spectrograms, also known as sonograms, are computer programs that convert the varying intensity of an acoustic signal into a visual representation that is plotted on a graph according to its intensity. In most traditional spectrograms a grayscale pallet has been used, however as times and technology has changed so too has the spectrogram. In the grayscale format the degree of darkness at any given point is directly proportional to its amplitude (sound energy) at that time and frequency. Most modern spectrogram software programs are created with a color pallet that displays shades and hues that are representative of the sound energy. The intensity of the sound energy varies with the colors in the pallet within the spectrogram program and thusly creates a more easily identifiable spectrum of data for the interpreter. The color intensity representations vary from one spectrogram software program to another so be sure to completely read the manufacturers instructions on how to interpret their color pallet. This “picture of sounds” is an incredibly useful tool to a paranormal investigator, especially when it comes to EVP work.
10.2 – This is a basic illustration of the qualities of a spectrogram Spectrograms use a technique called Fourier analysis to convert the sounds into pictures. More precisely, a Fourier transform is applied to the electronic analog representation of an acoustic wave where the spectrogram then derives the translation of the frequencies and amplitudes of its component simplex waves into an interpretable data form. When this process is performed it renders a picture of sound that we can then interpret into meaningful data. Depending on the depth of detail programmed into the analytical properties of the Fourier analysis window of the Spectrogram there can be a variety of different levels of resolution and detail that can be observed from one spectrogram to the next. Most spectrographic programs use a short-analysis wide-band window where the adjacent harmonic frequencies seem to be adjoined rather closely in the visual representation, but, this shortcoming allows for better real-time resolution when performing sound analysis of EVP files. “Seeing” Sounds In order to “read” the images of sounds that a spectrogram will create for us we must first understand the qualities of normal human speech so that we will have a baseline to compare our EVP recordings to. Before we get too far into this portion of the chapter I should mention that the sounds that are going to be analyzed will be those of a native speaker of North American English. Please note that sounds in all languages are organized into a sequence of abstract units called phonemes. For those of you who are not native speakers of North American English I would suggest researching a set of phonemes that you are accustomed to hearing before you try interpreting a language that is not your native tongue. Most times, web sites on speech and speech therapy can be found in your native language through a basic internet search. This research will make your understanding of the subject more complete and your EVP analysis more accurate. When analyzing speech patterns on a spectrogram the program will apply a mathematical technique called Fourier analysis to the acoustical sound waves so that we may know what frequency that particular waveform is on at any given time. Typically what occurs during this process is that the recording is played through the spectrogram and then broken down into very small pieces of data from ½ of a second to 1 second. These pieces of the whole are then analyzed to find what frequency they are on. Once the analysis of each of the data pieces is complete the results are added up and then divided by the number of data bits that were used. This gives us the mean frequency of the overall EVP. What you will find in the next few paragraphs is a basic lesson on the function of sounds within the speech of the North American English language. Only the primary categories of vowels and consonants will be covered so that we can, at a minimum, understand the contrast between the most basic speech sounds and patterns for analytical purposes. The seven categories of speech sounds are divided by their mode of phonation: Consonants: Approximants, Nasals, Fricatives, Plosives and Affricates There are four (4) Approximants, also known as semi-vowels, which have sounds midway between a consonant and a vowel in the American English language. It has been found that in these phoneme categories there tends to be more constriction in the larynx for these vowels than for normal vowels, but conversely there is less tension for these consonants than in the other consonant categories. American English only has three (3) Nasals in which the air flow is completely blocked from the vocal tract. An example of this type of nasal can be found in the “ng” portion of the word “sing,” the “m” in the word me, and the “n” in the word new. The nasals have much less energy than many of the other phonation categories. This is because the oral tract is completely blocked, and sound waves radiate principally from the nose. The American English language has nine (9) Fricatives which are denoted by either weak or strong noises when produced, however this is the case only if the articulators are close enough together to cause a disruption of the air flow. These are “F”, “H”, “S”, “V”, “Z”, “th”, and “sh.” The “th” and “sh” account for two sounds a piece as they have both hard and soft pronunciations. The fricatives do not necessarily involve any voicing, although the voiced fricatives may have a very low voice bar. The signature of fricatives is in their high-frequency regions, which are more random in their energy distribution than voicing. There are six (6) Plosives in the American English language system and, as the name would imply, these are bursts or “explosions” of acoustic energy following a short period of silence; because of the silence during which the vocal tract is completely blocked, these phonemes are also called stops. The signature of plosives is an almost instantaneous passage from little or no acoustic energy to a short burst of high-energy in a wide frequency band. These sounds are created by a relatively complete closure of the vocal tract followed by a rapid release of the larynx to make “B”, “D”, “G”, “K”, “P”, and “T” sounds. There are only two (2) Affricates in the American English language system which is really plosives that are released as if they were fricatives. These would be the sounds found in the “ch” portion of the word change and “dge” in the word grudge. Vowels: Monophthongs & Diphthongs Monophthong vowels are characterized by strong stable voicing, and, speaking in terms of American English there are eleven (11) monophthong vowels which have a single vowel quality and two (2) that have a reduced quality. American English also has six (6) diphthongs which, unlike monophthongs have a strong but liquid voicing to them. These are vowels that generally manifest a clear change in quality from the start of the sound to the end of it. Common Sounds When interpreting a potential EVP with a spectrogram I will generally break down the various sounds into four categories: voicing, A/Nr noise, NVA noise and anomalous sounds. This is where the analysis of our potential EVP file will begin. We must first determine what sounds are found within the recording. Category 1: Voicing As the name suggests, voicing sounds are just that – someone talking in either the foreground (usually the investigator holding the mic or recorder) or possibly someone in the background who is speaking. Either way this type of sound is generated by a first party individual who is in the area at the time the sound file is being recorded. This type of sound will show up on the spectrogram as strong vertical striations in a waveform analysis, as brightly hued bands in a color spectrogram, or as deep black areas on a black and white sonogram. Often times, when working with EVP that has voicing in it I will create a “voice bar,” that is a notation on the highest pitch and the lowest pitch of the voicing sounds as a gauge against other potentially anomalous sounds that I may find. In the section entitled “The Mechanics behind EVP” it was stated that the general accepted range of human vocal sounds is between 280 Hz and 1000 Hz, however the human ear can hear in ranges of 20 Hz to 20,000 Hz (20 kHz). Essentially, what this means is that humans have a far greater range and capability to hear than they do to produce sounds. Although sound production is “limited” there is still enough of a range within the human voice to perplex even some of the best EVP researchers at times. Category 2: A/Nr (Arrhythmic / Non-repetitive) Noise A/Nr noise is an arrhythmic short-term sound which is non-repetitive and can be attributed to a source that is generally mechanical in nature such as a passing car, a train horn, a plane in the distance, machinery, a door slamming, etc. Noises such as this are, for the most part, easily differentiated from any type of anomalous sound as they tend to have a mechanical sound to them rather than a voicing sound. A/Nr sounds, due to their brief nature are generally something that is unexpected and out of the control of the investigator. Although A/Nr sounds are generally not detrimental to the attempt to capture EVP they can be annoying when the EVP is being analyzed as they are one more thing to have to listen to and determine whether it is an anomaly or not. Category 3: NVA (Non-Voicing / Ambient) Noise NVA is considered to be an aperiodic non-voicing or ambient background noise that can be caused by, but is in no way limited to: echo, wind, electrical hum, water, silence, etc. NVA is generally understood as “environmental” factors that investigators generally do not have control over. These, like A/Nr noises, are generally easy to distinguish from anomalous sounds since they tend to be in the background and are generally just noise rather than any type of organized, intelligible sounds. Category 4: Anomalous Sounds These are sounds that do not fall into any of the previous three categories and can not generally be defined as such by logical, rational or analytical means by the investigator. Anomalous sounds, at times, can be very illusory due to the fact that they can sound either like voicing noises or A/Nr sounds, especially when dealing with Class A EVP. When analyzed however, it has been found that these sounds are not actually voicing or A/Nr sounds. When they are checked against biological factors and the limitations of the human voice it is generally found that the anomalous sounds are beyond the range of human capacity, and thusly will exceed the range of physical limitation, not to mention that the voice usually doesn’t sound like anyone who was on the investigation crew. Bear in mind though that just because we are generally listening for a vocal EVP that doesn’t always mean we will get one. In some cases the EVPs captured are of sounds, echoes in time of events that have happened in earlier years and, although these are not vocal EVPs, they are still EVPs nonetheless. Understanding the Spectrographic Axis Time – (X) Time, when speaking of spectrograms, generally doesn’t apply in the way that we understand it. Time on a spectrogram is called the “X axis,” and is generally found along the lower horizontal bar near the bottom of a spectrogram screen. Often this scale is broken down into both seconds and milliseconds.
10.3 – This is a spectrogram with an active sound file. Note the timeline at the bottom as well as the “clock” below it. Frequency – (Y) Frequency, also known as pitch, on a spectrogram is read from left to right and a typical spectrographic scale will run from zero (0) to about thirty thousand (30,000) cycles per second, with the “cycles per second” being interpreted in Hertz (Hz) or kilohertz (KHz) which is Hz x 1,000. The frequency or “Y” axis is usually found on a vertical bar to the right of the spectrogram screen. This is where the “energy” of the wave form is found. An accurate understanding of frequency in the analysis of EVP is of the utmost importance. To get a better understanding of frequency we can say that in any case where physical events are cyclical (or at least nearly cyclical) the frequency acts as a measurement for the intangible events that take place. Frequency can be thought of like this: T / C = F T = Time, C = number of full cycles within given period of time, F = frequency. Basically what this means is that the number of cycles that complete in a given duration of time is the frequency of that event. So, if an event or sound occurs 120 times per minute it can be said that it has a frequency of 120 cycles per minute – or – 120 Hz = 1 longitudinal vibration per second.
10.4 – This photo of a spectrogram reveals an independent frequency analysis screen in the upper portion of the screen. Generally the frequency portion of the spectrogram is located on the right side vertical to the screen. Amplitude (Decibels) Amplitude is the intensity of the sound which is measured in decibels and is depicted as the sharp vertical striations on a waveform analysis. On a spectrogram, however, amplitude is depicted by color saturation or by the color coding that the sounds produce. This information, in addition to the frequency that is produced by the sound will give the investigator a good idea of what kind of acoustical energy was involved in making the sounds heard on the potential EVP.
10.5 – This color spectrogram / sonogram shows the sound amplitude (intensity) through a varying degree of color. A Final Note on Spectrograms And now for something truly obvious (which I cannot stress enough): When using sound enhancement/ analysis/ diagnostic software you should read the manual thoroughly and experiment with the program so that you have a feel for the tools you are using before you actually start modifying EVP files. The more familiar you are and the more you work with your spectrogram program the better you will get with it.
|