Overview “Known Unknowns”

New:  Google Image Displays of Chemical Structures

New:  Chemical Ionization GC-MS Substituting Liquid Reagents for Gases

New:  Conserving Helium in GCMS, GC, and DART Applications

New:  FREE Courses for Unknown Identification Using NIST Search and EI and MS/MS (Tandem) Libraries

New:  ACS 2021 Presentation:  GC-MS and LC-MS Identifications with NIST Search and Commercial/User Libraries

New:  Differences in EI GC Orbitrap and Standard Quadrupole Spectra

Systematic Process for the Identification of “Known Unknowns” in Commercial Products by GC-MS and LC-MS

Note: Details on other subjects are found in the "My Topics" tab under the sailboat picture above or in the sidebar links to the right.

Introduction:  In the last 34 years, we have developed a systematic process for the identification of “known unknowns” by GC-MS and LC-MS in commercial products.  We define “known unknowns” as non-targeted species which are known in the chemical literature or mass spectrometry reference databases, but unknown to the investigator.  The process is shown in the following simplified flowchart:


This process is described in detail in the February 2013 copy of LCGC, “MS-The Practical Art.”  The article is entitled:  “Identifying “Known Unknowns” in Commercial Products by Mass Spectrometry.”  A copy with associated ads is shown below:

LCGC PDF with Advertisements

The article originated from work presented at Pittcon in 2012.  The approach uses either NIST libraries or “spectraless” databases such as the CAS Registry or ChemSpider.  The “spectraless” ones are discussed at the bottom of this post.

[Image: types of databases and searches]

NIST Search of EI and CID Spectra:  The initial step in our process utilizes computer searches of EI (GC-MS) or CID (LC-MS) spectra against reference databases using the NIST MS search.
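To give a feel for what a computer library search does, here is a minimal sketch in Python. It scores a query spectrum against reference spectra with a plain cosine similarity. This is only an illustration of the general idea; it is not the NIST MS Search algorithm, and the function names and peak lists are made up for the example.

```python
import math

def cosine_match(query, reference):
    """Return a 0-1 cosine similarity between two nominal-mass spectra.

    Each spectrum is a dict mapping integer m/z to intensity.
    """
    shared = set(query) & set(reference)
    dot = sum(query[mz] * reference[mz] for mz in shared)
    norm_q = math.sqrt(sum(i * i for i in query.values()))
    norm_r = math.sqrt(sum(i * i for i in reference.values()))
    if norm_q == 0 or norm_r == 0:
        return 0.0
    return dot / (norm_q * norm_r)

def rank_library(query, library):
    """Sort library entries (name -> spectrum) by descending similarity."""
    scores = [(name, cosine_match(query, spec)) for name, spec in library.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)

# Hypothetical peak lists, for illustration only.
unknown = {43: 100, 58: 40, 71: 15}
toy_library = {
    "reference A": {43: 100, 58: 35, 71: 20},
    "reference B": {43: 100, 60: 80},
    "reference C": {91: 100, 65: 30},
}
for name, score in rank_library(unknown, toy_library):
    print(f"{name}: {score:.3f}")
```

Real search software adds intensity and mass weighting, handles accurate mass, and much more, so treat the scores here as a teaching aid only.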

I have developed two free “webinar” series courses that detail the use of the NIST search with EI and tandem libraries for the identification of unknowns:

Link to free webinar courses

The computer EI searches normally work better than CID ones, but the latter are still very useful.  We employ purchased, in-house, and “crowd-sourced” libraries.  The MoNA “crowd-sourced” libraries in NIST EI and Tandem formats are found at the following link:

MoNA EI and Tandem Libraries in NIST format

We use the NIST and Wiley commercial databases, but there are many other specialty databases that others might find useful.

The NIST search interfaces easily with a wide variety of manufacturers’ data processing and structural drawing programs:


“Spectra-Less” Database Searching:  If the NIST search is not successful, then accurate mass data is used to obtain a molecular formula (MF), a monoisotopic mass, or an average molecular weight.  This data is used to search very large databases such as the CAS Registry or ChemSpider  via web interfaces.  We define them as “spectra-less” databases because they contain no computer-searchable mass spectral data.  We had originally used this approach to search the TSCA database or our Eastman Chemical plant material listing.
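As a rough illustration of the quantities mentioned above, the following Python sketch computes a monoisotopic mass and an average molecular weight from a simple molecular formula, plus the ppm error of a measured mass. The element table is deliberately small, the parser handles only plain formulas without parentheses, and the function names are my own for this example.

```python
import re

# Monoisotopic masses (most abundant isotope) and average atomic weights in Da.
# Only a few common elements are included in this sketch.
MONOISOTOPIC = {"C": 12.0, "H": 1.007825, "N": 14.003074,
                "O": 15.994915, "S": 31.972071, "Cl": 34.968853}
AVERAGE = {"C": 12.011, "H": 1.008, "N": 14.007,
           "O": 15.999, "S": 32.06, "Cl": 35.45}

def parse_formula(formula):
    """Turn a simple formula such as 'C8H10N4O2' into {'C': 8, 'H': 10, ...}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        if element not in MONOISOTOPIC:
            raise ValueError(f"element not in this sketch's table: {element}")
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts

def monoisotopic_mass(formula):
    """Sum of monoisotopic masses for the neutral molecule."""
    return sum(MONOISOTOPIC[el] * n for el, n in parse_formula(formula).items())

def average_mass(formula):
    """Average molecular weight for the neutral molecule."""
    return sum(AVERAGE[el] * n for el, n in parse_formula(formula).items())

def ppm_error(measured, theoretical):
    """Mass error in parts per million."""
    return (measured - theoretical) / theoretical * 1e6

# Caffeine is used here only as a familiar example formula.
formula = "C8H10N4O2"
print(round(monoisotopic_mass(formula), 4), round(average_mass(formula), 3))
```

Note that these are neutral masses. An observed ion such as a protonated molecule has to be converted back to the neutral before searching by monoisotopic mass.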

The candidate structures from the CAS Registry or ChemSpider searches are prioritized by either the number of associated references or key words.  Other ancillary information such as mass spectral fragments in EI or CID spectra; isotopic abundances; UV spectra; types of ion adducts; CI data; number of exchangeable protons; etc. is used to narrow the list to one structure.  This website has many screenshots (SciFinder1, SciFinder2, ChemSpider) that illustrate these approaches with many examples.
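The prioritization idea can be sketched in a few lines of Python. The candidate records, field names, and numbers below are entirely hypothetical; in practice this sorting is done inside the SciFinder or ChemSpider web interfaces rather than in a script.

```python
def prioritize(candidates, keywords=()):
    """Order candidates by keyword hits, then by reference count (both descending).

    With no keywords, this reduces to sorting by number of references.
    """
    def keyword_hits(candidate):
        text = candidate["notes"].lower()
        return sum(1 for word in keywords if word.lower() in text)

    return sorted(candidates,
                  key=lambda c: (keyword_hits(c), c["references"]),
                  reverse=True)

# Hypothetical candidates sharing one molecular formula.
candidates = [
    {"name": "candidate A", "references": 12, "notes": "pharmaceutical intermediate"},
    {"name": "candidate B", "references": 950, "notes": "plasticizer used in PVC"},
    {"name": "candidate C", "references": 40, "notes": "polymer antioxidant"},
]

for c in prioritize(candidates, keywords=["antioxidant"]):
    print(c["name"], c["references"])
```

Sorting on a (keyword hits, references) pair lets sample knowledge, such as knowing the product is a stabilized polymer, override raw citation counts.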

Model EI and CID Spectra from NIST Structure Search:  The NIST MS Search program ranks model compounds employing structural searching of both our commercial and in-house databases.  This is particularly valuable for finding model compounds for a proposed structure found in searches of “spectra-less” databases such as the CAS Registry and ChemSpider.

As noted in the table above, there are ~800,000 structures associated with the EI spectra and 100,000 structures with CID mass spectra in our computer-searchable databases.  We use the NIST MS Interpreter  program to automatically correlate fragment ions in the EI and CID spectra with the component’s substructure.

“No Results” from Process:  There will be non-targeted species in the sample which are “unknown unknowns”, those not found in any reference libraries or “spectra-less” databases.  A few thoughts on their identification are discussed in another section.

Future Improvements Needed:  The approach works well for the majority of our samples which are fairly simple and contain components with molecular weights <500 daltons.  On the other hand, improvements are needed for complex samples and components with molecular weights >500 daltons.

For a chemist, viewing Google search results as structures using the “image” approach can be very useful. It is much quicker than sorting through text.

As an example, searches of molecular formula can be useful for the identification of unknowns when reference EI and MSMS spectra are not available. The following information compares this approach to our other approaches using ChemSpider and SciFinder. See what you think!

Solvent Mediated Chemical Ionization (SMCI), as demonstrated by Shimadzu in a poster session, can be a very convenient means for substituting liquids as chemical ionization reagents for GC-MS. Gases in lecture bottles are often inconvenient to purchase, somewhat hazardous to use, and often very difficult to dispose of when spent. NOTE: Ammonia gas, if not used properly, can be delivered to the MS source as a liquid (see link: Ammonia Gas Connection on 1st page)!

SMCI using propylamine was shown by Eastman Chemical in their recent work to be very useful in the ionization of Surfynol. This compound is very difficult to ionize with many gases, showing the loss of water and no molecular weight information. The propylamine was found to offer similar useful molecular weight information, which normally required the use of either ammonia or methylamine as shown in previous work at Eastman.

Other possibilities to try would be solutions of methylamine and ammonia in methanol that can be purchased from Aldrich, methanol-d4, ammonium hydroxide, deuterium oxide, propylamine in methanol-d4, etc. Of course, one could also do negative ion chemical ionization with a variety of solvents to get M-H or chloride adducts. The possibilities are somewhat endless, using either pure solvents or mixtures, depending on the class of compounds and the relative proton affinities of reagent and analyte.
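The role of relative proton affinities can be illustrated with a small Python sketch. The reagent proton affinities are approximate literature values in kJ/mol and should be checked against a current reference such as the NIST Chemistry WebBook before use. The analyte value is hypothetical, and the simple rule applied (proton transfer is favored when the analyte proton affinity exceeds that of the reagent) ignores kinetics and source conditions.

```python
# Approximate gas-phase proton affinities in kJ/mol (verify before relying on them).
PROTON_AFFINITY_KJ_MOL = {
    "water": 691.0,
    "methanol": 754.3,
    "ammonia": 853.6,
    "methylamine": 899.0,
    "propylamine": 917.8,
}

def proton_transfer_outlook(analyte_pa):
    """For each reagent, report the excess energy for proton transfer to the analyte.

    A positive excess means transfer is exothermic; a larger excess generally
    means a harsher ionization with more fragmentation. A negative excess
    suggests adduct ions with the protonated reagent are more likely.
    """
    rows = []
    for reagent, pa in sorted(PROTON_AFFINITY_KJ_MOL.items(), key=lambda kv: kv[1]):
        excess = round(analyte_pa - pa, 1)
        outcome = ("proton transfer favored" if excess > 0
                   else "adduct formation more likely")
        rows.append((reagent, excess, outcome))
    return rows

# Hypothetical analyte proton affinity of 870 kJ/mol.
for reagent, excess, outcome in proton_transfer_outlook(870.0):
    print(f"{reagent:12s} {excess:+7.1f} kJ/mol  {outcome}")
```

This kind of table is one way to think about why a higher proton affinity reagent can preserve molecular weight information for a fragile compound.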

Posted by: tvasailor | April 2, 2022

Helium Conservation in GCMS, GC, and DART Applications

Currently, as at several times in the past, the cost and supply of helium for GCMS and GC applications are cause for concern. Several manufacturers have taken notice and started to supply products and technical advice to address the problem.

Conserve Helium or Switch to Hydrogen or Even Nitrogen in GCMS and GC Applications? The most straightforward approach is to conserve helium by minimizing helium usage in GCMS applications. This avoids having to make changes to current well-established analytical methods.

A more radical approach is to switch from helium to hydrogen or even nitrogen for GCMS. Switching to other carrier gases shows promise and is discussed in a separate section below, but there are many challenges.

Thermo has a clever product, iConnect™ Helium Saver, for their Split/Splitless Trace™ GC systems. It uses helium for the injection and nitrogen for all other aspects of the injection process to conserve helium. They state that it can dramatically increase the lifetime of one cylinder of helium:

- 3.5 years when used continuously 24/7/365 for GC-MS
- 14 years when you either shut the helium off or divert to nitrogen overnight and on weekends
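The relationship between those two figures can be checked with simple arithmetic. The Python sketch below assumes that helium consumption is proportional to the time helium is actually flowing and is zero otherwise, which is my simplification and not a statement from Thermo. The function names are made up for the example.

```python
HOURS_PER_WEEK = 7 * 24  # 168

def implied_duty_fraction(continuous_years, extended_years):
    """Fraction of the time helium must be flowing to stretch the cylinder life."""
    return continuous_years / extended_years

def cylinder_lifetime_years(continuous_years, helium_duty_fraction):
    """Cylinder life when helium flows only a fraction of the time."""
    if not 0 < helium_duty_fraction <= 1:
        raise ValueError("duty fraction must be in (0, 1]")
    return continuous_years / helium_duty_fraction

fraction = implied_duty_fraction(3.5, 14)
print(f"helium flowing {fraction:.0%} of the time, "
      f"about {fraction * HOURS_PER_WEEK:.0f} hours per week")
print(f"lifetime at that duty: {cylinder_lifetime_years(3.5, fraction):.1f} years")
```

Under that assumption, going from 3.5 to 14 years corresponds to helium flowing a quarter of the time, which is roughly a working week of instrument use.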

Link to their brochure
Link to their website
Development Work Leading to Product
Excerpt Description of Injection Process from Agilent Manual

Agilent has created many useful documents and products. They compare the advantages of hydrogen over helium for GC separations and even discuss the use of nitrogen as a carrier gas in GC, although it is not recommended for GCMS. They also discuss a study that dramatically reduced helium usage at their Little Falls site. For example, a GC-MSD (GC-MS) using their Gas Saver hardware increased the lifetime of a helium cylinder from 109 to 252 days.

Link to Little Falls Site Study

Another Agilent presentation describes in detail the use of their 7890 Helium Conservation Module which uses nitrogen during standby. In one application, it extended the use of a cylinder of helium from 2 to 12 months.

7890 Conservation Module Information

Other Agilent Resources include:

Agilent Cost Saving On-Line Calculator
On-line overview from Agilent On-Line

Chromtech Eco-Saver for All GC Systems

Chromtech Device Link

Markes Multigas Thermal Desorber for GCMS

Markes Multigas Desorber

*****SWITCH from Helium to Hydrogen or Even Nitrogen for GCMS? *****

Manufacturers discuss the pros and cons of using hydrogen or even nitrogen as a carrier gas instead of helium for GC-MS analyses. Most cite limitations, but Agilent, Leco, Bruker, Shimadzu, and Anatune suggest these may not be so much of a problem.

Topics discussed include chemical noise, detection limits, changes in analyte EI spectra (reduction), tailing, in-situ source cleaning, self-CI, etc. when hydrogen or nitrogen are substituted for helium and how to minimize or avoid such limitations.


Nitrogen was used successfully as a carrier gas in a Shimadzu thermal desorption GC-MS method. In addition, other information was found, including ideas (e.g. 200 eV vs. 70 eV) for increasing sensitivity, thoughts on problems in method development, and work in progress on EPA method development. A good article describes maintaining speed in GC analyses using nitrogen, but does not discuss its use as a carrier gas in GCMS.

Leco, with hydrogen, notes an increase in S/N for all components in a complex tea leaf extract, as shown in the link to their study above. This is impressive, given a decrease in analysis time by a factor of 5 while maintaining good retention indices compared to helium and good EI library search quality. Also, there is no discussion of any significant delay times in source “acclimation.” Possibly this is due to their somewhat unique “open-style” EI source?

Bruker, with hydrogen, says their SCION GC-MS with the Helium Free Analyzer option offers good pumping and injector design. They indicate that all 76 components in EPA Method 8270, including nitro and nitroso compounds, gave high quality searches against the NIST library using AMDIS deconvolution. Others have noted that nitro compounds can yield spectra showing in-source reduction.

Agilent Jet-Clean technology and New Hydroinert Source: Their source actually uses hydrogen in instruments that use helium as a makeup when the instrument is at idle to remove build-up of “dirt” in the source. Thus, using hydrogen as the carrier might also offer similar advantages by minimizing source cleanings. Also, a new “Hydroinert” source introduced at ASMS 2022 minimizes undesirable chemical reactions.

Agilent Jet-Clean info for “in-situ source” cleanings

New Hydroinert Ion Source

*****SWITCH from Helium to Nitrogen for DART Applications*****

Direct Analysis in Real Time (DART) mass spectrometry commonly uses helium as the DART gas. With the looming helium shortage, other gases are being evaluated for DART. Nitrogen is inexpensive and readily available, making it a desirable alternative. However, NO+ reagent ions present in positive-ion nitrogen DART result in extensive oxidation for many compounds. Using a narrower aperture on their ceramic insulator cap significantly reduces the problem.

DART ASMS Journal Link



Agilent Webinar (Jim McCurry)
Another Agilent (Shannon Coleman discusses part of Little Falls Study shown above)
Yet Another Agilent (Bryan White)
Axion (switching GC from He to H2)
C&EN Podcast World Supply Outlook

MRI and NMR helium usage can also be decreased by new technology. Many years ago our Cryolect GC-IR had a similar approach.

Limiting Amount Needed in Siemens MRI

Party and weather balloons account for about 7% of helium usage per year. I saw this figure in several places on the internet, but was surprised it was such a relatively high number.

Identification of Unknowns by GC-MS and LC-MS Using NIST Search with Commercial and User Libraries

Copy of Talk in PDF

Link to YouTube Video


For the last 20 years, Eastman has routinely employed the NIST search for the identification of unknowns.  Initially, we  primarily utilized EI searches, but in recent years, the increase in the availability and quality of tandem libraries plus associated software has greatly enhanced LC-MS library search results.  More than 50 instruments and associated users are networked at Eastman world-wide.  Our corporate library of >55K entries in NIST format is automatically updated and distributed nightly.

The 2020 edition of the NIST library utilized with the NIST MS 2.4 search software offers many major enhancements.  NIST for the last six years has had an ambitious program to acquire and analyze targeted chemicals (>50,000).  In addition, the novel hybrid search expands the scope of all EI and tandem libraries.  These new improvements and free training resources,  including videos and associated handouts, will be described.  EI/tandem libraries utilized include NIST, Wiley, MoNA, and user-created ones.

ACS 2021 Symposium: Compound Identification with LC/MS and GC/MS: Experimental Method, Data Analysis and Applications

Liquid chromatography and gas chromatography coupled with mass spectrometry (LC/MS, GC/MS) are two major analytical techniques for identifying compounds in complex mixtures. This symposium provides a forum for experts from industry, academia and government agencies to share their novel experimental methods, creative data analysis techniques including mass spectral library searching and software development, and the applications of these techniques, with special emphasis on the use of LC/MS/MS in metabolomics, proteomics, lipidomics, forensics, pharmaceutical and environmental studies.

Posted by: tvasailor | January 31, 2021

NIST MS Interpreter

NIST MS Interpreter is a utility developed at the NIST Mass Spectrometry Data Center to assist in the evaluation of mass spectra. The Interpreter finds possible structural origins of peaks in a mass spectrum and provides formula and isotopic processing utilities. It operates in conjunction with the MS Search Program. It works in both nominal and accurate mass modes.

Below are some course resources teaching the use of the program:

Video for EI Use
Handout for EI Use
Video for Tandem Use
Handout for Tandem Use

Below are other resources from NIST:

New Developments Poster
New Enhancements Accurate Mass Tandem Spectra
Basic Description of Program

Posted by: tvasailor | January 11, 2021

Bread Recipes from Betty Ottenfeld

In the early 80’s, my wife Sandra took a bread baking course at the Presbyterian Church. Our family was highly appreciative. Sandra scanned the cookbook handout from the course.

Link to Cookbook Handout in PDF

Picture of Betty, a bird-lover, below:

Posted by: tvasailor | January 1, 2021

Buying NIST “20” EI and Tandem (MSMS) Libraries

The new libraries for 2020 include the NIST MS Search Software at no additional cost.

There were major increases in the number of compounds and major software enhancements.

Link to 2020 NIST Updates

The NIST 2020 Tandem library is ridiculously inexpensive, very high quality, and contains 1.3 M spectra!

Be sure to shop around. Here is the total list of distributors:

Total List of Distributors

Here are some good websites that I have noted for distributors. Let me know if you have found others.

Diablo Analytical (convenient same-day ftp download)
Adaptas Solutions (formerly SIS)
Cerno Biosciences
GC Image
MS Wil
Stanton Scientific

The EI library package includes the Lib2NIST utility, but the Tandem library package does not.

Link for Lib2NIST Utility

MS/MS (Tandem) spectra can be used to identify unknowns employing library searches. This is accomplished in much the same approach as that employed for EI GC-MS with the NIST search software. The much improved NIST Search Version 2.4 is included with the 2020 library release. I will attempt to introduce the user to the use of the NIST Version 2.4 search software employing NIST, Wiley, “crowd-sourced (MoNA)”, and user libraries for the identification of unknowns.

NIST has an ambitious program to extend their already comprehensive, high-quality MS/MS databases. See the following link:

NIST Pipeline for Extending MS/MS Libraries

This involves a very comprehensive process for selecting pertinent compounds for purchase and subsequent analyses. The problem with variability of the spectra is reduced by analyzing the samples on a variety of instruments at a variety of energy levels (20 steps). In addition, many different precursor ions and fragment ions are characterized (MSn). This has already led to a very large high quality library of accurate mass MS/MS spectra in the 2020 release version (31K compounds/1.3 M spectra) and the effort is ongoing.

YouTube Videos

Part I: Overview (17 min)
Part II: NIST MS/MS Search (18 min)
Part III: Detailed Discussion Hybrid MS/MS Search (18 min)
Part IV: Importing MSMS Spectra (11 min)
Part V: NIST Structure Searches (12 min)
Part VI: MS Interpreter (12 min)
Part VII: Creating-Using MSMS Libraries (12 min)
Part VIII: “Spectraless Libraries” (12 min)

Higher Quality Videos Zipped Pt. I-VIII

Detailed Handouts

Part I: Overview 12/27/20
Part II: NIST MSMS Search 12/27/20
Part III: Detailed Discussion Hybrid MS/MS Search 12/27/20
Part IV: Importing MSMS Spectra 12/27/20
Part V: NIST Structure Searches 12/27/20
Part VI: MS Interpreter 12/27/20
Part VII: Using-Creating MSMS Libraries 12/27/20
Part VIII: “Spectraless” Libraries 12/27/20

Handouts Pt. I-VIII_Zipped-12-27-20


Using with Manufacturers’ Data Processing Software
Buy a copy! Distributors of NIST Libraries Shop Around for Best Price!
NIST Tandem Quick-Start Guide
DeltaMass Table in Nominal Mass
Beyond the Top Hit: Information from Hybrid Similarity Search Hit Lists
MS_Interpreter Correlating Structure to Spectrum
Lib2NIST (not included with NIST tandem library)
Importing Spectra to NIST Search (all internet links)
Components in MoNA Libraries
MoNA MSMS Library in NIST Format
MoNA EI Library in NIST Format
MoNA Download Site in MSP and SDF Formats
Resources for NIST EI Searches
PowerShell Script for Adding Precursor_m/z Field to User Library
Users’ Manual for NIST 2.4
NIST Downloads Libraries-Tools-Software
NIST MS 2020 at ASMS

Posted by: tvasailor | December 11, 2020

Identifying Unknowns Using LCMS and GCMS with Library Searching

This is a free instructional series including YouTube videos, detailed handouts, and miscellaneous resources.

EI GC-MS Identifications: The first part is for EI GC-MS identifications and is accessed at the following link:

Link to EI GC-MS ID of Unknowns with NIST Search

LC-MSMS Identifications: The second part is for LC-MS-MS identifications and is accessed at the following link:

Link to LC-MS-MS ID of Unknowns with NIST Search

Using with Manufacturers’ Data Processing Software:

Link to Instructions

