In MS-based metabolomics, data processing starts from a collection of raw data files, each corresponding to a single biological sample. An LC-MS data file is a set of sequentially recorded histograms, each representing the hits of ionized molecules on the detector during a very short time frame. Each histogram consists of a number of data points, each pairing an m/z value with an intensity. The basic purpose of data processing is to transform the raw data files into a representation that gives easy access to the characteristics of each observed ion. These characteristics include the m/z and retention time of the ion and an intensity measurement of the ion from each raw data file. Beyond these basic features, data processing can also extract additional information, such as the isotope distribution of the ion.
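
The transformation described above can be illustrated with a minimal Python sketch. It is not taken from any particular software package: the `Scan` class, the function names, the tolerance values, and the toy data are all hypothetical, and the intensity measurement is simplified to a plain sum of data points inside fixed m/z and retention-time windows, whereas real tools use peak detection and alignment.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Scan:
    """One recorded histogram (hypothetical structure for illustration)."""
    retention_time: float              # seconds at which the scan was recorded
    points: List[Tuple[float, float]]  # (m/z, intensity) data points


def ion_intensity(scans: List[Scan], target_mz: float, target_rt: float,
                  mz_tol: float = 0.01, rt_tol: float = 5.0) -> float:
    """Sum the intensities of all points inside the m/z and RT windows.

    Assumption: a fixed-window sum stands in for a real peak integration.
    """
    total = 0.0
    for scan in scans:
        if abs(scan.retention_time - target_rt) > rt_tol:
            continue  # scan lies outside the retention-time window
        for mz, intensity in scan.points:
            if abs(mz - target_mz) <= mz_tol:
                total += intensity
    return total


def build_feature_table(
    raw_files: Dict[str, List[Scan]],
    ions: List[Tuple[float, float]],
) -> Dict[Tuple[float, float], Dict[str, float]]:
    """Map each ion (m/z, retention time) to one intensity per raw data file."""
    return {
        (mz, rt): {
            sample: ion_intensity(scans, mz, rt)
            for sample, scans in raw_files.items()
        }
        for mz, rt in ions
    }


# Toy data (invented): two samples, each a short list of scans.
raw_files = {
    "sample_A": [
        Scan(10.0, [(100.000, 50.0), (200.000, 5.0)]),
        Scan(11.0, [(100.002, 70.0), (200.001, 8.0)]),
        Scan(60.0, [(100.001, 999.0)]),  # same m/z, but far away in time
    ],
    "sample_B": [
        Scan(10.5, [(100.001, 20.0)]),
        Scan(11.5, [(99.999, 30.0), (200.000, 12.0)]),
    ],
}

# Ions of interest, given as (m/z, retention time in seconds).
ions = [(100.0, 10.5), (200.0, 10.5)]

table = build_feature_table(raw_files, ions)
for ion, intensities in table.items():
    print(ion, intensities)
```

In the resulting table, each row is an ion identified by its m/z and retention time, and each column holds that ion's intensity in one raw data file, which is the kind of representation the paragraph above describes.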
