Privacy and Security Notice

Data Analysis for EG1

Data Analysis for EG1

The following is my ongoing attempt to list and categorize the necessary tasks for the EG1 data analysis. Since my understanding of some of the issues is rather limited, I am very grateful for email with corrections, additions and more details. Also, it would be very helpful if as many students as possible could pick out still-unclaimed pieces of the work soon and begin their analysis tasks. Inform me if you plan to work on any or several aspects listed below.

Additional information of interest can be found at Alex Skabelin's web page and Raffaella Devita's web page. Please inform me of any additional web pages you aware of which are related to EG1 analysis.

Note that we continue to have weekly EG1 meeting every Thursday, 10:30 a.m. in L207 CEBAF Center. We will alternate between target (and EG2000-) related meetings and analysis meetings (every other week). The room is reserved for all of 1999. If you want to receive meeting reminders for the analysis meetings, sign up to the EG1_RUN email list.

- Sebastian

Table of Contents

Organization of runs

Calibration

Production

Physics results

Publish

Organization of runs

Database of runs

We have a complete database of all "non-crashed" electron beam runs taken during EG1 (since mid-September) correlated with additional information like target polarization etc.

This database can be found in my directory www.cebaf.gov/~kuhn (or on jlabs1 in ~kuhn/public_html) as either EG1RunDB.txt (a tab-delimited ASCII file) or as EG1RunDB.exc (an Excel '98 file with additional information). The same information (or at least most of it) is contained in the offline database maintained by Alex Skabelin (password protected). The content from these databases comes from the INGRES run database at http://db3.cebaf.gov/clas-online/runinfo.html, the set of runsummaries#.txt files maintained by the shift leaders in the "clasrun" root account, Raffaella Devita's analysis of target polarization (only the entries which have both a value and a sigma are re-analyzed; the rest is online information) and the electronic online logbook. Future work needed to improve this database would be:

  1. Enter actual (best-guess) beam polarization data from Tony's analysis of Møller runs (after all corrections are known)
  2. Finish re-analysis of target polarization
  3. Check hand-written logbooks for additional important information, especially on trigger configurations (and changes), unusual detector parameters and status, etc.
  4. Add information on raster settings if they can be obtained, and possibly target magnetic field direction (and whether it's on or off)

All "chefs" and other analysis workers are supposed to add information to the offline database/logbook on each run they work with.

Beam polarization history (Workers: Tony Forest with help from Arne and Brian)

Ultimately, the average beam polarization needs to be entered in the database for each run. This requires that all Møller runs are analyzed to the best of our knowledge, cross-correlated with Møller measurements in Hall A (see, e.g., their list at http://www.jlab.org/~moller/raw_results.html ) and Mott measurements done by the accelerator (see, e.g., the accelerator logbook at http://opweb.cebaf.gov/internal/ops/daily_activity_log/html/logdir.html ). From the accelerator logbook, one can also retrieve the exact beam energies and the setting of the Wien filter (the spin launch angle). This information, together with the program for polarization angles in all Halls by B. Mecking (ask me if you don't know how and where to run it) is needed to determine the relative correlation between helicity signal and actual beam polarization direction in the Hall. Also, the status of the extra l/2 wave plate needs to be factored in (it was moved into the beam on December 9, I believe), since it introduces another overall factor (-1) into this correlation. The goal is to get a best-estimated time history of the beam polarization throughout all of EG1 which can then be used to estimate the average polarization for each run in the database.

This job is mostly finished by now; the results need to be corrected (very small correction) for proper analyzing power (from MC simulation done at FIU) and entered into the database.

Target polarization history (Workers: Raffaela Devita and Renee Hutchins)

This requires an offline analysis of all polarization measurements done, again yielding a database of target polarization (including sign) vs. time. Ultimately, this involves analysis of optimum background and pulse shape, and in the case of the deuteron, a comparison of the ratio method and the direct NMR area determination. For additional checks one needs to compare with the "online" values as written down and the runsummaries#.txt files. Once the "target computer times" have been reconciled/aligned with the "DAQ time", we can again extract the average target polarization for each run. Note that a more precise estimate of the target polarization would require to use all available information on the raster size, the charge deposited on each target and the relevant decay times to extract the polarization of the fraction of the target actually seen by the beam.

This job is done to a large part; the biggest part missing are the deuteron polarization numbers. We will probably never get a very accurate target polarization from the direct measurements, especially not for the runs where we kept burning a small raster pattern into the NH3 target. Our best bet will be to attempt an absolute determination for the early runs and then use known elastic asymmetries to get info on the remaining runs.

Lists of runs

  1. Non-standard Calibration runs - any runs with non-standard running conditions that can be used to study aspects of the detector calibration (e.g., torus field or target field off, no LHe, no rastering, or overrastering, non-standard triggers for efficiency studies, pedestal and cosmic runs, and of course Møller runs). These runs have been ignored so far and are not contained (in general) in any of the offline databases.
  2. Junk runs - any runs where software or hardware failures make successful analysis unlikely. In this category belong also runs where the beam was unstable (in intensity or position), where the detectors tripped off during the run or other detector parameters changed markedly, where the target polarization was varying rapidly, or any other known problems exist that make it impossible or exceedingly difficult to extract reliable data. Those runs for which we KNOW their "junk" status are either not in the database at all, or marked by comments. However, some of these runs will only be eliminated by offline inspection of histograms or cuts.
  3. Production runs - any runs on NH3, ND3, Carbon and empty targets that don't fall under the first 2 categories. These are the runs that are cooked in Pass 1.
  4. Calibration runs - any runs from category c) which were used for Pass 0 analysis. These are runs where all run parameters have standard values, no hardware or software problems existed, the beam was stable, and we have at least 5 Million triggers in them. We chose a set of runs evenly distributed throughout the experiment and scattered over all running conditions. They are marked by blue numbers in the EXCEL file and by "golden" shading in the offline database. All CLAS detectors have been calibrated using these runs.

Runs can be moved from one category to another later on, as information is added to the data base. For instance, runs during which the charge asymmetry of the beam exceeded 10-3 might be declared junk runs.

Calibration

All detector elements (DC's, TOF, Cerenkov, EC/LAC) needed to get their calibration constants (gains, offsets, start times,…) updated. These updated calibration constants were moved into the parameter maps. We might still need to check that target field and type, polarization, and torus polarity, are also entered correctly in those maps.

The following sequence (with preliminary names attached) was created by Stepan Stepanyan in preparation for the calibration. It may be out of date by now.

Code/algorithm improvements/modifications

Each event in the run is associated with -

· Strong magnetic field near the production vertex

· Beam helicity

· Beam position on the target (raster)

all of which require modification of analyses algorithm. (We can't really include raster information yet; the strong magnetic target field has been implemented but can also be circumvented by using "pseudo vertex" information).

Code modifications:

· Modification of fitting procedure

· Reconstruction of the vertex using raster information (NOT DONE)

· Reconstruction of three-vector at the vertex

· Store helicity information in the high DST banks

· New run control parameters

· Design the shell of the code to include all DST banks

· Monitoring histograms and database

· Scripts for farm processing and monitoring

Pre-calibration preparations

Set the initial calibration constants -

· Time-of-flight counters (Duke U. group, Volker Burkert, Hovanes Egiyan)

· DC timing (done according to Liming Qin)

· RF TDC

· Define set of the runs to be processed for calibration (see previous Section " Lists of runs")

· Get the "tagged" version of the analyses code (RECSIS) with first order modifications;

· Get the monitoring scripts/code (and the data bases?) ready to test - possible connection with the eg1 off-line logbook.

Proposed Ending time - February 12.

Processing for calibration (10%)

· Group the selected runs (by the beam energy and the target);

· Assign people (grad.stud.) to each group;

· Process all selected files using tagged code and scripts;

· Monitoring processing the data;

· Store all necessary BOS banks for the calibration;

Proposed Starting time - February 12.

Ending time - February 23.

Calibration.

Assign people to the following calibration tasks -

· DC - time, geometry/field (Rob Feuerbach), time-to-distance function and parameters (Junho Yun - only job done so far), vertex reconstruction (Franz Klein, Stepan Stepanyan, Tony Forest) "Roads" for all 5 different torus settings used during EG1 (Franz Klein - seems o.k.);

· SC - time calibration of TOF strips, determination of RF offsets, changes due to interchanged cables, time walk corrections energy losses (Duke/W&M/UVa/Elton Smith/Rustam Niyazov)

· CC - efficiency (Alex Vlassov), timing and amplitude;

· EC - energy (Cole Smith, Stepan Stepanyan), timing (Mehmet Bektasoglu, ODU)

· LAC - time, energy (Genova/INFN group);

· PID (Stepan Stepanyan, others);

· Helicity monitors (Tony Forest, Alex Skabelin, Raffaella Devita);

· Beam position (rastering) and current (Tony Forest and Sebastian Kuhn);

Proposed Starting time - February 12.

Ending time - March 21.

Re-processing of calibrated runs.

Experts should check the quality of the calibration.

· Detector specific characteristics;

· PID;

· Physics distributions;

· Specific analyses correlated to the beam and the target polarizations;

If requires, new processing for additional calibration should be done.

Proposed Starting time - March 23.

Ending time - March 31.

In fact, calibration was really finished by the end of April.

Production

Under this headings fall all analysis tasks related to accumulate statistics for all event types that we are interested in.

Cooking - EG1 Pass 1

Cooking entails a first pass analysis of all production runs using a standard package like A1 or RECSIS. The goal is to extract from each run all good events with electron tracks, including pattern recognition, matching of tracks with hits in the TOF, Cerenkov and EC/LAC detectors, and identifying electron tracks and particle ID for all other tracks. The resulting banks can be used for either a second pass analysis (if there are major improvements in the analysis code and parameters) or, more to the point, for final analysis of publishable data.

Use the same scheme as for calibration -

· Group the selected runs (by the beam energy and the target) into 3 separate groups

· Assign 2 people (Renee and Alex, Junho and Angela, Mehmet and Raffaella) to each group;

· Store BOS banks (PART, EVNT and other high-level banks, plus some fraction of raw data).

· Bookkeeping, monitoring (look at histograms, run "tester");

ACTUAL Starting time - Beginning of May.

List of DST banks -

HEAD 0 8 1 8 I

TGBI 0 1 1 1 B32

CALL 0 3 3 5 B16

EVNT 0 18 5 90 I,F,F,I,F,F,F,F

DCPB 0 13 4 52 I,F,F,F,F,F,F,F

ECPB 0 16 4 64 I,F,F,F,F,F,F,F

SCPB 0 6 4 24 I,F,F,F,F,I

CCPB 0 6 1 6 I,F,F,F,F,I

HEVT 0 16 1 16 I,I,I,I,I,I,F,F

TBID 1 34 5 170 I,I,F,F,I,I,F,F

PART 1 13 5 65 I,F,F,F,F,F,F,F

and all scaler banks.

Data volume will be - ~ 1 Kb/event (with e-).

In the current "cooked" files in addition -

CC 3 3 1 2 B16

DC0 1 2 56 56 B16

EC 1 3 2 3 B16

EC1 1 5 5 13 B16

SC 4 5 1 3 B16

TRKS 1 10 1 10 I,F,F,F,F,F,I,I

TDPL 5 8 10 80 I,F,F,F,F,F,F,F

HBTR 0 9 5 45 F,F,F,F,F,F,F,F

TBTR 0 10 4 40 F,F,F,F,F,F,F,F

HBER 0 23 5 115 F,F,F,F,F,F,F,F

TBER 0 23 4 92 F,F,F,F,F,F,F,F

Data volume will be - ~ 6.5 Kb/event (with e-).

For calibration data volume will be - ~ 8.5 Kb/event (with e-).

Extract polarization information (Workers: Tony Forest, Alex Skabelin, Raffaella Devita)

We need to set up a consistent scheme to correlate beam events (scattered electrons) and their helicity information with the scaler events that record the integrated current and other parameters every second, also keyed with helicity. We need to check that all helicity signals are consistent and use cuts to exclude helicity pairs where this correlation is "confused", or where any relevant beam parameters (intensity, position,…) fluctuate to strongly, or where unusual backgrounds are present. Ultimately, we need a mask to extract only those events that pass these tests and sort them properly according to beam helicity. It is important to study whether any of these cuts might introduce false asymmetries (and vice versa, which cuts we need to minimize existing false asymmetries). We need to collect the charge accumulated for each beam helicity in a run, excluding scaler events where data where not taken (no beam) or didn't pass our cuts at all. One problem is that we cannot trust the "fast" nA-BPM charge measurements, so we have to first develop a reliable normalization scheme. The best way seems to be to use the Faraday Cup information (maybe cross-check with Beam scintillation counter information). We need to cross-check our normalization with guaranteed "unpolarized" events (e.g., 2-proton emission or W < 0.8 GeV events). We also need to very carefully analyze deadtime/Poisson effects. A final cross check will be the comparison between opposite target polarizations.

Optimize analysis code

Before starting a Pass 2 reanalysis of EG1 (most likely only after and in conjunction with EG2000), one would want to improve the analysis code. Possible areas of improvement are:

Tracking http://claspc2.cebaf.gov/~marki/dc_workshop/summary-1.html , TOF, Cerenkov, EC/LAC, PID, extracting E', theta, phi, reconstructing physics parameters. Among the things that are still suspect are the alignment and orientation of the Torus coils, the relative alignment of the drift chambers to those and to each other, and other possible effects that lead to bad tracking (ignoring wire sag, offset and distortions of the drift chambers, incomplete or sub-optimal implementation of all magnetic fields, etc.) One could also improve tracking if the time-to-DOCA function parameters could be separately determined for each sector individually, and in the case of Region 2 Drift chambers also for different polar angle segments (e.g., extremely forward wires, forward wires, intermediate and backward wires).

Barring these improvements, we will need to apply a "Volker-Burkert-like" correction to line up all W distributions for different sectors, different f within sectors, and different scattering angle/vertices.

Cuts

In addition to the helicity-dependent cuts (see above), we also need to impose additional cuts. Most importantly, we will use a fiducial acceptance cut on the reconstructed "pseudo-vertex" to avoid regions where CLAS has a large (and hard to pin down) inefficiency (e.g., center strip in the Cerenkovs).

We need to study the polarization dependence as well as efficiencies of all cuts. Some cuts are "predetermined", especially by the trigger thresholds and by the cuts used in the particle

ID (SEB) code. Again, we need to gather as much information on the efficiency of these cuts.

Sorting

Events that pass all cuts need to be sorted into histograms or ntuples. For the inclusive data, we need for each run a 2D histogram of Q2 vs. W for each helicity separately, normalized to beam current integral. The binning must be determined by considerations such as statistics, resolution and Physics interest. We need to study deadtime corrections Poisson statistics and truncation.

We also need histograms for possible background events from pi-, charge symmetric events (pi0 decay into electron-positron pair), accidental coincidences (hadron plus photon?), non-beam related events (cosmics, noise) and non-target related events (Møller electrons, "junk" from downstream and upstream). Combine histograms for all runs with identical beam and target parameters. Compare +/- target polarization and combine, compare l/2 plate in/out and combine. Calculate raw asymmetries and/or count rate differences.

Physics results

GSIM (Workers: Junho Yun, Sebastian Kuhn, many others)

We need as accurate as possible simulations of the spectra for both protons and deuterons alone as well as for targets with different nuclear species (Nitrogen or Carbon, Helium, Aluminum,…). This information is required to extract the dilution factor (see below) and/or the acceptance of CLAS as well as its resolution.

For the purpose of determining the acceptance, we can use existing parametrizations of cross sections and generate events (using MC_ASK, including radiative effects) which are roughly distributed like actual events from a real target (we might want to simulate H, D, C and He seperately and then combine them for a realistic fit of each of our targets). These events would then be run through GSIM, followed (if at all possible) by GSIMKO and a acceptance correction for the trigger and the other cuts (especially the fiducial cut). The ratio between accepted and thrown events as a function of E' and q can be used to approximate the actual CLAS acceptance for electrons. (For additional particles, one might need a more sophisticated approach.) The difference between reconstructed and thrown kinematic variables can be parametrized to approximate the CLAS resolution. We can check our results by comparing the simulated "He+C" spectra (with adjustable overall normalization) to the empty and Carbon target runs.

One potential problem is that the simulated tracking efficiency may be different from the actual one, especially dependent on the general direction of the track (whether it goes through noisy regions of drift chambers etc.) and on beam intensity. One might try "peppering" the simulated evens with real noise events to check these effects.

The dilution factor can be gotten by making a 3-parameter fit of the simulated H or D data plus the measured Empty and Carbon run data to the observed NH3/ND3 spectra. This fit should get the relative normalization for the Empty and Carbon data and will be mostly constrained in the sub-elastic region (where only non-hydrogen data contribute, e.g. W<0.8 GeV) and the elastic region (where the shape of the peak from hydrogen should be very well simulated). The form of the fit would be

N(NH3) = a N(Empty) + b N(Carbon) + g Nsimul(Hydrogen)

The dilution factor could then be determined as

DF = [N(NH3) - a N(Empty) - b N(Carbon)] / N(NH3) .

Extract proton/deuteron data

There are two possible approaches to extract Physics data from the inclusive data:

I) Via asymmetries and dilution factor:

One takes the raw asymmetry and divides it by the dilution factor determined as described above. In the (quasi-)elastic region, one can use the well(-enough)-known nucleon form factors to predict the asymmetry. Comparing with the result from the data, one can get a reliable number for the product of beam polarization and target polarization (cross check with expected results from Møller and NMR measurements). After normalizing with this product, one is left with the radiated asymmetry A||. For final results, one has to radiatively correct this asymmetry (see below) and then extract quantities like A1 (assuming we can either determine A2 from different energy data or use a plausible ansatz for it). Note that A2 is limited by

|A2| ² [R.(A1+1)/2]1/2

where R is the ratio of longitudinal to transverse cross section (which must ALSO be known to extract A1 from A||). On the other hand, one can obtain g1 from A|| by using an existing fit to the world data on W1 and W2/R in the resonance region.

 

II) Via cross section differences

One takes the acceptance from the GSIM simulation and corrects (i.e., divides) the measured countrate difference N+ - N- (for opposite beam helicities) with it. One then integrates over the well-separated (?!?) (quasi-)elastic peak (using a fiducial range in W and Q2) and normalizes all of the data with the result. This should "automatically" take out the product of hydrogen density times beam intensity times beam polarization times target polarization, since this product enters both the (quasi-)elastic peak and all the other data in the same way. Any overall constant error in the acceptance also will be cancelled. One can calculate the theoretical cross section difference in the (quasi-)elastic peak (integrated over the same kinematic range), and by multiplying the normalized data with this number one obtains the cross section difference for the whole measured range.

From the cross section difference, it is (more) straightforward to extract the spin structure function g1 which is less "contaminated" by our lack of knowledge about A2 or g2. On the other hand, by dividing with a fit to existing absolute cross section data, one can obtain A|| again.

Both methods have their advantages and disadvantages, and both should probably be pursued in parallel. The "asymmetries" method may be best suited to extract asymmetries for specific resonances, while the "cross section difference" method might be a quicker route to integrals (of the GDH/Ellis-Jaffe/Bjorken type).

Radiative, resolution and nuclear corrections

The final step to get Physics results requires 3 corrections (that I can think of right now) for the inclusive data, for either of the two analysis methods outlined above.

  1. We need to correct for the (small) contamination from the polarized proton in 15N and possibly for residual 14N contributions and/or H contributions in the ND3 case. This correction should be small (even if 15N were 100% polarized, which it isn't - its more like in the 10% ballpark), the contribution would be only 1/9 of the H signal (since there is only 1 bound proton for every 3 free ones, and according to the shell model it's polarization is only 1/3 carried by its spin and 2/3 by its orbital angular momentum). This correction can be done with a simple Fermi-gas type model of 15N; the other contributions require some more work (if we think they could be significant).
  2. We need to account for the finite resolution of CLAS. This can be treated together with 3).
  3. We need to correct for radiative effects. For this purpose, we need semi-reliable models for cross sections and asymmetries, including adjustable parameters to fit our data. These models can be used to predict both Born cross sections for H/D(e,e') and, after running them through a radiative code (I propose RCSLACPOL which we are maintaining here at ODU) and smearing them (using the resolution parametrization from GSIM, see above), be compared to our actual data. There is a standard method how to use the results to extract radiative corrections in terms of a additive and a multiplicative term (see upcoming radiative corrections workshop where Frank Wesselmann will explain this method). I am presently working on setting this machinery up for EG1. We will also need a reasonable target model for external radiative corrections (Raffaella will look into that).

The end result of these corrections will be to extract either A||(Born) or g1(Born).

Final results

Resonance amplitudes/asymmetries, integrals, semi-exclusive channels, …

Publish

Write theses, publications, go to beautiful, exotic places to give talks,…