Full redrock API Reference

This is used to include docstrings from modules. See the autodoc documentation.

If you’re loading a module here, and don’t see some functions, try adding the :imported-members: option.

redrock

Redrock redshift fitter.

redrock.archetypes

Classes and functions for archetypes.

class redrock.archetypes.All_archetypes(lstfilename=None, archetypes_dir=None, verbose=False)[source]

Class to store all different archetypes of all the different spectype.

Parameters:

lstfilename (lst str) – List of file to get the templates from
archetypes_dir (str) – Directory to the archetypes

class redrock.archetypes.Archetype(filename)[source]

Class to store all different archetypes from the same spectype.

The archetype data are read from a redrock-format archetype file.

Parameters:: filename (str) – the path to the archetype file

eval(subtype, coeff, wave, z, R=None, legcoeff=None)[source]

Return archetype for given subtype, coefficients, wavelengths, and redshift

Parameters:

subtype (str) – comma separated str of archetype subtype(s)
coeff – array of archetype coefficients
wave – wavelengths at which to evaluate template flux
z – redshift at which to evaluate template flux

Options:: R : array[nwave,nwave] resolution matrix to convolve with model legcoeff : array of additional legendre coefficients

Returns:: archetype flux array

Notes

No factors of (1+z) are applied to the resampled flux, i.e. evaluating the same coeffs at different z does not conserve integrated flux, but more directly maps model=templates.dot(coeff). If legcoeff is 1D it is applied to all cameras (like the templates); if it is 2D[m,n] then they are for m cameras and n polynomials.

get_best_archetype(target, weights, flux, wflux, dwave, z, per_camera, n_nearest, trans=None, solve_method='bvls', prior=None, use_gpu=False)[source]

Get the best archetype for the given redshift and spectype.

Parameters:

target (object) – target object.
weights (array) – concatenated spectral weights (ivar).
flux (array) – concatenated flux values.
wflux (array) – concatenated weighted flux values.
dwave (dict) – dictionary of wavelength grids
z (float) – best redshift
per_camera (bool) – True if fitting needs to be done in each camera
n_nearest (int) – number of nearest neighbours to be used in chi2 space (including best archetype)
trans (dict) – pass previously calcualated Lyman transmission instead of recalculating
solve_method (string) – bvls or pca
prior (2d array) – prior matrix on coefficients (1/sig**2)
use_gpu (bool) – use GPU or not

Returns:

chi2 of best archetype zcoef (array): zcoeff of best archetype fulltype (str): fulltype of best archetype

Return type:

chi2 (float)

nearest_neighbour_model(target, weights, flux, wflux, dwave, z, n_nearest, zzchi2, trans, per_camera, dedges=None, binned=None, use_gpu=False, prior=None, ncam=None)[source]

Nearest neighbour archetype approach; fitting with a combinating of nearest archetypes in chi2 space

Parameters:

target (Target) – the target object that contains spectra
weights (array) – concatenated spectral weights (ivar).
flux (array) – concatenated flux values.
wflux (array) – concatenated weighted flux values.
dwave (dict) – dictionary of wavelength grids
z (float) – best redshift
n_nearest (int) – number of nearest neighbours to be used in chi2 space (including best archetype)
zchi2 (array)
trans (dict)
per_camera (bool) – If True model will be solved in each camera
dedges (dict) – in GPU mode, use pre-computed dict of wavelength bin edges, already on GPU
binned (dict) – already computed dictionary of rebinned fluxes
use_gpu (bool) – use GPU or not
prior (2d array) – prior matrix on coefficients (1/sig**2)
ncam (int) – Number of camera for given Instrument

Returns:

chi2 of best fit model zcoeff (array): zcoeff of best fit model fulltype (str): fulltype of best archetypes

Return type:

chi2 (float)

rebin_template(index, z, dwave, trapz=True)[source]

rebin_template_batch(z, dwave, trapz=True, dedges=None, indx=None, use_gpu=False)[source]

Rebin templates to a set of wavelengths.

The templates are part of this archetype. The wavelengths can be passed as centers (dwave) or edges (dedges) of the wavelength bins.

Parameters:

z (float) – the redshift
dwave (dict) – the keys are the “wavehash” and the values are a 1D array containing the wavelength grid.
trapz (bool) – Whether to use the trapz algorithm or interpolation
dedges (dict) – in GPU mode, use pre-computed dict of wavelength bin edges, already on GPU
indx (array) – Only rebin these indices in the flux array (e.g. rebin flux[indx] rather than flux[:])
use_gpu (bool) – whether or not to use the GPU algorithm

Returns:

The rebinned template

Return type:

dict

redrock.archetypes.find_archetypes(archetypes_dir=None)[source]

Return list of rrarchetype-*.fits archetype files

Search directories in this order, returning results from first one found:

archetypes_dir
$RR_ARCHETYPE_DIR
<redrock_code>/archetypes/

Parameters:: archetypes_dir (str) – optional directory containing the archetypes.
Returns:: a list of archetype files.
Return type:: list

redrock.archetypes.split_archetype_coeff(subtype, coeff, nbands, nleg)[source]

Split coeff array into archetype + legendre terms

Parameters:

subtype (str) – comma separated archetype subtypes
coeff (array) – coefficients from redrock fit
nbands (int) – number of spectrograph bands (e.g. 3 for DESI b/r/z)
nleg (int) – number of legendre terms per band

redrock.constants

Set constants used by the rest of the package.

redrock.external

This provides interfaces to actual data files.

redrock.external.boss

redrock wrapper tools for BOSS

redrock.external.boss.read_spectra(spplates_name, targetids=None, use_frames=False, fiberid=None, coadd=False, cache_Rcsr=False, use_andmask=False)[source]

Read targets from a list of spectra files

Parameters:

spplates_name (list or str) – input spPlate files or pattern to match files.
targetids (list) – restrict targets to this subset.
use_frames (bool) – if True, use frames.
fiberid (int) – Use this fiber ID.
coadd (bool) – if True, compute and use the coadds.
cache_Rcsr (bool) – pre-calculate and cache sparse CSR format of resolution matrix R
use_andmask (bool) – sets ivar = 0 to pixels with and_mask != 0

Returns:

(targets, meta) where targets is a list of Target objects and meta is a Table of metadata (currently only BRICKNAME).

Return type:

tuple

redrock.external.boss.rrboss(options=None, comm=None)[source]

Estimate redshifts for BOSS targets.

This loads targets serially and copies them into a DistTargets class. It then runs redshift fitting and writes the output to a catalog.

Parameters:

options (list) – optional list of commandline options to parse.
comm (mpi4py.Comm) – MPI communicator to use.

redrock.external.boss.write_zbest(outfile, zbest, template_version, archetype_version)[source]

Write zbest Table to outfile

Parameters:

outfile (str) – output file.
zbest (Table) – the output best fit results.

redrock.external.desi

redrock wrapper tools for DESI

class redrock.external.desi.DistTargetsDESI(spectrafiles, coadd=True, targetids=None, first_target=None, n_target=None, comm=None, cache_Rcsr=False, cosmics_nsig=0, negflux_nsig=5, capacities=None)[source]

Distributed targets for DESI.

DESI spectral data is grouped by sky location, but is just a random collection of spectra for all targets. Read this into memory while grouping by target ID, preserving order in which each target first appears.

We pass through the spectra files once to compute all the book-keeping associated with regrouping the spectra by target. Then we pass through again and actually read and distribute the data.

Parameters:

spectrafiles (str or list) – a list of input files or pattern match of files.
coadd (bool) – if False, do not compute the coadds.
targetids (list) – (optional) restrict the global set of target IDs to this list.
first_target (int) – (optional) integer offset of the first target to consider in each file. Useful for debugging / testing.
n_target (int) – (optional) number of targets to consider in each file. Useful for debugging / testing.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.
cache_Rcsr – pre-calculate and cache sparse CSR format of resolution matrix R
cosmics_nsig (float) – cosmic rejection threshold used in coaddition
negflux_nsig (float) – mask negative flux significance threshold
capacities (list) – (optional) list of process capacities. If None, use equal capacity per process. A process with higher capacity can handle more work.

redrock.external.desi._get_header(templates, archetypes=None, spec_header=None)[source]

Get standardized header with template and archetype versions

Parameters:: templates – list or dict of template objects

Options:: archetypes : list or dict of Archetype objects spec_header (dict-like): header of HDU 0 of input spectra/coadd

redrock.external.desi.rrdesi(options=None, comm=None)[source]

Estimate redshifts for DESI targets.

This loads distributed DESI targets from one or more spectra grouping files and computes the redshifts. The outputs are written to a redrock scan file and a DESI redshift catalog.

Parameters:

options (list) – optional list of commandline options to parse.
comm (mpi4py.Comm) – MPI communicator to use.

redrock.external.desi.write_bestmodel(outfile, zbest, modeldict, wavedict, templates, archetypes=None, spec_header=None)[source]

Writes best fit redrock model in the outfile

Parameters:

outfile (str) – output file name (fits)
zbest (Table) – best fit redshift table
modeldict (dict) – models[ntargets, nwave] keyed by camera band
wavedict (dict) – wavelength dictionary keyed by camera band
templates – list or dict of template objects

Options:: archetypes : list or dict of Archetype objects spec_header (dict-like): header of HDU 0 of input spectra

The output file format mirrors the structure of an input coadd, with B/R/Z_MODEL HDUs instead of B/R/Z_FLUX HDUs.

redrock.external.desi.write_zbest(outfile, zbest, fibermap, exp_fibermap, tsnr2, templates, archetypes=None, spec_header=None, secondary_headers=None)[source]

Write zbest and fibermap Tables to outfile

Parameters:

outfile (str) – output path.
zbest (Table) – best fit table.
fibermap (Table) – the coadded fibermap from the original inputs.
tsnr2 (Table) – table of input coadded TSNR2 values
exp_fibermap (Table) – the per-exposure fibermap from the orig inputs.
templates – list or dict of template objects

Options:: archetypes : list or dict of Archetype objects spec_header (dict-like): header of HDU 0 of input spectra secondary_headers (dict-like, astropy.io.fits.Header objects): header of other HDUs than primary

Modifies input tables.meta[‘EXTNAME’]

Note: spec_header is a dict of KEY/VALUE pairs, while secondary_headers is a dict keyed by EXTNAME, with values that are Header objects with KEY/VALUE pairs.

redrock.fitz

Functions for fitting minima of chi^2 results.

redrock.fitz.find_minima(x)[source]

Return indices of local minima of x, including edges.

The indices are sorted small to large.

Note

this is somewhat conservative in the case of repeated values: find_minima([1,1,1,2,2,2]) -> [0,1,2,4,5]

Parameters:: x (array-like) – The data array.
Returns:: The indices.
Return type:: (array)

redrock.fitz.fitz(zchi2, redshifts, target, template, nminima=3, archetype=None, use_gpu=False, deg_legendre=None, zminfit_npoints=15, per_camera=False, n_nearest=None, prior_sigma=None)[source]

Refines redshift measurement around up to nminima minima.

Parameters:

zchi2 (array) – chi^2 values for each redshift.
redshifts (array) – the redshift values.
target (Target) – the target for this fit which includes a list of Spectrum objects at different wavelength grids.
template (Template) – the template for this fit.
nminima (int) – the number of minima to consider.
archetype (object, optional) – A single Archetype object (of given spectype) to use for final fitz choice of best chi2 vs. z minimum.
use_gpu (bool) – use GPU or not
deg_legendre (int) – in archetype mode polynomials upto deg_legendre-1 will be used
zminfit_npoints (int) – number of finer redshift pixels to search for final redshift - default 15
per_camera – (bool): True if fitting needs to be done in each camera for archetype mode
n_nearest – (int): number of nearest neighbours to be used in chi2 space (including best archetype)
prior_sigma (float) – prior to add in the final solution matrix: added as 1/(prior_sigma**2) only for per-camera mode

Returns:

the fit parameters for the minima.

Return type:

Table

redrock.fitz.get_dv(z, zref)[source]

Returns velocity difference in km/s for two redshifts

Parameters:

z (float) – redshift for comparison.
zref (float) – reference redshift.

Returns:

the velocity difference.

Return type:

(float)

redrock.fitz.minfit(x, y)[source]

Fits y = y0 + ((x-x0)/xerr)**2

See redrock.zwarning.ZWarningMask.BAD_MINFIT for zwarn failure flags

Parameters:

x (array) – x values.
y (array) – y values.

Returns:

(x0, xerr, y0, zwarn) where zwarn=0 is good fit.

Return type:

(tuple)

redrock.fitz.prior_on_coeffs(n_nbh, deg_legendre, sigma, ncamera)[source]

Parameters:

n_nbh (int) – number of dominant archetypes
deg_legendre (int) – number of Legendre polynomials
sigma (int) – prior sigma to be used for archetype fitting
ncamera (int) – number of cameras for given instrument

Returns:

2d array to be added while solving for archetype fitting

redrock.igm

Calculate intergalactic medium (IGM) absorption.

redrock.igm.transmission_IGM_Inoue14(zObj, lObs, use_gpu=False)[source]

Calculate the transmitted flux fraction from the Lyman series and due to the IGM. This returns the transmitted flux fraction: 1 -> everything is transmitted (medium is transparent) 0 -> nothing is transmitted (medium is opaque)

Parameters:

zObj (array of float) – Redshifts of objects
lObs (array of float) – wavelength grid
use_gpu (boolean) – whether to use CUPY

Returns:

transmitted flux fraction T[nz, nlambda]

Return type:

array of float

redrock.igm.transmission_Lyman(zObj, lObs, use_gpu=False, model='Calura12', always_return_array=True)[source]

Calculate the transmitted flux fraction from the Lyman series This returns the transmitted flux fraction: 1 -> everything is transmitted (medium is transparent) 0 -> nothing is transmitted (medium is opaque)

Parameters:

zObj (float or array of float) – Redshift(s) of object
lObs (float or array of float) – wavelength grid
use_gpu (boolean) – whether to use CUPY
model – which IGM model to use; Calura12, Kamble20, or Inoue14
always_return_array – if True (default), always return array even if all ones

Returns:

transmitted flux fraction if both zObj and lObs are scalar, this returns a float. If one is scalar and other is array, this returns array If both are arrays, this returns 2D array[nz, nwave]

Return type:

float or array of float

if always_return_array is False, returns None if there is no IGM absoption for this redshift (faster).

redrock.igm.transmission_Lyman_CaluraKamble(zObj, lObs, use_gpu=False, model='Calura12')[source]

Calculate the transmitted flux fraction from the Lyman series This returns the transmitted flux fraction: 1 -> everything is transmitted (medium is transparent) 0 -> nothing is transmitted (medium is opaque)

Parameters:

zObj (array of float) – Redshifts of objects
lObs (array of float) – wavelength grid
use_gpu (boolean) – whether to use CUPY
model (str) – Calura12 or Kamble20, IGM model constants to use

Returns:

transmitted flux fraction T[nz, nlambda]

Return type:

array of float

redrock.plotspec

Visualization tools for plotting spectra.

redrock.priors

Classes and functions for priors.

class redrock.priors.Priors(filename)[source]

Class to store all different redshift priors.

Parameters:: filename (str) – the path to the redshift prior file.

Note

The file should have at least one HDU with EXTNAME=’PRIORS’, composed of:

TARGETID: the id of the object
Z: the mean of the prior
SIGMA: the sigma (in dz) for the prior

eval(targetid, z)[source]

Return prior contribution to the chi2 for the given TARGETID and redshift grid :param targetid: TARGETID of the object :param z: redshift at which to evaluate template flux

Returns:: prior values on the redshift grid

static gaussian(z, z0, s0)[source]

Return a Gaussian prior of mean z0 and sigma s0 on the grid z :param z: redshift grid :param z0: mean :param s0: sigma

Returns:: prior values on the redshift grid

static lorentzien(z, z0, s0)[source]

Return a Lorentzien prior of mean z0 and sigma s0 on the grid z :param z: redshift grid :param z0: mean :param s0: sigma

Returns:: prior values on the redshift grid

static tophat(z, z0, s0)[source]

Return a tophat prior of mean z0 and width s0 on the grid z.

Parameters:

z – redshift grid
z0 – mean
s0 – width

Returns:

prior values on the redshift grid.

Warning

np.nan <= np.nan -> False
np.nan <=/>= 0.0 -> False
np.inf >= 0.0 -> True
np.inf >= np.inf -> True

redrock.rebin

Tools for binning data.

redrock.rebin._trapz_rebin_1d(x, y, edges, results)

Numba-friendly version of trapezoidal rebinning

See redrock.rebin.trapz_rebin() for input descriptions. results is pre-allocated array of length len(edges)-1 to keep results

redrock.rebin._trapz_rebin_batch(x, y, edges, myz, results, redshifted_x)

Numba-friendly version of trapezoidal rebinning for multiple bases and/or redshifts. This is a wrapper to call _trapz_rebin_1d multiple times.

See redrock.rebin.trapz_rebin() for input descriptions. results is pre-allocated array of shape (nz, len(edges)-1, nbasis) to keep results.

redrock.rebin._trapz_rebin_batch_gpu(x, y, edges, myz, result_shape)[source]

Rebin y(x) flux density using trapezoidal integration between bin edges GPU algorithm can rebin in batch for multiple redshifts and bases and returns 1-d, 2-d, or 3-d array (n redshifts x n bins x n basis) where n basis is the optional second dimension of the y input array and n redshifts is the length of the optional myz array.

Parameters:

x (1-d array) – input x values.
y (1-d or 2-d array) – input y values (for all bases).
edges (1-d array) – new bin edges.
myz (array) – redshift array to rebin in batch, applying redshifts on-the-fly to x
result_shape (tuple) – output shape of results array - this is required because for instance if an input scalar redshift or no redshift was given to trapz_rebin, then the result shape should be (nbin, nbasis) whereas if redshift array of length 1 was given as input it should be (1, nbin, nbasis). Similarly if the y array given to trapz_rebin is 1-d, the basis dimension is omitted but if it is 2-d with (1, n) shape, it will explicitly be 1.

Returns:

integrated results with shape (nz, nbin, nbasis): where nbin = len(results) = len(edges)-1 if no input redshift is given to trapz_rebin or the redshift is a scalar, the nz dimension will be omitted. If the input y given to trapz_rebin is 1-d then the nbasis dimension will be omitted. e.g. for 1-d input y and nz = 100, the shape will be (100, nbin) while for y with shape (1, n) the result will be (100, nbin, 1). For 1-d input y and scalar or omitted redshift, the shape will be (nbin).

Return type:

cp.array

redrock.rebin.centers2edges(centers)[source]

Convert bin centers to bin edges, guessing at what you probably meant

Parameters:: centers (array) – bin centers,
Returns:: bin edges, lenth = len(centers) + 1
Return type:: array

redrock.rebin.rebin_template(template, myz, dwave=None, dedges=None, use_gpu=False, xmin=None, xmax=None)[source]

Rebin a template to a set of wavelengths.

Given a template and a single redshift - or an array of redshifts, rebin the template to a set of wavelength arrays. The wavelengths can be passed as centers (dwave) or edges (dedges) of the wavelength bins.

Parameters:

template (Template) – the template object
myz (float or array of float) – the redshift(s)
dwave (dict) – the keys are the “wavehash” and the values are a 1D array containing the wavelength grid.
dedges (dict) – the keys are the “wavehash” and the values can be either a 1D array containing the bin edges of the wavelength grid (computed from centers2edges) or a 3-element tuple with this 1D array as a CuPy array and the min and max values as scalars in CPU memory.
use_gpu (bool) – whether or not to use the GPU algorithm
xmin (float) – (optional) minimum x-value - x[0]*(1+myz.max()) will be used if omitted - this is useful to avoid GPU to CPU copying in the case where x is a CuPy array and providing the scalar value results in a large speed gain
xmax (float) (optional) maximum x-value - x[-1]*(1+myz.min()) – used if omitted - this is useful to avoid GPU to CPU copying in the case where x is a CuPy array and providing the scalar value results in a large speed gain

Returns:

The rebinned template for every basis function and wavelength: grid in dwave. This is a dict of np or cp 3d arrays (nz x nlambda x nbasis)

Return type:

dict

redrock.rebin.trapz_rebin(x, y, xnew=None, edges=None, myz=None, use_gpu=False, xmin=None, xmax=None, edge_min=None, edge_max=None)[source]

Rebin y(x) flux density using trapezoidal integration between bin edges Optionally use GPU helper method to rebin in batch, see trapz_rebin_batch_gpu Note - current return array shape is (nz, nbins, nbasis). Changing to (nz, nbasis, nbins) would intuitively make sense but the former shape is needed by zscan. Flagging this for possible changes down the road.

Notes

y is interpreted as a density, as is the output, e.g.

>>> x = np.arange(10)
>>> y = np.ones(10)
>>> trapz_rebin(x, y, edges=[0,2,4,6,8])  #- density still 1, not 2
array([ 1.,  1.,  1.,  1.])
>>> y = np.ones((2,10)) #nbasis = 2
>>> y[1,:] = np.arange(10)
>>> trapz_rebin(x, y, edges=[0,2,4,6,8], use_gpu=True) #nbasis=2, GPU mode
array([[1., 1.],
       [1., 3.],
       [1., 5.],
       [1., 7.]])
>>> trapz_rebin(x, y, edges=[0,2,4,6,8], myz=[0,0.5],use_gpu=True) #nbasis=2, multiple redshifts, GPU mode
array([[[1.        , 1.        ],
        [1.        , 3.        ],
        [1.        , 5.        ],
        [1.        , 7.        ]],
       [[1.        , 0.66666667],
        [1.        , 2.        ],
        [1.        , 3.33333333],
        [1.        , 4.66666667]]])
>>> trapz_rebin(x, y, edges=[0,2,4,6,8], myz=[0.0,0.5], use_gpu=False) #nbasis=2, CPU mode
array([[[1.        , 1.        ],
        [1.        , 3.        ],
        [1.        , 5.        ],
        [1.        , 7.        ]],
       [[1.        , 0.66666667],
        [1.        , 2.        ],
        [1.        , 3.33333333],
        [1.        , 4.66666667]]])

Parameters:

x (array) – input x values.
y (1-d or 2-d array) – input y values (batch mode allows 2-d array with multiple bases).
edges (array) – (optional) new bin edges.
myz (array) – (optional) redshift array to rebin in batch, applying redshifts on-the-fly to x
use_gpu (boolean) – whether or not to use GPU algorithm
xmin (float) – (optional) minimum x-value - x[0]*(1+myz.max()) will be used if omitted - this is useful to avoid GPU to CPU copying in the case where x is a CuPy array and providing the scalar value results in a large speed gain
xmax (float) (optional) maximum x-value - x[-1]*(1+myz.min()) – used if omitted - this is useful to avoid GPU to CPU copying in the case where x is a CuPy array and providing the scalar value results in a large speed gain
edge_min (float) – (optional) minimum new bin edge - edges[0] will be used if omitted - this is useful to avoid GPU to CPU copying in the case where edges is precomputed on the GPU as a CuPy array and providing the scalar value results in a large speed gain
edge_max (float) – (optional) maximum new bin edge - edges[-1] will be used if omitted - this is useful to avoid GPU to CPU copying in the case where edges is precomputed on the GPU as a CuPy array and providing the scalar value results in a large speed gain

Returns:

integrated results with len(results) = len(edges)-1: In batch mode, returns np.array (use_gpu=False) or cp.array (use_gpu=True) with shape (nz, nbin, nbasis) where nbin = len(results) = len(edges)-1 if myz is None or myz is a scalar, the nz dimension will be omitted. If y is 1-d then the nbasis dimension will be omitted. e.g. for 1-d y and nz = 100, the shape will be (100, nbin) while for y with shape (1, n) the result will be (100, nbin, 1). For 1-d input y and scalar or omitted myz, the shape will be (nbin).

Return type:

array

Raises:

ValueError – if edges are outside the range of x or if len(x) != len(y)

redrock.results

Functions for reading and writing full redrock results to HDF5.

redrock.results.read_zfit(filename, select_targetids=None, upper=False)[source]

Read and stack zfit tables from filename with N best fits

Parameters:: filename (str) – path to Redrock details hdf5 file

Options:: select_targetids (int or array of int): read only these targetids upper (bool): convert table column names to UPPERCASE

Returns: astropy Table of zfit results, or dict of Tables keyed by targetid

redrock.results.read_zscan(filename, select_targetids=None, upper=False, nozfit=False)[source]

Read redrock.zfind results from a file.

Parameters:: filename (str) – redrock details (.h5) filename

Options:: select_targetids (int or array of int): TARGETID(s) to read upper (bool): convert table column names to UPPERCASE nozfit (bool): do not include zfit information

Returns:

zscan, or (zscan, zfit) tuple where zscan is a nested dictionary zscan[targetid][templatetype] with keys:

redshifts: array of redshifts scanned

zchi2: array of chi2 of fit vs. redshifts

penalty: array of chi2 penalties for unphysical fits at each z

zcoeff: array of coefficients fit per redshifts

zfit: table of N best-fit redshifts for just this target and templatetype

zfit is a Table with the N best fits for each target per spectype and subtype

redrock.results.read_zscan_redrock(filename)[source]

Read redrock.zfind results from a file to be reused by redrock itself.

Returns:

dictionary of results for each local target ID.: dic.keys() are TARGETID dic[tg].keys() are TEMPLATE dic[tg][ft].keys() are [‘penalty’, ‘zcoeff’, ‘zchi2’, ‘redshifts’]

Return type:

dict

redrock.results.write_zscan(filename, zscan, zfit, clobber=False)[source]

Writes redrock.zfind results to a file.

The nested dictionary structure of results is mapped into a nested group structure of the HDF5 file:

/targetids[nt] /zscan/{spectype}/redshifts[nz] /zscan/{spectype}/zchi2[nt, nz] /zscan/{spectype}/penalty[nt, nz] /zscan/{spectype}/zcoeff[nt, nz, nc] or zcoeff[nt, nc, nz] ? /zfit/{targetid}/zfit table…

Parameters:

filename (str) – the output file path.
zscan (dict) – the full set of fit results.
zfit (Table) – the best fit redshift results.
clobber (bool) – if True, delete the file if it exists.

redrock.targets

Classes and functions for targets and their spectra.

class redrock.targets.DistTargets(targetids, comm=None)[source]

Base class for distributed targets.

Target objects are distributed across the processes in an MPI communicator, but the details of how this data is loaded from disk is specific to a given project. Each project should inherit from this base class and create an appropriate class for the data files being used.

This class defines some general methods and the API that should be followed by these derived classes.

Parameters:

targetids (list) – the global set of target IDs.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.

eval_models(bestfit, templates, archetypes=None, nleg=None)[source]

Calls target.eval_model(bestfit, templates, archetypes) for each local target

Parameters:

bestfit – dict-like with SPECTYPE, SUBTYPE, COEFF, and Z
templates – dict[(SPECTYPE,SUBTYPE)] of Template objects

Options:: archetypes: dict[(SPECTYPE,SUBTYPE)] of Archetype objects
Returns models list of dictionaries, with one item per local target,: and dictionaries keyed by the wavehashes of the spectra for that target.

See Target.eval_model for more details.

Note: sets target.model for each local target

local()[source]: Return the local list of Target objects.

local_target_ids()[source]: Return the local list of target IDs.

wavegrids()[source]: Return the global dictionary of wavelength grids for each wave hash.

class redrock.targets.DistTargetsCopy(targets, comm=None, root=0)[source]

Distributed targets built from a copy.

This class is a simple wrapper that distributes targets located on one process to the processes in a communicator.

Parameters:

targets (list) – list of Target objects on one process.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.
root (int) – the process which has the input targets locally.

class redrock.targets.Spectrum(wave, flux, ivar, R, Rcsr=None, band=None)[source]

Simple container class for an individual spectrum.

Parameters:

wave (array) – the wavelength grid.
flux (array) – the flux values.
ivar (array) – the inverse variance.
R (scipy.sparse.dia_matrix) – the resolution matrix in band diagonal format.
Rcsr (scipy.sparse.csr_matrix) – the resolution matrix in CSR format.
band (str) – camera band name

sharedmem_pack()[source]: Pack spectral data into multiprocessing shared memory.

sharedmem_unpack()[source]: Unpack spectral data from multiprocessing shared memory.

class redrock.targets.Target(targetid, spectra, coadd=False, cosmics_nsig=0.0, meta=None)[source]

A single target.

This represents the data for a single target, including a unique identifier and the individual spectra observed for this object (or a coadd).

Parameters:

targetid (int or str) – unique targetid
spectra (list) – list of Spectrum objects
coadd (bool) – compute and store the coadd at construction time. The coadd can always be recomputed with the compute_coadd() method.
cosmics_nsig (float) – cosmic rejection threshold in compute_coadd
meta (dict) – optional metadata dictionary for this Target.

compute_coadd(cache_Rcsr=False, cosmics_nsig=0.0)[source]

Compute the coadd from the current spectra list.

Parameters:

cache_Rcsr – pre-calculate and cache sparse CSR format of resolution matrix R
cosmics_nsig (float) – number of sigma for cosmic rejection.

This method REPLACES the list of individual spectra with coadds.

eval_model(bestfit, templates, archetypes=None, nleg=None)[source]

Evaluate the best fit model for the spectra in this Target.

Parameters:

bestfit – dict-like with SPECTYPE, SUBTYPE, COEFF, and Z
templates – dict[(SPECTYPE,SUBTYPE)] of Template objects

Options:: archetypes: dict[(SPECTYPE,SUBTYPE)] of Archetype objects nleg (int): highest degree of Legendre polynomial

Returns model dict of rendered best fit models keyed by wavehash

Note: modifies self by setting self.model to be returned value.

Example after calling target.eval_model:

spec = target.spectra[i]
wavelengths = spec.wave
fluxdata = spec.flux
wavehash = spec.wavehash
fluxmodel = target.model[wavehash]

sharedmem_pack()[source]: Pack all spectra into multiprocessing shared memory.

sharedmem_unpack()[source]: Unpack all spectra from multiprocessing shared memory.

redrock.targets.distribute_targets(targets, nproc)[source]

Distribute a list of targets among processes.

Given a list of Target objects, compute the load balanced distribution of those targets among a set of processes.

This function is used when one already has a list of Target objects that need to be distributed. This happens, for example, when creating a DistTargetsCopy object from pre-existing Targets, or when using multiprocessing to do operations on the MPI-local list of targets.

Parameters:

targets (list) – list of Target objects.
nproc (int) – number of processes.

Returns:

A list (one element for each process) with each element: being a list of the target IDs assigned to that process.

Return type:

list

redrock.templates

Classes and functions for templates.

class redrock.templates.DistTemplate(template, dwave, mp_procs=1, comm=None, use_gpu=False, gpu_mode=False)[source]

Distributed template data interpolated to all redshifts.

For a given template, the redshifts are distributed among the processes in the communicator. Then each process will rebin the template to those redshifts for the wavelength grids specified by dwave.

Parameters:

template (Template) – the template to distribute
dwave (dict) – the keys are the “wavehash” and the values are a 1D array containing the wavelength grid.
mp_procs (int) – if not using MPI, restrict the number of multiprocesses to this.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.
use_gpu (bool) – (optional) If this process uses GPU
gpu_mode (bool) – (optional) If ANY process uses GPU - the reason we need both is that in GPU mode, the rebinning of all redshifts is done on all GPUs (redundantly but much faster than using MPI allgather) and no redshifts are done on any CPU only proc.

cycle()[source]

Pass our piece of data to the next process.

If we have returned to our original data, then return True, otherwise return False.

Parameters:: Nothing

Returns (bool):: Whether we have finished (True) else False.

class redrock.templates.DistTemplatePiece(index, redshifts, data)[source]

One piece of the distributed template data.

This is a simple container for storing interpolated templates for a set of redshift values. It is used for communicating the interpolated templates between processes.

In the MPI case, each process will store at most two of these simultaneously. This is the data that is computed on a single process and passed between processes.

Parameters:

index (int) – the chunk index of this piece- this corresponds to the process rank that originally computed this piece.
redshifts (array) – the redshift range contained in this piece.
data (list) – a list of dictionaries, one for each redshift, and each containing the 2D interpolated template values for all “wavehash” keys.

class redrock.templates.ReDistTemplate(template, dwave, mp_procs=1, comm=None, use_gpu=False, gpu_mode=False)[source]

Distributed template data interpolated to all redshifts.

For a given template, the redshifts are distributed among the processes in the communicator. Then each process will rebin the template to those redshifts for the wavelength grids specified by dwave. After rebinning, the full redshift ranges are redistributed to each process in the communicator.

Parameters:

template (Template) – the template to distribute
dwave (dict) – the keys are the “wavehash” and the values are a 1D array containing the wavelength grid.
mp_procs (int) – if not using MPI, restrict the number of multiprocesses to this.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.

cycle()[source]

This function is a no-op since redshift ranges have been redistributed.

Parameters:: Nothing

Returns (bool):: Always returns True

class redrock.templates.Template(filename=None, spectype=None, redshifts=None, wave=None, flux=None, subtype=None, method='PCA', igm_model='Inoue14', zscan_galaxy=None, zscan_qso=None, zscan_star=None, version=None)[source]

A spectral Template PCA object.

The template data is read from a redrock-format template file. Alternatively, the data can be specified in the constructor.

Parameters:: filename (str) – the path to the template file, either absolute or relative to the template path ($RR_TEMPLATE_DIR or installed with code)

eval(coeff, wave, z, R=None)[source]

Return template for given coefficients, wavelengths, and redshift

Parameters:

coeff – array of coefficients length self.nbasis
wave – wavelengths at which to evaluate template flux
z – redshift at which to evaluate template flux

Options:: R : array[nwave,nwave] resolution matrix to convolve with model

Returns:: template flux array

Notes

No factors of (1+z) are applied to the resampled flux, i.e. evaluating the same coeffs at different z does not conserve integrated flux, but more directly maps model=templates.dot(coeff)

property full_type: Return formatted fulltype = spectype:::subtype string.

property method: Alias for self.solve_matrices_algorithm, i.e. PCA or NMF

property solve_matrices_algorithm: Return a string representing the algorithm to be used in zscan.solve_matrices. Possible values are: PCA NMF Logic can be added here to select a default algorithm based on header keywords etc so that different templates seamlessly use different algorithms.

redrock.templates._mp_rebin_template(template, dwave, zlist, qout, iproc, use_gpu)[source]: Function for multiprocessing version of rebinning. With rebinning now done in batch mode, use process index, iproc to keep track of order of redshifts instead of keying dict by individual redshifts.

redrock.templates.eval_model(data, wave, R=None, templates=None)[source]

Evaluate model spectra.

Given a bunch of fits with coefficients COEFF, redshifts Z, and types SPECTYPE, SUBTYPE in data, evaluate the redrock model fits at the wavelengths wave using resolution matrix R.

The wavelength and resolution matrices may be dictionaries including for multiple cameras.

Parameters:

data (table-like, [nspec]) – table with information on each model to evaluate. Must contain at least Z, COEFF, SPECTYPE, and SUBTYPE fields.
wave (array [nwave] or dictionary thereof) – array of wavelengths in angstrom at which to evaluate the models.
R (list of [nwave, nwave] arrays of floats or dictionary thereof) – resolution matrices for evaluating spectra.
templates (dictionary of Template) – dictionary with (SPECTYPE, SUBTYPE) giving the template corresponding to each type.

Returns:

model fluxes, array [nspec, nwave]. If wave and R are dict, then a dictionary of model fluxes, one for each camera.

redrock.templates.find_templates(template_path=None)[source]

Return list of Redrock template files

template_path can be one of 5 things:

None (use get_template_dir())

list/tuple/set/array of template files

path to directory containing template files

path to single template file to use

path to text file listing which template files to use

Returns list of template files to use

redrock.templates.get_template_dir(template_path=None)[source]: Return template_path is set, otherwise $RR_TEMPLATE_DIR, or (code)/data/templates

redrock.templates.header2templatefiles(hdr, template_dir=None)[source]

Derive template filenames from header keywords

Parameters:: hdr – dict-like with keys TEMNAMnn+TEMVERnn and/or TEMFILnn

Options:: template_dir (str): full path to redrock-templates directory

Returns list of template filenames, including directory path

redrock.templates.load_dist_templates(dwave, templates=None, comm=None, mp_procs=1, zscan_galaxy=None, zscan_qso=None, zscan_star=None, redistribute=False, use_gpu=False, gpu_mode=False)[source]

Read and distribute templates from disk.

This reads one or more template files from disk and distributes them among an MPI communicator. Each process will locally store interpolated data for a redshift slice of each template. For a single redshift, the template is interpolated to the wavelength grids specified by “dwave”.

As an example, imagine 3 templates with independent redshift ranges. Also imagine that the communicator has 2 processes. This function would return a list of 3 DistTemplate objects. Within each of those objects, the 2 processes store the interpolated data for a subset of the redshift range:

DistTemplate #1: zmin1 <—- p0 —-> | <—- p1 —-> zmax1 DistTemplate #2: zmin2 <– p0 –> | <– p1 –> zmax2 DistTemplate #3: zmin3 <— p0 —> | <— p1 —> zmax3

Parameters:

dwave (dict) – the dictionary of wavelength grids. Keys are the “wavehash” and values are an array of wavelengths.
templates (str or None) – if None, find all templates from the redrock template directory. If a path to a file is specified, load that single template. If a path to a directory is given, load all templates in that directory.
comm (mpi4py.MPI.Comm) – (optional) the MPI communicator.
mp_procs (int) – if not using MPI, restrict the number of multiprocesses to this.
redistribute (bool) – (optional) allgather rebinned templates after distributed rebinning so each process has the full redshift range for the template.

Returns dist_templates: a list of DistTemplate objects, sampled at redshifts

redrock.templates.load_templates(template_path=None, zscan_galaxy=None, zscan_star=None, zscan_qso=None, asdict=False)[source]

Return list or dict of Template objects

Options:: template_path: list of template files, directory, or text file with list of templates zscan_galaxy (str): zmin,zmax,dz redshift range for galaxies zscan_star (str): zmin,zmax,dz redshift range for stars zscan_qso (str): zmin,zmax,dz redshift range for QSOs asdict (bool): return dict keyed by (spectype, subtype) instead of list

template_path is list of template file paths, or path to provide to find_templates, i.e. a path to a directory with templates, a path to a text file containing a list of templates, a path to a single template file, or None to use $RR_TEMPLATE_DIR or templates installed with code instead.

Returns: list or dict of Template objects

Note: this always returns a list/dict, even if template_path is a path to a single template file.

redrock.templates.load_templates_from_header(hdr, template_dir=None, asdict=False)[source]

Return templates matching header keywords

Parameters:: hdr – dict-like with keys TEMNAMnn+TEMVERnn and/or TEMFILnn

Options:: template_dir (str): full path to redrock-templates directory asdict (bool): return dict keyed by (spectype, subtype) instead of list

Returns list of Template objects

redrock.templates.make_fulltype(spectype, subtype)[source]: combine (spectype, subtype) into fulltype

redrock.templates.parse_fulltype(fulltype)[source]: Parse template fulltype into (spectype, subtype)

redrock.utils

Redrock utility functions.

redrock.utils.distribute_work(nproc, ids, weights=None, capacities=None)[source]

Helper function to distribute work among processes with varying capacities.

Parameters:

nproc (int) – the number of processes.
ids (list) – list of work unit IDs
weights (dict) – dictionary of weights for each ID. If None, use equal weighting.
capacities (list) – list of process capacities. If None, use equal capacity per process. A process with higher capacity can handle more work.

Returns:

A list (one element for each process) with each element: being a list of the IDs assigned to that process.

Return type:

list

redrock.utils.elapsed(timer, prefix, comm=None)[source]

Get and print the elapsed time.

If timer is None, compute the start time and return. Otherwise, find the elapsed time and print a message before returning the new start time.

Parameters:

timer (float) – time in seconds for some arbitrary epoch. If “None”, get the current time and return.
prefix (str) – string to print before the elapsed time.
comm (mpi4py.MPI.Comm) – optional communicator.

Returns:

the new start time in seconds.

Return type:

float

redrock.utils.encode_column(c)[source]

Returns a bytes column encoded into a string column.

Parameters:: c (Table column) – a column of a Table.
Returns:: an array of strings.
Return type:: array

redrock.utils.extra_columns_in_archetype_mode()[source]: dictionary of extra column to be added in archetye mode

redrock.utils.getGPUCountMPI(comm)[source]

Determine the number of GPUs available when running in MPI mode

In MPI mode, cupy.cuda.runtime.getDeviceCount() always returns 1 so this method determines the actual number of GPUs.

Parameters:: comm – the MPI communicator
Returns:: the number of GPUs
Return type:: int

redrock.utils.get_mp(requested)[source]

Returns a reasonable number of multiprocessing processes.

This checks whether the requested value makes sense, and also whether we are running on a NERSC login node (and hence would get in trouble trying to use all cores).

Parameters:: requested (int) – the requested number of processes.
Returns:: the number of processes to use.
Return type:: int

redrock.utils.getch()[source]: Return a single character from stdin.

redrock.utils.mp_array(original)[source]

Allocate a raw shared memory buffer and wrap it in an ndarray.

This allocates a multiprocessing.RawArray and wraps the buffer with an ndarray.

Parameters:

typcode (str) – the type code of the array.
size_or_init – passed to the RawArray constructor.

Returns;: ndarray: the wrapped data.

If the input array has size=0, the original is returned without wrapping it in a size=0 multiprocessing.RawArray (which generates warnings)

redrock.utils.native_endian(data)[source]

Convert numpy array data to native endianness if needed.

Returns new array if endianness is swapped, otherwise returns input data

By default, FITS data from astropy.io.fits.getdata() are not Intel native endianness and scipy 0.14 sparse matrices have a bug with non-native endian data.

Parameters:

data (array) – input array

Returns:

original array if input in native endianness, otherwise a copy: with the bytes swapped.

Return type:

array

redrock.utils.nersc_login_node()[source]: Returns True if we are on a NERSC login node, else False.

redrock.utils.reduced_wavelength(wave)[source]

function to calculate reduced wavelength arrya for legendre polynomial in archetype mode, legendre polynomials are orthogonal in [-1,1] :param wave: wavelength for which reduced wavelengths to be estimated :type wave: array

Returns:: reduced wavelength in [-1,1] range

redrock.zfind

Redshift finding algorithms.

redrock.zfind._mp_fitz(chi2, target_data, t, nminima, qout, archetype, use_gpu, deg_legendre, zminfit_npoints, per_camera, n_nearest, prior_sigma)[source]: Wrapper for multiprocessing version of fitz.

redrock.zfind._rebalance_after_scan(targets, results)[source]: Helper for rebalancing targets and results after lopsided zscan

redrock.zfind.calc_deltachi2(chi2, z, zwarn, dvlimit=1000.0)[source]

Calculate chi2 differences, excluding candidates with close z or bad fits

Parameters:

chi2 – array of chi2 values
z – array of redshifts
zwarn – array of zwarn values

Options:

dvlimit: exclude candidates that are closer than dvlimit [km/s],: uses minumum value of the pair

Returns (deltachi2, setzwarn) where deltachi2 is array of chi2 differences

to next best good fit, and setzwarn is boolean array of whether a SMALL_DELTACHI2 zwarn bit should be set.

Note: The final target always has deltachi2=0.0 because we don’t know

what the next chi2 would have been. This can also occur for the last N targets if all N of them are within dvlimit of each other.

redrock.zfind.sort_dict_by_col(d, colname)[source]: Sort a dict of np.ndarrays by one key. Replacement for astropy.Table.sort

redrock.zfind.sort_dict_by_cols(d, colnames, sort_first_column_first=True)[source]

Sort a dict of np.ndarrays by multiple keys Replacement for astropy.Table.sort

Parameters:

d – a dictionary
colnames – a tuple or list of column names
sort_first_column_first – boolean - np.lexsort((a,b)) will sort by b first and then sort by a. This is the opposite of astropy.Table.sort behavior. Setting this to true will ensure that sort_dict_by_cols(d, (‘a’,’b’)) will result in sorting by a first and then b.

redrock.zfind.sort_zfit(zfit)[source]

Sorts zfit table by goodness of fit, using ‘zwarn’ and ‘chi2’ columns

Parameters:: zfit – astropy Table with columns ‘zwarn’ and ‘chi2’

Modifies zfit in-place by sorting it

redrock.zfind.sort_zfit_dict(zfit)[source]

Sorts zfit dict by goodness of fit, using ‘zwarn’ and ‘chi2’ columns

Parameters:: zfit – dict of numpy arrays with columns ‘zwarn’ and ‘chi2’

Modifies zfit in-place by sorting it

redrock.zfind.zfind(targets, templates, mp_procs=1, nminima=3, archetypes=None, priors=None, chi2_scan=None, use_gpu=False, zminfit_npoints=15, per_camera=None, deg_legendre=None, n_nearest=None, prior_sigma=None, ncamera=None)[source]

Compute all redshift fits for the local set of targets and collect.

Given targets and templates distributed across a set of MPI processes, compute the redshift fits for all redshifts and our local set of targets. Each process computes the fits for a slice of redshift range and then cycles through redshift slices by passing the interpolated templates along to the next process in order.

Note

If using MPI, only the rank 0 process will return results- all other processes with return a tuple of (None, None).

Parameters:

targets (DistTargets) – distributed targets.
templates (list) – list of DistTemplate objects.
mp_procs (int, optional) – if not using MPI, this is the number of multiprocessing processes to use.
nminima (int, optional) – number of chi^2 minima to consider. Passed to fitz().
archetypes (list optional) – list of Archetype objects to use for final fitz choice of best chi2 vs. z minimum.
priors (str, optional) – file containing redshift priors
chi2_scan (str, optional) – file containing already computed chi2 scan
use_gpu (bool, optional) – use gpu for calc_zchi2
deg_legendre (int) – in archetype mode polynomials upto deg_legendre-1 will be used
zminfit_npoints (int) – number of finer redshift pixels to search for final redshift
per_camera – (bool): True if fitting needs to be done in each camera for archetype mode
n_nearest (int) – number of nearest neighbours to be used in chi2 space (including best archetype)
prior_sigma (float) – prior to add in the final solution matrix: added as 1/(prior_sigma**2) only for per-camera mode
zminfit_npoints – number of minimum redshift to be fit for final redshift estimation

Returns:

(allresults, allzfit), where “allresults” is a dictionary of the: full chi^2 fit information, suitable for writing to a redrock scan file. “allzfit” is an astropy Table of only the best fit parameters for a limited set of minima.

Return type:

tuple

redrock.zscan

Algorithms for scanning redshifts.

redrock.zscan._batch_dot_product_sparse_gpu(spectra, tdata)[source]

GPU implementation. Calculate a batch dot product of the 3 sparse matrices in spectra with every template in tdata. A CUDA kernel replicates the functionality of the scipy sparse matrix dot product but done for every template in parallel so that the kernel is only called once per spectrum.

Parameters:

spectra (list) – list of Spectrum objects.
tdata (dict) – dictionary of interpolated template values for each wavehash.

Returns:

dot products of these 3 spectra with all templates

Return type:

Tbs (cp.array)

redrock.zscan._calc_batch_dot_product_3d2d_gpu(Tbs, zc)[source]

GPU implementation. Calculate a batch dot product of the 3d array Tbs with the 2d array zc. The 3-d array shape is A x B x C and the 2-d array shape is A x C. The resulting output 2-d array shape is A x B. E.g., for all A a dot product of a 2d array of shape B x C is performed with a 1-d array of shape C. These are non-sparse numpy arrays.

Parameters:

Tbs (array) – the stacked output from all 3 filters from batch_dot_product_sparse, for all redshift templates (nz x nrows x nbasis)
zc (array) – zcoeffs, the 2-d array output of zc = linalg.solve(all_M, all_y) (nz x nbasis)

Returns:

the output of the dot product, (nz x nrows)

Return type:

model (array)

redrock.zscan._mp_calc_zchi2(indx, target_ids, target_data, t, use_gpu, qout, qprog)[source]: Wrapper for multiprocessing version of calc_zchi2.

redrock.zscan._zchi2_one(Tb, weights, flux, wflux, zcoeff, solve_matrices_algorithm)[source]

Calculate a single chi2.

For one redshift and a set of spectral data, compute the chi2 for template data that is already on the correct grid.

redrock.zscan.batch_dot_product_sparse(spectra, tdata, nz, use_gpu)[source]

Calculate a batch dot product of the 3 sparse matrices in spectra with every template in tdata. Sparse matrix libraries are used to perform the dot products.

Parameters:

spectra (list) – list of Spectrum objects.
tdata (dict) – dictionary of interpolated template values for each wavehash.
nz (int) – number of templates
use_gpu (bool) – use GPU or not

Returns:

dot products of these 3 spectra with all templates

Return type:

Tbs (list)

redrock.zscan.calc_batch_dot_product_3d2d(Tbs, zc, use_gpu)[source]

Calculate a batch dot product of the 3d array Tbs with the 2d array zc. The 3-d array shape is A x B x C and the 2-d array shape is A x C. The resulting output 2-d array shape is A x B. E.g., for all A a dot product of a 2d array of shape B x C is performed with a 1-d array of shape C. These are non-sparse numpy arrays.

Parameters:

Tbs (array) – the stacked output from all 3 filters from batch_dot_product_sparse, for all redshift templates (nz x nrows x nbasis)
zc (array) – zcoeffs, the 2-d array output of zc = linalg.solve(all_M, all_y) (nz x nbasis)
use_gpu (bool) – use GPU or not

Returns:

the output of the dot product, (nz x nrows)

Return type:

model (array)

redrock.zscan.calc_batch_dot_product_3d3d_gpu(a, b, transpose_a=False, fullprecision=True)[source]

GPU implementation. Calculate a batch dot product of the 3d array a with the 3d array b. The 3-d array shape is A x B x C and the 2-d array shape is A x C. The resulting output 2-d array shape is A x B. E.g., for all A a dot product of a 2d array of shape B x C is performed with a 1-d array of shape C. These are non-sparse numpy arrays.

Parameters:

a (array) – a 3-d array (nz x ncols x nrows) In practice, the Tb array, the stacked output from all 3 filters from batch_dot_product_sparse, for all redshift templates (nz x nrows x nbasis) which should have its transpose performed yielding shape (nz x nbasis x nrows).
b (array) – another 3-d array (nz x nrows x ncols) In practice, the Tb array multiplied y weights, shape (nz x nrows x nbasis)
transpose_a (bool) – Whether or not to transpose the a array before performing the dot product
fullprecision (bool) – Whether or not to ensure reproducibly identical results. The context is that it can be faster to use many parallel threads to compute a single output array element of the dot product result, but due to floating point rounding issues, the random order of the summation can change the result by order 1e-16 (e.g. a+b+c != c+a+b). When set to true, the number of parallel threads is decreased to ensure reproducibility at a trade-off of speed.

Returns:

the output of the dot product, (nz x ncols x ncols): such that M[i,:,:] = a[i,:,:].dot(b[i,:,:])

Return type:

M (array)

redrock.zscan.calc_negOII_penalty(OIItemplate, coeff)[source]

return penalty term for model having negative [OII] flux

Parameters:

OIItemplate[nwave – portion of template around [OII] doublet
nbasis] – portion of template around [OII] doublet
coeff[nz – coefficients from full template fit to data
nbasis] – coefficients from full template fit to data

Returns:

penalty per redshift bin

Return type:

penalty[nz]

redrock.zscan.calc_zchi2(target_ids, target_data, dtemplate, progress=None, use_gpu=False)[source]

Calculate chi2 vs. redshift for a given PCA template.

New CPU/GPU algorithms June 2022

Parameters:

target_ids (list) – targets IDs.
target_data (list) – list of Target objects.
dtemplate (DistTemplate) – distributed template data
progress (multiprocessing.Queue) – optional queue for tracking progress, only used if MPI is disabled.

Returns:

(zchi2, zcoeff, zchi2penalty) with:

zchi2[ntargets, nz]: array with one element per target per
redshift
zcoeff[ntargets, nz, ncoeff]: array of best fit template
coefficients for each target at each redshift
zchi2penalty[ntargets, nz]: array of penalty priors per target
and redshift, e.g. to penalize unphysical fits

Return type:

tuple

redrock.zscan.calc_zchi2_batch(spectra, tdata, weights, flux, wflux, nz, nbasis, solve_matrices_algorithm=None, solver_args=None, use_gpu=False, fullprecision=True, prior=None)[source]

Calculate a batch of chi2. For many redshifts and a set of spectral data, compute the chi2 for template data that is already on the correct grid.

Parameters:

spectra (list) – list of Spectrum objects.
tdata (dict) – dictionary of interpolated template values for each wavehash - arrays are 3d (nz x nlambda x nbasis)
weights (array) – concatenated spectral weights (ivar).
flux (array) – concatenated flux values.
wflux (array) – concatenated weighted flux values.
nz (int) – number of templates
nbasis (int) – nbasis
solve_matrices_algorithm (string) – PCA, BLVS, or NMF - the algorithm used to solve matrix equation
solver_args (dict) – Optional args to pass to solver, such as bounds array for BVLS.
use_gpu (bool) – use GPU or not
fullprecision (bool) – Whether or not to ensure reproducibly identical results. See calc_batch_dot_product_3d3d_gpu.
prior (2d array) – prior matrix on coefficients (1/sig**2)

Returns:

array with one element per redshift for this target zcoeff (array): array of best fit template coefficients

Return type:

zchi2 (array)

redrock.zscan.calc_zchi2_one(spectra, weights, flux, wflux, tdata, solve_matrices_algorithm)[source]

Calculate a single chi2.

For one redshift and a set of spectra, compute the chi2 for template data that is already on the correct grid.

Parameters:

spectra (list) – list of Spectrum objects.
weights (array) – concatenated spectral weights (ivar).
flux (array) – concatenated flux values.
wflux (array) – concatenated weighted flux values.
tdata (dict) – dictionary of interpolated template values for each wavehash.

Returns:

chi^2 and coefficients.

Return type:

tuple

redrock.zscan.calc_zchi2_targets(targets, templates, mp_procs=1, use_gpu=False)[source]

Compute all chi2 fits for the local set of targets and collect.

Given targets and templates distributed across a set of MPI processes, compute the coarse-binned chi^2 fit for all redshifts and our local set of targets. Each process computes the fits for a slice of redshift range and then cycles through redshift slices by passing the interpolated templates along to the next process in order.

Parameters:

targets (DistTargets) – distributed targets.
templates (list) – list of DistTemplate objects.
mp_procs (int) – if not using MPI, this is the number of multiprocessing processes to use.
gpu (bool) – (optional) use gpu for calc_zchi2

Returns:

dictionary of results for each local target ID.

Return type:

dict

redrock.zscan.dot_product_sparse_one(spectra, tdata, i)[source]

Calculate a dot product of the 3 sparse matrices in spectra with ONE template in tdata. Sparse matrix libraries are used to perform the dot products.

Parameters:

spectra (list) – list of Spectrum objects.
tdata (dict) – dictionary of interpolated template values for each wavehash - arrays are 3d (nz x nlambda x nbasis)
i (int) – index of this redshift

Returns:

dot products of these 3 spectra with ONE templates

Return type:

Tb (array)

redrock.zscan.per_camera_coeff_with_least_square_batch(target, tdata, weights, flux, wflux, nleg, narch, method=None, n_nbh=None, prior=None, use_gpu=False, bands=None)[source]

This function calculates coefficients for archetype mode in each camera using normal linear algebra matrix solver or BVLS (bounded value least square) method

BVLS described in : https://www.stat.berkeley.edu/~stark/Preprints/bvls.pdf

Scipy: https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.lsq_linear.html

Parameters:

target (object) – target object
tdata (dict) – template data for model fit for ALL archetypes
weights (array) – concatenated spectral weights (ivar).
flux (array) – concatenated flux values.
wflux (array) – concatenated weighted flux values.
nleg (int) – number of Legendre polynomials
narch (int) – number of archetypes
method (string) – ‘PCA’, ‘BVLS’, ‘NMF’, or ‘NNLS’ (same as NMF)
n_nbh (int) – number of nearest best archetypes
prior (array) – prior matrix added to the Legendre coefficients (1/sigma^2)
use_gpu (bool) – use GPU or not
bands (list) – list of wavelength bands

Returns:

coefficients and chi2

redrock.zscan.solve_matrices(M, y, solve_algorithm='PCA', solver_args=None, use_gpu=False)[source]

Solve the matrix equation y = M*x for the unknown x using the specified algorithm. The default is to use PCA via numpy or cupy linalg.solve. But non-negative matrix factorization (NMF) and other algorithms may be integrated here and selected based on the template.

Parameters:

M (array) – 2d array on CPU; 3d array on GPU for all redshifts
y (array) – 1d array on CPU; 2d array on GPU for all redshifts
solve_algorithm (string) – which algorithm to use
solver_args (dict) – Optional args to pass to solver, such as bounds array for BVLS.
use_gpu (bool) – use GPU or not

Returns:

solution to y = M*x: (1-d array on CPU, 2-d array on GPU)

Return type:

x (array)

Raises:

LinAlgError is passed up if raised by np.linalg.solve –
NotImplementedError if algorithm is not implemented or undefined –

redrock.zscan.spectral_data(spectra)[source]

Compute concatenated spectral data products.

This helper function builds full length array quantities needed for the chi2 fit.

Parameters:

spectra (list) – list of Spectrum objects.

Returns:

(weights, flux, wflux) concatenated values used for single: redshift chi^2 fits.

Return type:

tuple

redrock.zwarning

Mask bit definitions for zwarning.

WARNING on the warnings: not all of these are implemented yet.