ASTEROID LIGHT CURVES FROM THE PALOMAR TRANSIENT FACTORY SURVEY: ROTATION PERIODS AND PHASE FUNCTIONS FROM SPARSE PHOTOMETRY

Adam Waszczak, Chan-Kao Chang, Eran O. Ofek, Russ Laher, Frank Masci, David Levitan, Jason Surace, Yu-Chi Cheng, Wing-Huen Ip, Daisuke Kinoshita, George Helou, Thomas A. Prince, and Shrinivas Kulkarni

Published 2015 August 18 • © 2015. The American Astronomical Society. All rights reserved.
The Astronomical Journal, Volume 150, Number 3
Citation: Adam Waszczak et al 2015 AJ 150 75
DOI: 10.1088/0004-6256/150/3/75

Dates

Received 2015 March 4
Accepted 2015 April 15
Published 2015 August 18

Keywords

minor planets, asteroids: general; surveys

ABSTRACT

We fit 54,296 sparsely sampled asteroid light curves in the Palomar Transient Factory survey to a combined rotation plus phase-function model. Each light curve consists of 20 or more observations acquired in a single opposition. Using 805 asteroids in our sample that have reference periods in the literature, we find that the reliability of our fitted periods is a complicated function of the period, amplitude, apparent magnitude, and other light-curve attributes. Using the 805-asteroid ground-truth sample, we train an automated classifier to estimate (along with manual inspection) the validity of the remaining ∼53,000 fitted periods. By this method we find that 9033 of our light curves (of ∼8300 unique asteroids) have "reliable" periods. Subsequent consideration of asteroids with multiple light-curve fits indicates a 4% contamination in these "reliable" periods. For 3902 light curves with sufficient phase-angle coverage and either a reliable fit period or low amplitude, we examine the distribution of several phase-function parameters, none of which are bimodal though all correlate with the bond albedo and with visible-band colors. Comparing the theoretical maximal spin rate of a fluid body with our amplitude versus spin-rate distribution suggests that, if held together only by self-gravity, most asteroids are in general less dense than ∼2 g cm⁻³, while C types have a lower limit of between 1 and 2 g cm⁻³. These results are in agreement with previous density estimates. For 5–20 km diameters, S types rotate faster and have lower amplitudes than C types. If both populations share the same angular momentum, this may indicate the two types' differing ability to deform under rotational stress. Lastly, we compare our absolute magnitudes (and apparent-magnitude residuals) to those of the Minor Planet Center's nominal (G = 0.15, rotation-neglecting) model; our phase-function plus Fourier-series fitting reduces asteroid photometric rms scatter by a factor of ∼3.

1. INTRODUCTION

In this work we model an asteroid's apparent visual magnitude V (log flux) as

$$V=H+\delta +5\,{\mathrm{log}}_{10}(r{\rm{\Delta }})-2.5\,{\mathrm{log}}_{10}(\phi )\qquad (1)$$

where H is the absolute magnitude (a constant), δ is a periodic variability term due to rotation (e.g., if the object is spinning and has some asymmetry in shape or albedo), r and Δ are the heliocentric and geocentric distances (in AU), and $\phi =\phi (\alpha )$ is the phase function, which varies with the solar phase angle α (the Sun–asteroid–Earth angle). When $\alpha =0$ (i.e., at opposition), $\phi =1$ by definition, while in general $0<\phi <1$ for $\alpha >0$ (with ϕ decreasing as α increases).
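As an illustrative sketch (not code from the original analysis), the model can be evaluated directly. The form used below, V = H + δ + 5 log10(rΔ) − 2.5 log10(ϕ), is the one implied by the rotation-neglecting expression for V′ quoted in Section 3.2; the function name and the numerical inputs are hypothetical.

```python
import math

def apparent_magnitude(H, delta, r_au, dist_au, phi):
    """V = H + delta + 5 log10(r * Delta) - 2.5 log10(phi).

    H       : absolute magnitude (mag)
    delta   : rotational term at this epoch (mag)
    r_au    : heliocentric distance (AU)
    dist_au : geocentric distance (AU)
    phi     : phase-function value, with phi = 1 at zero phase angle
    """
    if not 0.0 < phi <= 1.0:
        raise ValueError("phi must lie in (0, 1]")
    return H + delta + 5.0 * math.log10(r_au * dist_au) - 2.5 * math.log10(phi)

# Hypothetical values, chosen only for illustration.
v_zero_phase = apparent_magnitude(H=12.0, delta=0.0, r_au=2.5, dist_au=1.5, phi=1.0)
v_half_phi = apparent_magnitude(H=12.0, delta=0.0, r_au=2.5, dist_au=1.5, phi=0.5)
```

Halving ϕ at fixed geometry dims the object by 2.5 log10(2) ≈ 0.75 mag, which is comparable to the rotational amplitudes discussed later; this is one way to see why the two terms need to be fit together.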

A key feature of our approach is the simultaneous fitting of both the phase function ϕ and the rotation term δ. The detailed forms of ϕ and δ, as well as the algorithm underlying our fitting procedure, are motivated by a variety of prior work in this area, as described in the following sections.

1.1. Asteroid Rotation

Building upon the work of Kaasalainen et al. (2001), Hanuš & Ďurech (2012) discuss the inversion of asteroid light-curve data taken over several oppositions to obtain a 3D shape solution. The form of δ (see Equation (1)) in this case consists of a large number of free parameters (several tens to hundreds). Results from inversion agree well with those from stellar occultations, adaptive optics imaging, and in situ spacecraft imagery (Hanuš et al. 2013). Knowledge of the detailed irregular shapes of asteroids improves our ability to constrain models of their internal structure, as well as the magnitude and timescale of spin and orbital evolution due to solar-radiation and thermal emission, including the Yarkovsky and YORP effects (see Bottke et al. 2013 and references therein).

A simpler model for δ—suitable for fitting to data sparser than those required for most inversion methods—is a Jacobi ellipsoid (Chandrasekhar 1969) in its principal-axis spin state. The light curve of such an ellipsoid is a double-peaked sinusoid, given by a simple expression depending solely (assuming constant surface albedo) on the axes ratio and angle between the line of sight and spin axis. The fitted amplitude thus yields a lower-bound elongation estimate for the asteroid.

The predicted distribution of the rotation frequencies of a collisionally equilibrated system of particles has long been claimed to be a Maxwellian function (Salo 1987), which, as reviewed by Pravec et al. (2002), closely approximates the observed distribution of several hundred of the brightest (∼40 km or larger) asteroids but breaks down for smaller objects, among which an excess of slow and fast rotators appears to exist. Steinberg & Sari (2015) more recently argue that collisions instead lead to a Lévy distribution, and that a significant primordial spin component remains in the present observed population. Some studies that have examined the spin distribution of small objects are Pravec et al. (2008), Polishook & Brosch (2009), the Thousand Asteroid Light Curve Survey (Masiero et al. 2009), and two brief observing runs conducted within the Palomar Transient Factory⁷ (PTF) survey (Polishook et al. 2012; Chang et al. 2014a).

Warner et al. (2009) describe the Light Curve Database (LCDB), which compiles several thousand densely sampled light curves of asteroids targeted by dedicated observing teams. Light curves in the LCDB have the following features:

1. LCDB light curves' dense sampling generally permits fitting of Fourier series with many harmonic terms;
2. LCDB light curves are often sampled over the shortest time window necessary to measure the period and therefore generally do not require large or uncertain corrections due to phase-angle effects;
3. LCDB light curves' fitted periods are assigned integer quality codes by a human reviewer (from 1 = poor to 3 = confident).

All three of the above features are either impractical or infeasible when the set of light curves is very large and the data sparsely sampled, as is the case for PTF. In this work we adopt the following modified approaches when fitting light curves:

1. We truncate the rotation curve's Fourier-series fit after the 2nd harmonic, a simplification broadly justified by Harris et al. (2014) and the assumption of an ellipsoidal shape (see Section 3.1.2).
2. We simultaneously fit a phase-function model with the rotational part.
3. We use a machine-learned classifier to objectively aid in estimating the validity of each fitted period. The classifier is trained using all fitted light curves that have previously (and confidently) measured LCDB periods and takes into account the accuracy with which the true period was retrieved along with 20 light-curve metrics (fitted period, amplitude, ratio of peaks, ${\chi }^{2}$ per degree of freedom of fit, number of data points, and more).

Use of a machine classifier in asteroid light-curve period quality assessment is entirely novel and inspired in part by work done by PTF collaborators in extragalactic transient science (Bloom et al. 2012) and variable star science (Masci et al. 2014; Miller et al. 2014), as well as work by A. Waszczak et al. (2015, in preparation) on detection techniques for streaking near-Earth objects. Among the advantages of using a machine-classified quality score is that, via cross-validation with the known-period sample, one estimates the completeness and contamination, i.e., the true-positive rate (TPR) and false-positive rate (FPR) with respect to identifying an accurately fit period, as a function of, e.g., the period, amplitude, etc. The resulting TPR and FPR may then be used to de-bias the classifier-filtered period distribution.
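The completeness and contamination estimates mentioned above reduce to simple counting once each known-period light curve carries two labels: whether the fitted period agrees with the reference period, and whether the classifier accepts the fit. The following is a minimal sketch under that assumption; the function name and the toy labels are hypothetical and are not drawn from the PTF sample.

```python
def tpr_fpr(period_is_accurate, classifier_accepts):
    """Return (true-positive rate, false-positive rate).

    period_is_accurate[i] : True if the fitted period matches the reference period
    classifier_accepts[i] : True if the classifier rates the fit as reliable
    """
    pairs = [(bool(t), bool(p)) for t, p in zip(period_is_accurate, classifier_accepts)]
    n_accurate = sum(1 for t, _ in pairs if t)
    n_inaccurate = len(pairs) - n_accurate
    true_pos = sum(1 for t, p in pairs if t and p)
    false_pos = sum(1 for t, p in pairs if (not t) and p)
    tpr = true_pos / n_accurate if n_accurate else float("nan")
    fpr = false_pos / n_inaccurate if n_inaccurate else float("nan")
    return tpr, fpr

# Toy cross-validation labels (hypothetical).
truth = [True, True, True, False, False, False, False, True]
accepted = [True, True, False, False, True, False, False, True]
rates = tpr_fpr(truth, accepted)
```

Evaluating the same function within bins of period or amplitude would give the bin-wise rates needed for de-biasing; the binning itself is omitted here.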

1.2. Asteroid Phase Functions

The analytic phase function of an ideal Lambertian-scattering sphere fits well to featureless, atmospheric planets like Venus, but quite poorly to airless bodies (see Figure 3.9 of Seager 2010 for a comparison). In later sections we describe several ϕ models that have been derived for (or empirically fit to) asteroids. Qualitatively, asteroids show an approximately linearly decreasing ϕ out to $\alpha \approx 100^\circ$, modified by a surge (increase in slope) at low phase angles ($\alpha \lesssim 5^\circ$), known as the opposition effect (see Figure 1).

Figure 1. Phase curves (from the literature) containing densely sampled, rotation-corrected photometry of asteroids in four taxonomic classes. Colored lines are our original fits to the data using various single-parameter ϕ models (see Section 3.2).

Early work (e.g., Bowell et al. 1989 and references therein) on a small sample of well-observed asteroids suggested that different asteroid spectral types display distinct behavior in ϕ. Figure 1 compares example phase-curve data for D, C, S, and E types,⁸ incorporating photometry from various sources. We emphasize the fact that all of the data points in Figure 1 have been corrected for rotational modulation (the δ in Equation (1)) through dense sampling of each asteroid's light curve at each phase angle (equivalently, each epoch).

Using a large corpus of low-precision photometry from the MPC,⁹ Oszkiewicz et al. (2011, 2012) showed that a fitted parameter of one particular ϕ model correlates well with an asteroid's Sloan Digital Sky Survey (SDSS) visible color. While they were unable to correct for rotational variation (δ-term in Equation (1)), the Oszkiewicz et al. work nevertheless demonstrates a solid trend between ϕ and a compositional attribute (color).

These prior works motivate several defining aspects of this work's phase-function analysis:

1. We fit multiple phase-function models to each light curve, both for compatibility with the literature and to explore how the fitted parameters are related.
2. We simultaneously fit the rotational component with the phase-function part.
3. We introduce a single colorimetric index for quantifying C-type versus S-type taxonomic classification, based on the compilation of several visible-band-color asteroid data sets (see Appendix), and examine the variation in phase-function parameters as a function of this color index.

2. OBSERVATIONS

2.1. Overview of the PTF Survey

The PTF is a synoptic survey designed primarily to discover extragalactic transients (Law et al. 2009; Rau et al. 2009). The PTF camera, mounted on Palomar Observatory's 1.2 m Oschin Schmidt Telescope, uses 11 CCDs (each 2K × 4K) to image 7.3 deg² of sky at a time at 1.0''/pixel resolution. Most exposures (∼85%) use a Mould-R filter¹⁰ (hereafter "R"). The remaining broadband images acquired use a Gunn g-band filter. Nearly all broadband PTF images are 60 s integrations, regardless of filter. About 15% of nights (near full moon) are devoted to a narrowband (Hα) imaging survey of the full Northern Sky.

Science operations began in 2009 March, with a nominal one- to five-day cadence for supernova discovery and typical twice-per-night imaging of fields. Median seeing is 2'' with a limiting magnitude $R\approx 20.5$ (for 5σ point-source detections), while dark conditions routinely yield $R\approx 21.0$ (Law et al. 2010).

The PTF survey is ongoing and expected to continue through mid-2016. In January 2013 the PTF project formally entered a second phase called the intermediate PTF ("iPTF"; Kulkarni 2013). In this paper we simply use "PTF" to mean the entire survey, from 2009 through the present (2015). The iPTF program accommodates more varied "sub-surveys" as opposed to a predominantly extragalactic program, including variable star and solar system science. Images are still acquired with the same telescope/camera/filters with 60 s exposures and are processed by the same reduction pipeline.

Laher et al. (2014) describe the PTF data reduction and archiving pipelines, hosted at the Infrared Processing and Analysis Center (IPAC) at Caltech. Processing at IPAC includes bias and flat-field corrections, astrometric calibration against UCAC3 (Zacharias 2010), astrometric verification against Two Micron All Sky Survey (2MASS; Skrutskie et al. 2006), creation of source catalogs with Source Extractor (Bertin & Arnouts 1996), and production of reference images (stacks of ∼20–30 PTF images that reach $V\approx 22$).

Ofek et al. (2012a, 2012b) describe the PTF survey's absolute photometric calibration method, which relies on source matching with SDSS DR7 (Abazajian et al. 2009), and thus requires at least partial overlap of PTF with SDSS each night. A separate, relative photometric calibration (based on light curves of non-variable field stars) also exists for PTF data and is described by Levitan et al. (2011) and in the Appendix of Ofek et al. (2011). In this work we utilize all R-band and g-band PTF data accumulated from the survey's start (2009 March) through 2014 July. The asteroid magnitudes reported in this work use relative photometric zero points when available (which as of this writing applies to ∼85% of PTF images) and absolute photometric zero points otherwise.

The PTF's robotic survey program and processing pipeline, as well as our data aggregation and analysis in this work, make use of many functions from the MATLAB package for astronomy and astrophysics (Ofek 2014).

2.2. This Work's Data Set

Waszczak et al. (2013) used a custom spatial indexing algorithm to search the set of all PTF single-epoch transient detections (through 2012 July) for detections of all asteroids with orbits known as of 2012 August. That search procedure first generated uniformly spaced ephemerides for each asteroid using JPL's online service (HORIZONS; Giorgini et al. 1996). Each asteroid's ephemeris defines a 3D curve (two sky coordinates plus one time); the intersection of each curve with the 3D kd-tree of transient detections was then computed and positive detections within a 4'' matching radius saved.
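The final step of such a search, accepting a detection when it lies within 4'' of the predicted position, can be sketched as follows. This is a brute-force stand-in, not the kd-tree algorithm described above: it compares every predicted position with every detection from one exposure, and the function names and coordinates are hypothetical.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi

def angular_separation_arcsec(ra1_deg, dec1_deg, ra2_deg, dec2_deg):
    """Great-circle separation via the haversine formula (stable at small angles)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1_deg, dec1_deg, ra2_deg, dec2_deg))
    h = (math.sin((dec2 - dec1) / 2.0) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2.0) ** 2)
    return 2.0 * math.asin(min(1.0, math.sqrt(h))) * ARCSEC_PER_RADIAN

def match_detections(predicted, detections, radius_arcsec=4.0):
    """Return (prediction index, detection index) pairs within the matching radius.

    Both inputs are lists of (ra_deg, dec_deg) tuples for a single exposure.
    The double loop is O(N*M); a spatial index would replace it at survey scale.
    """
    matches = []
    for i, (ra_p, dec_p) in enumerate(predicted):
        for j, (ra_d, dec_d) in enumerate(detections):
            if angular_separation_arcsec(ra_p, dec_p, ra_d, dec_d) <= radius_arcsec:
                matches.append((i, j))
    return matches

# One predicted position and two detections offset by 3'' and 6'' in declination.
predicted = [(150.0, 10.0)]
detections = [(150.0, 10.0 + 3.0 / 3600.0), (150.0, 10.0 + 6.0 / 3600.0)]
pairs = match_detections(predicted, detections)
```

Only the detection 3'' away is retained. With millions of detections, the inner loop is what a kd-tree or tile-based index is meant to avoid.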

In this work we use a modified version of the Waszczak et al. (2013) algorithm. The updates/changes are as follows.

First, in terms of content, we now search all PTF (R and g-band) data from 2009 March 01 through 2014 July 18 for all numbered asteroids as of 2014 July 12 (401,810 objects). We now exclude unnumbered objects as the positional uncertainty of these objects can be very large, and as they tend to be very faint, their light curves will not in general be of high quality.

Second, in place of a single-step matching of a 3D transient-detection kd-tree against 3D ephemeris curves, we now divide the search into two main steps. We first perform a 2D spatial matching that exploits the natural indexing of PTF exposures into tiles (i.e., the grid of evenly spaced boresights or "fields" on the sky). Each 2D ephemeris curve's intersection with the 2D PTF survey footprint is computed, the object's position is cubically interpolated to all epochs of exposures possibly containing the object, and the object's precisely computed position is then compared to the precise image boundaries of candidate exposures. Matching of predicted positions against actual detections takes place subsequently, as source catalogs are loaded into memory (as needed and in parallel). This method is faster than the original Waszczak et al. (2013) method and enables separate logging of predicted and positive detections.

The results of the known-asteroid search, as well as the derived light-curve data (described later), are stored in a relational database, the size and contents of which are summarized in Table 1. Out of ∼18 million predicted single-epoch asteroid sightings (including predicted magnitudes as dim as $V\approx 23$ , well below PTF's sensitivity), there were 8.8 million positive detections (within a 4'' radius). Of these, we define 4.3 million detections as "reliable" as they (1) lack any cataloged background sources within the 4'' radius, (2) have a calibrated magnitude uncertainty of less than 0.1 mag, and (3) lack any processing flags indicative of contamination. Figure 2 compares predicted, positive, and "reliable" detections; the middle and right panels of Figure 2 show that our definition of "reliable" seems to include a small fraction of likely bad observations (<1% contamination, note the vertical log scale), namely, those that have distance residuals greater than $\sim 1^{\prime\prime}$ or magnitude residuals greater than ∼1 mag. Because these reliable detections are the subset of observations that we input into our light-curve-fitting model (Section 4), the fitting algorithm includes logic designed to remove isolated data points that have very large residuals, either with respect to the median light curve value or relative to their uncertainty.
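The three "reliable" criteria amount to a simple per-detection filter. The sketch below assumes a hypothetical record layout (the actual PTF database columns are only summarized in Table 1) and, as a simplification, treats any nonzero flag value as disqualifying, whereas the text refers only to flags indicative of contamination.

```python
from dataclasses import dataclass

@dataclass
class Detection:
    # Hypothetical field names, not the PTF database schema.
    n_catalog_sources_within_4arcsec: int  # cataloged background sources nearby
    mag_uncertainty: float                 # calibrated magnitude uncertainty (mag)
    flags: int                             # 0 means no processing flag is set

def is_reliable(det, max_mag_uncertainty=0.1):
    """Apply criteria (1)-(3): no background source, small error, no flags."""
    return (det.n_catalog_sources_within_4arcsec == 0
            and det.mag_uncertainty < max_mag_uncertainty
            and det.flags == 0)

# Toy detections (hypothetical values).
detections = [
    Detection(0, 0.04, 0),  # passes all three criteria
    Detection(1, 0.04, 0),  # fails (1): cataloged source within 4''
    Detection(0, 0.15, 0),  # fails (2): uncertainty of 0.1 mag or more
    Detection(0, 0.04, 4),  # fails (3): a processing flag is set
]
reliable = [d for d in detections if is_reliable(d)]
```

A filter of this kind cannot catch uncataloged background sources, which is consistent with the residual <1% contamination noted above.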

Figure 2. Comparison of predicted asteroid sightings against positive and "reliable" asteroid detections. We define a "reliable" detection as any positive detection that (1) lacks any *cataloged* background sources within a 4'' radius, (2) has a calibrated magnitude uncertainty of less than 0.1 mag, and (3) lacks any processing warning flags. As suggested by the middle and right column of plots, this definition of "reliable" still contains some small contamination (at the <1% level) from uncataloged background sources and/or noise, as indicated by detections with distance residuals greater than ∼1'' or magnitude residuals of greater than ∼1 mag. In panel D, the less than 100% completeness at the bright end reflects the non-negligible probability that any asteroid will fall within 4'' of a cataloged background source (regardless of the magnitude of either the asteroid or the background source).

Table 1. Description of the PTF Asteroid Database

Table	# Rows	Example Columns (not necessarily comprehensive)
PTF tiles	11,169	R.A., Decl., tile ID
Exposures	304,982	Epoch, filter, exposure time, absolute photometric zero point, tile ID, exposure ID
CCD images	3,305,426	CCD ID, corners R.A. and Decl., seeing, limiting mag., relative phot. zero point, # of sources, exposure ID, image ID
Asteroids	401,810	Name, orbital elements, color data (e.g., SDSS), IR data (e.g., WISE), known rotation period, asteroid ID (number)
Predicted sightings	17,929,274	R.A., Decl., rates, helio- and geocentric range, phase and elong. angle, pred. V mag., image ID, asteroid ID, prediction ID
Positive detections	8,842,305	R.A., Decl., instrumental mag., local zero point, shape data, quality flags, prediction ID, light-curve ID, detection ID
Reliable detections^a	4,392,395	Detection ID
Light curves^b	587,466	# of constituent detections, filter, opposition year, median mag., asteroid ID, light-curve ID
Light-curve fits^c	54,296	Fitted light-curve parameters, human-assigned quality code, machine-classified quality index, light-curve ID, fit ID
Reliable-period fits^d	9033	Fit ID
Reliable-G₁₂ fits^e	3902	Fit ID

Note. Includes PTF data acquired from 2009 March through 2014 July, excluding Hα survey data.

^a "Reliable" detections are those free from possible background-source or bright star contamination, magnitude errors >0.1 mag, and certain SExtractor flags.
^b A light curve is here defined as a set of positive detections of a given asteroid in a single filter and opposition.
^c Light-curve fits only exist for light curves that contain at least 20 reliable detections and converged to a solution during the light-curve-fitting process.
^d Fits have reliable rotation periods if a human screener labels the period reliable and the machine classifier rates it above a certain quality threshold (see text).
^e Fits have reliable G₁₂ phase-function parameter if (1) amplitude <0.1 mag or period is reliable, and (2) fit has sufficient phase-angle coverage (see Section 6.3).

3. LIGHT-CURVE MODEL

Equation (1) presents the overall form and notation of our asteroid light-curve model. In this section we describe the detailed parameterization and assumptions of the model.

3.1. Rotation Component

3.1.1. Intra-opposition Constraint

The most important parameter in the rotation component (the δ in Equation (1)) is the synodic spin period P, a constant that satisfies

$$\delta (\tau )=\delta (\tau +nP)\qquad (2)$$

where $\tau \equiv t-{\rm{\Delta }}/c$ is the light-time-corrected observation timestamp, ${\rm{\Delta }}={\rm{\Delta }}(t)$ is the asteroid's geocentric distance, c is the speed of light, and n is any integer satisfying

where ${P}_{\mathrm{orb}}$ is the synodic orbital period,

where ${T}_{\mathrm{orb}}$ is the asteroid's sidereal orbital period and ${a}_{\mathrm{orb}}$ is its orbital semimajor axis (related by Kepler's third law). ${P}_{\mathrm{orb}}$ is the time elapsed between the asteroid's consecutive oppositions. Pursuant to this restriction, we constrain each δ solution using observations from within the same opposition, i.e., for most asteroids, within a 1.1–1.6 yr interval centered on the date of the local minimum in observed α.
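The two quantities defined above are straightforward to compute. The sketch below assumes the standard synodic-period relation, 1/P_orb = |1/(1 yr) − 1/T_orb|, together with Kepler's third law in the form T_orb = a_orb^(3/2) with T_orb in years and a_orb in AU; the function names are hypothetical, and the light-time correction simply implements τ = t − Δ/c.

```python
import math

C_AU_PER_DAY = 173.1446  # speed of light (299,792.458 km/s) expressed in AU per day

def sidereal_period_yr(a_au):
    """Kepler's third law for a heliocentric orbit: T [yr] = a [AU] ** 1.5."""
    return a_au ** 1.5

def synodic_orbital_period_yr(a_au):
    """Time between consecutive oppositions as seen from Earth, in years."""
    t_orb = sidereal_period_yr(a_au)
    rate = abs(1.0 - 1.0 / t_orb)  # relative mean motion in revolutions per year
    if rate == 0.0:
        raise ValueError("synodic period is undefined for a = 1 AU")
    return 1.0 / rate

def light_time_corrected(t_days, geocentric_dist_au):
    """tau = t - Delta / c, with t in days and Delta in AU."""
    return t_days - geocentric_dist_au / C_AU_PER_DAY

# Main-belt semimajor axes (illustrative values).
p_inner = synodic_orbital_period_yr(2.0)  # roughly 1.55 yr
p_mid = synodic_orbital_period_yr(2.5)    # roughly 1.34 yr
p_outer = synodic_orbital_period_yr(3.3)  # roughly 1.20 yr
```

For semimajor axes spanning most of the main belt, the resulting synodic periods fall inside the 1.1–1.6 yr range quoted above, and the light-time correction at Δ = 1 AU is about 499 s.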

The intra-opposition restriction is important given that our data set (described in Section 2.2) spans ∼5 yr. For an asteroid with a zero-inclination circular orbit and spin axis perpendicular to its orbital plane, we can relax Equation (3) to allow n to be any integer, in which case δ can be constrained using observations spanning many years. In general, however, Equation (2) must be modified to accommodate a varying viewing geometry with respect to the spin axis:

where F is some unknown periodic function satisfying $F(t)=F(t+m{T}_{\mathrm{orb}})$, where m is any integer and ${T}_{\mathrm{orb}}$ is the sidereal orbital period. Provided that the amplitude of F is not large relative to that of δ, and provided that the spin vector is not changing with respect to the orbital plane (i.e., precessing¹¹) on a timescale comparable to ${P}_{\mathrm{orb}}$, we are justified in assuming that Equation (2) (with the Equation (3) restriction) applies.

3.1.2. Second-order Fourier Series

Any δ satisfying Equation (2) can be approximated to arbitrary precision using a Fourier series. Harris et al. (2014) discuss why, from a geometric standpoint, the second harmonic tends to dominate an asteroid's fitted δ. As noted earlier (Section 1.1), most large asteroids approximately resemble triaxial prolate ellipsoids (e.g., Jacobi ellipsoids), having equatorial axis ratios of at most ∼3:1 (corresponding to a ${\delta }_{\mathrm{max}}-{\delta }_{\mathrm{min}}$ amplitude of ∼1.2 mag). For less extreme axis ratios (specifically, those producing a ∼0.4 mag or smaller second-harmonic amplitude), other harmonics related to shape or albedo asymmetries may contribute comparable coefficients to the Fourier approximation of δ.
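The correspondence between axis ratio and amplitude quoted above follows from the usual geometric relation for a uniform-albedo ellipsoid viewed equator-on, in which the peak-to-peak amplitude is 2.5 log10(a/b) for equatorial semi-axes a ≥ b. The sketch below assumes that relation; the function names are hypothetical. For other viewing angles the observed amplitude is smaller, which is why a fitted amplitude gives only a lower bound on elongation.

```python
import math

def amplitude_from_axis_ratio(a_over_b):
    """Peak-to-peak amplitude (mag) of an equator-on ellipsoid with a/b >= 1."""
    if a_over_b < 1.0:
        raise ValueError("a_over_b must be >= 1")
    return 2.5 * math.log10(a_over_b)

def min_axis_ratio_from_amplitude(amplitude_mag):
    """Lower bound on a/b implied by an observed peak-to-peak amplitude."""
    return 10.0 ** (0.4 * amplitude_mag)

amp_3_to_1 = amplitude_from_axis_ratio(3.0)           # about 1.19 mag
ratio_for_0p4 = min_axis_ratio_from_amplitude(0.4)    # about 1.45
```

The 3:1 case reproduces the ∼1.2 mag figure, and the ∼0.4 mag threshold mentioned above corresponds to an axis ratio of roughly 1.45:1 under the same assumption.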

The PTF survey program has, on a few rare occasions, conducted high-cadence (∼10 minute spacing) observations of low ecliptic latitude fields. These runs produced a set of ∼1000 densely sampled main-belt asteroid rotation curves, which have already been analyzed and published (Polishook et al. 2012; Chang et al. 2014a). These high-cadence "pilot studies" are relevant to our present work in that they demonstrate (1) the quality of the PTF survey's photometric calibration for asteroids with unambiguously valid δ solutions and (2) the above-described prevalence of a dominant second harmonic in most of the objects sampled.

Following these pilot studies, we adopt a second-order Fourier series model:

where τ is the light-time-corrected epoch (see Equation (2)). In the pilot studies, most of the fitted δ solutions qualitatively resemble a simple sine or cosine function. Such a solution can be represented by either

1. a first harmonic with period $P={P}_{1}$ (with ${A}_{i,1}\ne 0$ and ${A}_{i,2}=0$), or
2. a second harmonic of period $P=\frac{1}{2}{P}_{1}$ (with ${A}_{i,1}=0$ and ${A}_{i,2}\ne 0$).

Given the prolate ellipsoid model, choice 2 is more realistic and hence preferred. However, again recognizing that other harmonics can have a non-negligible contribution, in fitting δ to our light-curve sample we allow the first-harmonic coefficients ${A}_{i,1}$ to be non-zero, but introduce logic into the fitting algorithm (see Section 4) that checks for double-period solutions satisfying certain criteria and iterates accordingly.
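To make the distinction between the harmonics concrete, the sketch below evaluates a second-order Fourier series and counts its maxima per period. It assumes the series has the generic form of a sum over k = 1, 2 of a sine and a cosine term in 2πkτ/P; which coefficient index multiplies the sine and which the cosine is an assumption here, as are the function names and coefficient values.

```python
import math

def fourier_delta(tau, period, coeffs):
    """Second-order Fourier series; coeffs maps harmonic k to (a_sin, a_cos) in mag."""
    total = 0.0
    for k, (a_sin, a_cos) in coeffs.items():
        angle = 2.0 * math.pi * k * tau / period
        total += a_sin * math.sin(angle) + a_cos * math.cos(angle)
    return total

def count_maxima_per_period(period, coeffs, n_samples=720):
    """Count local maxima of the series over one full period (cyclic sampling)."""
    values = [fourier_delta(i * period / n_samples, period, coeffs)
              for i in range(n_samples)]
    count = 0
    for i in range(n_samples):
        previous_value = values[i - 1]              # wraps around at i = 0
        next_value = values[(i + 1) % n_samples]
        if values[i] > previous_value and values[i] >= next_value:
            count += 1
    return count

PERIOD_HR = 6.0  # hypothetical rotation period
first_harmonic_only = {1: (0.20, 0.10), 2: (0.0, 0.0)}
second_harmonic_only = {1: (0.0, 0.0), 2: (0.20, 0.10)}

peaks_first = count_maxima_per_period(PERIOD_HR, first_harmonic_only)
peaks_second = count_maxima_per_period(PERIOD_HR, second_harmonic_only)
```

A pure second harmonic gives the double-peaked curve expected of an elongated ellipsoid and repeats after half the period, which is the source of the period ambiguity that the double-period check is meant to resolve.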

3.2. Phase-function Component

In this work we simultaneously fit each light curve's phase function ϕ along with its rotation curve δ (see Equation (1)). This approach is intermediate in complexity between some of the simpler, two-parameter (δ-neglecting) models that have been applied to very large data sets (e.g., Oszkiewicz et al. 2012; Williams 2012) and the more complex, shape plus pole-orientation models (Kaasalainen 2004; Cellino et al. 2009; Hanuš & Ďurech 2012) which can involve tens of parameters and require data spanning multiple oppositions.

Regarding the former class of models, we note that there is a formal statistical problem associated with neglecting δ when fitting ϕ. If modeling the observations M by $V^{\prime} \equiv V-\delta =H+5{\mathrm{log}}_{10}(r{\rm{\Delta }})-2.5{\mathrm{log}}_{10}(\phi )$ , then the distribution of residuals $M-V^{\prime}$ is not Gaussian. Assuming that δ is a sinusoid with amplitude A, for observations M sampling the light curve at random times, the residual probability density function $p=p(M-V^{\prime} )$ has a local minimum value ${p}_{\mathrm{min}}$ at $M-V^{\prime} =0$ and maximum value ${p}_{\mathrm{max}}$ near $M-V^{\prime}$ = ±A. Thus, p is bimodal and roughly bowl shaped—not at all Gaussian shaped. The uncertainty in ϕ produced by a standard ${\chi }^{2}$ minimization—which assumes Gaussian-distributed errors—is thus inaccurate. However, since p is symmetric about $M-V^{\prime} =0$ , for densely sampled data the fitted phase function ϕ remains unaffected by neglecting δ; in such a case the only effect is an underestimated uncertainty.
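The bowl shape described above can be checked numerically. For a sinusoid of amplitude A sampled uniformly in rotational phase, the residual density is the arcsine distribution, $p(x)=1/(\pi \sqrt{{A}^{2}-{x}^{2}})$, which is smallest at x = 0 and diverges toward x = ±A. The sketch below uses evenly spaced phases as a deterministic stand-in for random sampling; the function name and the 10% window width are arbitrary choices.

```python
import math

def residual_fractions(amplitude, n_samples=100_000, window=0.1):
    """Fractions of residuals with |x| < window*A and with |x| > (1 - window)*A.

    Both regions have the same total width (2 * window * A), so the two
    fractions would be equal if the residuals were uniformly distributed.
    """
    near_zero = 0
    near_extremes = 0
    for i in range(n_samples):
        phase = (i + 0.5) / n_samples  # evenly spaced rotational phase in [0, 1)
        x = amplitude * math.sin(2.0 * math.pi * phase)
        if abs(x) < window * amplitude:
            near_zero += 1
        elif abs(x) > (1.0 - window) * amplitude:
            near_extremes += 1
    return near_zero / n_samples, near_extremes / n_samples

frac_center, frac_edges = residual_fractions(amplitude=0.3)
```

Roughly 6% of the residuals fall in the central window and roughly 29% in the two outer windows of equal total width, a bimodal pattern that a Gaussian error model does not describe.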

We obtain three separate fits for each light curve, each using a different phase function (ϕ) and allowing for unique solutions for H and δ in Equation (1). The three phase-function models are:

1. the two-parameter model of Shevchenko (1997);
2. the one-parameter G model (Bowell et al. 1989);
3. the one-parameter G₁₂ model (Muinonen et al. 2010).

In this section we review and motivate the application of each of these ϕ models.

3.2.1. Two-parameter Shevchenko Model

Shevchenko (1997) introduced a phase function dependent on two parameters; in terms of Equation (1) the model is¹²

where β has units of mag/deg and C is the amplitude of the opposition surge (units of mag). This model was subsequently considered in-depth by Belskaya & Shevchenko (2000, hereafter B&S), who compiled the most complete (to date) set of high-precision, targeted phase-curve observations of main-belt asteroids from various data sets spanning several decades.

Though in practice Shevchenko's model is the least commonly used phase function out of the three we consider, it is by far the simplest to express mathematically, and it is the only model for ϕ whose parameters have linear dependence in Equation (1).

Furthermore, this model's parameters are the most straightforward to associate with physical asteroid properties. B&S highlighted a robust relationship between an asteroid's ($\beta ,C$) phase-function parameters and its geometric albedo.¹³ As we later explore a similar relationship in the present work, we here review the basis of this observation.

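The geometric albedo p_V is formally defined in terms of the phase function ϕ:

$${p}_{V}\equiv \frac{{A}_{\mathrm{bond}}}{2{\int }_{0}^{\pi }\phi (\alpha )\mathrm{sin}(\alpha )\,d\alpha }\equiv \frac{{A}_{\mathrm{bond}}}{q}\qquad (8)$$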

where ${A}_{\mathrm{bond}}$ is the (visible) bond albedo, defined as the total visible light energy reflected or scattered by the asteroid (in all directions) divided by the total visible light energy incident upon the asteroid (from the Sun). We also here define the phase integral q.
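As a numerical check on these definitions, the sketch below evaluates the phase integral in its standard form, q = 2∫₀^π ϕ(α) sin α dα, for the ideal Lambertian sphere mentioned in Section 1.2, whose phase function is known analytically. The function names are hypothetical, and the Lambertian case is used only because its result (q = 3/2, hence p_V = 2/3 for unit bond albedo) is a textbook value; it is not one of the asteroid models fit in this work.

```python
import math

def lambert_phase_function(alpha_rad):
    """Phase function of an ideal Lambertian sphere, normalized so phi(0) = 1."""
    return (math.sin(alpha_rad) + (math.pi - alpha_rad) * math.cos(alpha_rad)) / math.pi

def phase_integral(phi, n_steps=20_000):
    """q = 2 * integral from 0 to pi of phi(alpha) * sin(alpha), midpoint rule."""
    step = math.pi / n_steps
    total = 0.0
    for i in range(n_steps):
        alpha = (i + 0.5) * step
        total += phi(alpha) * math.sin(alpha)
    return 2.0 * step * total

def geometric_albedo(bond_albedo, q):
    """p_V = A_bond / q."""
    return bond_albedo / q

q_lambert = phase_integral(lambert_phase_function)
p_lambert = geometric_albedo(1.0, q_lambert)
```

Because q depends on the full shape of ϕ, two surfaces with the same bond albedo but different phase-curve slopes have different geometric albedos, which is the point made in the discussion that follows.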

B&S showed that, in the range of β observed from S-type to C-type asteroids, β and C are empirically correlated, in a relation that we approximate here as

Using Equation (9) to substitute for C in Equation (7), inserting the result into Equation (8), and numerically evaluating the integral gives

B&S saw a negative correlation between p_V and β in the data,¹⁴ consistent with Equation (10) only if either ${A}_{\mathrm{bond}}$ is assumed constant among different asteroid types (not a reasonable assumption) or if ${A}_{\mathrm{bond}}$ negatively correlates with β, which B&S did not explicitly show.

The bond albedo ${A}_{\mathrm{bond}}$ can be thought of as an intrinsic, bulk-compositional characteristic of an asteroid's surface,¹⁵ much like an asteroid's color, whereas β and C relate (in part) to the textural, particulate, and macroscopic roughness of the asteroid's surface. B&S and other authors separately associate β with the shadow-hiding effect and C with the coherent backscatter effect. Both of these physical phenomena are understood from a theoretical standpoint (e.g., Helfenstein & Veverka 1989; Hapke 2012) to be functions of ${A}_{\mathrm{bond}}$ , with β negatively related to ${A}_{\mathrm{bond}}$ and C positively related. This is consistent with Equation (9) and renders Equation (10) consistent with B&S's noted p_V-versus-β correlation. Other properties such as particle size, particle geometry, and regolith porosity also have predicted (and laboratory-measured) contributions to the observed phase function (Hapke 2012 and references therein); these properties can conceivably vary independently of ${A}_{\mathrm{bond}}$ .

In short, our interpretation of the S-type and C-type asteroid data reviewed by B&S is that a compositional indicator ( ${A}_{\mathrm{bond}}$ ) correlates with indicators of two independent phenomena (β and C) that contribute to how light scatters from an asteroid's surface. This statement intentionally makes no mention of p_V, since Equation (8) tells us that p_V by definition varies with β (in a non-obvious way) and with ${A}_{\mathrm{bond}}$ , the latter being a more basic compositional attribute.

As stated above, the phase function can be related to properties other than ${A}_{\mathrm{bond}}$ , such as regolith porosity. Many of these other properties in theory and experiment contribute to effects involving multiply scattered light and therefore do not alter the effect of shadow hiding (β-term in Equation (7)), which is dominated by singly scattered light (Hapke 2012). In contrast, the coherent backscatter effect (C-term) does involve multiply scattered light. B&S saw non-monotonic behavior in C as a function of p_V when including the rarer, high-p_V E-type asteroids in the same plot as C and S types. E types do conform, however, to the same negative monotonic trend in p_V-versus-β satisfied by the C and S types, consistent with the hypothesis that β is adequately expressed as a function of ${A}_{\mathrm{bond}}$ alone, yet E types have a lower-than-predicted C value based on extrapolation of Equation (9).

One possibility is that Equation (9) is not valid for all asteroids, but must be replaced by some unknown non-monotonic relationship, possibly because C depends non-monotonically on ${A}_{\mathrm{bond}}$ and/or has comparable dependence on other properties (e.g., porosity or grain size). Assuming that Equation (7) is a sufficiently general model for ϕ, and lacking knowledge of a good model for C, it follows that β and C should in practice always be fit separately. Another possibility is that Equation (7) is an incorrect or incomplete model; however, B&S described no instances wherein their model was unable to adequately fit the data for a particular asteroid or class of asteroids.

3.2.2. Lumme–Bowell G Model

The next phase-function model we consider is the Lumme–Bowell model (Bowell et al. 1989), also known as the (H,G) or IAU phase function:

Like Shevchenko's model, this model includes two terms (the basis functions ${\phi }_{1}$ and ${\phi }_{2}$ ) representing two physically distinct contributions to the observed ϕ. As detailed in Bowell et al. (1989), this model is semi-empirical in that it was derived from basic principles of radiative transfer theory with certain assumptions, and at various stages tailored to match existing laboratory and astronomical observations. That the two basis functions' coefficients are related to a single parameter G bears resemblance to the β-versus-C correlation described by Equation (9).

Marsden (1986) marked the IAU's adoption of this phase function as a standard model for predicting an asteroid's brightness. Since then this model has seen widespread application, and it is often used with the assumption G = 0.15 (e.g., in the ephemeris computation services offered by the MPC and JPL). Harris & Young (1988) present mean values of G for several of the major asteroid taxonomic classes (based on a sample of ∼80 asteroids), with G = 0.15 being an average between the C types ( $G\approx 0.08$ ) and the S types ( $G\approx 0.23$ ). The G-model fails to accurately fit the rarer D types (which have linear phase curves) and E types (which have very sharp opposition spikes), whereas the Shevchenko model can properly accommodate these rarer types.

Use of the Lumme–Bowell ϕ in our light-curve model (Equation (1)) introduces a second nonlinear parameter (G) into the model, the period P being the other nonlinear parameter. This complicates the fitting algorithm somewhat, as described in Section 4.
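As a concrete illustration of the two-term form described above, the following Python sketch evaluates the (H,G) phase function using the widely quoted exponential approximations to the basis functions from Bowell et al. (1989), with constants 3.33, 0.63, 1.87, and 1.22. The function names are illustrative only and are not part of this work's MATLAB code; the sketch assumes the phase angle is given in degrees.

```python
import math

def hg_phase_function(alpha_deg, G=0.15):
    """Two-term (H,G) phase function: phi = (1 - G)*phi1 + G*phi2."""
    t = math.tan(math.radians(alpha_deg) / 2.0)
    phi1 = math.exp(-3.33 * t ** 0.63)  # steeper basis function
    phi2 = math.exp(-1.87 * t ** 1.22)  # shallower basis function
    return (1.0 - G) * phi1 + G * phi2

def reduced_magnitude(H, alpha_deg, G=0.15):
    """Distance-corrected magnitude at phase angle alpha for absolute magnitude H."""
    return H - 2.5 * math.log10(hg_phase_function(alpha_deg, G))

# Compare the mean C-type and S-type slopes quoted above at alpha = 20 deg.
for G in (0.08, 0.15, 0.23):
    print(G, round(reduced_magnitude(10.0, 20.0, G), 3))
```

Because G sits inside the logarithm, it cannot be absorbed into a linear coefficient of the magnitude model, which is why it must be treated as a second nonlinear parameter alongside the period.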

3.2.3. Muinonen et al. G₁₂ Model

The third phase-function model we consider, introduced by Muinonen et al. (2010), bears resemblance to the G-model but includes a second free parameter and a third basis function:

As opposed to the analytic trigonometric basis functions of the G-model, here ${\phi }_{1}$ , ${\phi }_{2}$ , and ${\phi }_{3}$ (all functions of α alone) are defined in terms of cubic splines (see Muinonen et al. 2010 for the exact numerical definitions). Assuming that the coefficients G₁ and G₂ are constrained independently, these basis functions were designed to provide the most accurate fits to the phase functions of all major asteroid taxonomic types, including the rarer D types and E types.

For situations where fitting G₁ and G₂ separately is infeasible, Muinonen et al. (2010) specialized their above model to make it a function of a single parameter, G₁₂, which parameterizes G₁ and G₂ using piecewise functions:

In this work we use this single-parameter G₁₂ form of the Muinonen et al. model, making it analogous to the G-model in terms of implementation, including the complication associated with a nonlinear parameter.
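The piecewise mapping from G₁₂ to (G₁, G₂) can be sketched as follows. The numerical coefficients are the values commonly quoted from Muinonen et al. (2010) and should be verified against that paper before use; the function name is hypothetical. The two branches agree at G₁₂ = 0.2 to within rounding of the published coefficients, which serves as a basic consistency check.

```python
def g12_to_g1_g2(g12):
    """Map the single parameter G12 to the pair (G1, G2).

    Coefficients as commonly quoted from Muinonen et al. (2010);
    treat them as an assumption and check against the original paper.
    """
    if g12 < 0.2:
        g1 = 0.7527 * g12 + 0.06164
        g2 = -0.9612 * g12 + 0.6270
    else:
        g1 = 0.9529 * g12 + 0.02162
        g2 = -0.6125 * g12 + 0.5572
    return g1, g2

# The third basis-function weight is then 1 - G1 - G2.
g1, g2 = g12_to_g1_g2(0.5)
print(g1, g2, 1.0 - g1 - g2)
```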

3.2.4. Multi-parameter Hapke Model

Just as we commented on the more rigorous means of fitting a rotation curve via 3D shape modeling with multi-opposition data, for completeness we note that a more rigorous model (than the three presented above) exists for phase functions. Given better-sampled light curves and more computational power, future modeling of large photometric data sets would benefit from applying the more theoretically motivated model of Hapke (2012), an abbreviated form of which is

Here w is the single-scattering albedo (see footnote 15), of which r₀ is solely a function. The remaining factors all are functions of phase angle (α). Each opposition-surge term (B_S and B_C) has two free parameters (width and amplitude). K depends on the mean topographic roughness (a function of one free parameter); g is the single-scattering angular distribution function (typically includes one parameter); h is a function of α only; and ${\phi }_{L}$ is the phase function of an ideal Lambertian-scattering sphere (a simple function of α).

With its $\phi \propto {p}_{V}^{-1}$ dependence, the Hapke model (Equation (14)) can conveniently eliminate both p_V and H from the modeling process. Inserting Equation (14) into Equation (1), and using the common relation¹⁶

where H is the absolute visual magnitude, D is the asteroid's effective diameter, and 1329 km is a constant (set by the arbitrarily defined magnitude of the Sun), produces a model with many physically meaningful parameters and free of both H and p_V.
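The diameter relation referred to above is commonly written as D = 1329 km × 10^(−H/5) / √p_V. A minimal Python sketch of it follows; the function name is illustrative.

```python
import math

def diameter_km(H, p_V):
    """Effective diameter (km) from absolute visual magnitude H and geometric albedo p_V."""
    return 1329.0 / math.sqrt(p_V) * 10.0 ** (-H / 5.0)

# Example: an H = 12 asteroid under the 7% albedo assumed elsewhere in this work.
print(diameter_km(12.0, 0.07))
```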

4. LIGHT-CURVE-FITTING ALGORITHM

We solve Equation (1) using a custom linear least squares (LLSq) method. A basic review of LLSq can be found in Hogg et al. (2010). Each fitted asteroid light curve contains ${N}_{\mathrm{obs}}\geqslant 20$ observations, with measured apparent magnitudes m_i and measurement uncertainties ${\sigma }_{i}$ . All instrumental magnitudes are elliptical aperture (Kron 1980) measurements (SExtractor's MAG_AUTO) calibrated with a local zero point (i.e., the "ZPVM" correction of Ofek et al. 2012a). The uncertainties contain a Poisson-noise component (SExtractor's MAGERR_AUTO), as well as systematic error from the calibration. For images lacking a relative photometric solution, the relevant systematic error is the APBSRMS parameter in the PTF database; for images having a relative photometric solution, the systematic error is a combination of the sysErr and zeroPointErr database quantities (added in quadrature).

In all cases, our model (Equation (1)) is nonlinear in at least one parameter (the period P, or equivalently the frequency $f=1/P$ ). We test ${N}_{\mathrm{frq}}$ evenly spaced frequencies between f = 0 (infinite rotation period) and f = 12 days⁻¹, i.e., up to the ∼2 hr spin barrier.

Asteroids rotating faster than the ∼2 hr spin barrier are likely monolithic objects and, particularly if larger than ∼150 m, are interesting in their own right (see the discussion in Pravec et al. 2002). However, given the apparent observed rarity of such super-fast rotators (SFRs) and the large interval in frequency space that must be searched to discover them, we impose 2 hr = 12 cycles per day as our upper limit on fitted frequency in order to make computational time reasonable without sacrificing sensitivity to the majority of asteroids' spin rates. Chang et al. (2014a) present preliminary results of an independent, ongoing effort to use PTF data (or at least specific subsets thereof) to search for SFRs, with at least one SFR having been discovered and confirmed (Chang et al. 2014b).

We use a frequency spacing ${\rm{\Delta }}f=1/(4{\rm{\Delta }}t)$ , where ${\rm{\Delta }}t$ is the time interval between the first and last observations in the light curve. Formally ${\rm{\Delta }}t$ can be as long as 1.1–1.6 yr for most asteroids (see Section 3.1.1); however, the median value of ${\rm{\Delta }}t$ (among light curves that ultimately acquired fits) is ∼45 days, with 16th and 84th percentiles of 13 and 106 days, respectively.
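The size of the resulting frequency grid follows directly from the spacing rule. The short Python sketch below builds the grid for a given time baseline; the function and argument names are illustrative. It assumes both endpoints (f = 0 and f = 12 days⁻¹) are included when the baseline permits.

```python
def frequency_grid(dt_days, f_max=12.0, oversample=4.0):
    """Evenly spaced trial frequencies (cycles/day) with spacing 1/(oversample*dt_days)."""
    df = 1.0 / (oversample * dt_days)
    n_frq = int(f_max * oversample * dt_days) + 1  # number of grid points from 0 to f_max
    return [j * df for j in range(n_frq)]

# Median, 16th-, and 84th-percentile baselines quoted in the text.
for dt in (45.0, 13.0, 106.0):
    print(dt, len(frequency_grid(dt)))
```

For the median baseline of 45 days this gives 12 × 4 × 45 + 1 = 2161 trial frequencies, so the cost of a fit scales linearly with the time span of the observations.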

In addition to the nonlinear parameter f, the light-curve model in general has ${N}_{\mathrm{lin}}$ linear parameters. We seek to solve the following tensor equation for X:

where m_i is the $i\mathrm{th}$ observation, L is the "design matrix" (a 3D array of size ${N}_{\mathrm{obs}}\times {N}_{\mathrm{frq}}\times {N}_{\mathrm{lin}}$ ), and X is the linear-parameter matrix ( ${N}_{\mathrm{frq}}\times {N}_{\mathrm{lin}}$ ) containing the linear-parameter solutions as a function of frequency.

4.1. Linear Phase-function Parameters

For the particular case wherein we use Shevchenko's model (Equation (7)) for the phase function ϕ, the design matrix is

where the k-index has been omitted with the convention that k = 1 is the first row of the above column vector, k = 2 is the second row, etc. Here ${\tau }_{i}$ and ${\alpha }_{i}$ are the time and phase angle of the $i\mathrm{th}$ observation, f_j is the $j\mathrm{th}$ frequency, etc. Likewise, the linear-parameter matrix X in this case is

where H_j is the fitted absolute magnitude for the $j\mathrm{th}$ frequency, etc.

The general LLSq solution to Equation (16) is

where ${B}^{-1}$ is the inverse of the data-covariance matrix B:

and ${S}_{{jk}{\ell }}$ is the parameter-covariance matrix, given by

where in the above definition we invert each of the ${N}_{\mathrm{frq}}$ matrices s_j, these being defined by

The elements of the parameter-covariance matrix S are the variances and covariances of the fitted parameters (as a function of frequency). The fit's residuals (as a function of frequency) are

and the fit's chi-squared (as a function of frequency) is

The frequency-dependent chi-squared ${({\chi }^{2})}_{j}$ is also known as the periodogram. Formally, the best-fit rotation frequency corresponds to the minimal value of ${({\chi }^{2})}_{j}$ , but this may differ from the preferred frequency solution if the light curve is contaminated by other systematic periodic signals, if the data suffer from underestimated measurement uncertainties, or if the best-fit frequency corresponds to a dominant first harmonic (as opposed to a preferred dominant second harmonic; see Section 3.1.2).
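The structure of this computation (a weighted linear solve repeated at every trial frequency, yielding a chi-squared periodogram) can be illustrated with a deliberately simplified Python sketch. It is not this work's MATLAB implementation. It assumes a diagonal data-covariance matrix, keeps only a constant term plus the two Fourier harmonics (the phase-function columns are omitted, so the input magnitudes are assumed to be already corrected for distance and phase angle), and uses a small pure-Python solver in place of vectorized matrix operations. All function names are hypothetical.

```python
import math

def design_row(t, f):
    """One row of the design matrix: constant offset plus two Fourier harmonics."""
    w = 2.0 * math.pi * f * t
    return [1.0, math.sin(w), math.cos(w), math.sin(2.0 * w), math.cos(2.0 * w)]

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting (None if singular)."""
    n = len(b)
    M = [A[i][:] + [b[i]] for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        if abs(M[p][c]) < 1e-12:
            return None
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            k = M[r][c] / M[c][c]
            for cc in range(c, n + 1):
                M[r][cc] -= k * M[c][cc]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][cc] * x[cc] for cc in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def periodogram(times, mags, sigmas, freqs):
    """Chi-squared of the weighted linear least-squares fit at each trial frequency."""
    w = [1.0 / s ** 2 for s in sigmas]  # diagonal inverse data covariance
    chi2 = []
    for f in freqs:
        rows = [design_row(t, f) for t in times]
        n = len(rows[0])
        # Normal equations: (L^T B^-1 L) x = L^T B^-1 m
        A = [[sum(wi * r[a] * r[b] for wi, r in zip(w, rows)) for b in range(n)]
             for a in range(n)]
        y = [sum(wi * r[a] * m for wi, r, m in zip(w, rows, mags)) for a in range(n)]
        x = solve(A, y)
        if x is None:  # e.g. f = 0, where the sine columns vanish
            chi2.append(math.inf)
            continue
        res = [m - sum(c * xc for c, xc in zip(r, x)) for r, m in zip(rows, mags)]
        chi2.append(sum(wi * e ** 2 for wi, e in zip(w, res)))
    return chi2
```

In this simplified form the f = 0 trial is degenerate and is assigned an infinite chi-squared. A production implementation would also return the fitted coefficients and their covariance at each frequency rather than the chi-squared alone.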

Figure 3 details our iterative light-curve-fitting algorithm's logic. Fitting commences as long as 20 or more "reliable" data points (see Section 2.2 and Figure 2) are associated with a light curve. Irrevocably bad data points are discarded in the first round of iterations; these include detections with 7σ or greater residuals from the initial solution. Examples of detections with such high residuals include contamination from background sources missing in the reference catalog, bad detector pixels that were not flagged by the pipeline, or spurious zero-point solutions.

**Figure 3.** Diagram detailing the logic of this work's data reduction and analysis. Includes mining the survey for known-asteroid observations, aggregation of the data into light curves, vetting of the light curves, and an application wherein phase functions are compared to color-derived asteroid taxonomy. See text for details.

In the next stage of iterations, the fit's ${\chi }^{2}$ per degree of freedom is reduced to ∼1 (formally, it is reduced until it is less than 3; see Figure 3) by gradually inflating the observations' error bars through addition of a "cosmic error," so-named because it encompasses contamination from possible errors (in all the "cosmos"). In general, the cosmic error represents the same diverse contaminating phenomena responsible for the >7σ deviations seen in the initial iterations (see previous paragraph), just to a lesser extent.

Separately, this error bar inflation compensates for our model's inability to fit each asteroid's precise periodic structure using only two harmonic terms in the Fourier series. In the limit of infinite observations and sufficiently many Fourier terms, we would ideally expect our data's error bars to reflect true Gaussian variance. However, by truncating the series at two harmonics and using sufficiently precisely calibrated photometry, we are in effect choosing to sacrifice (downsample) some of our photometric precision to obtain a formally better fit at the coarser resolution limit of the model.

To illustrate use of the cosmic error, consider the example of an eclipsing binary light curve, i.e., a rotation curve that is effectively sinusoidal except for a small interval around the phase of minimum flux, when it dips to a lower-than-predicted brightness. Observations acquired during such eclipses will have systematic negative deviations greater in absolute value than would be explained by Gaussian variance alone. Increasing the error bars of these observations will decrease the fit's ${\chi }^{2}$ without altering the value of the fitted frequency. The fitted parameters' uncertainties (for both frequency and the linear parameters) are accordingly inflated as a penalty, and the fitted amplitude will be underestimated. As detailed in Figure 3, the initial cosmic error used is 0.002 mag, and it is multiplied by a factor of 1.5 at each iteration until the ${\chi }^{2}$ is sufficiently low. If the cosmic error exceeds 0.1 mag, the fitting is aborted. If the ${\chi }^{2}$ (per degree of freedom) drops below 3 while the cosmic error is still below 0.1 mag, the fitting process concludes "successfully" (see Figure 3).
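The cosmic-error iteration described above can be sketched in Python as follows. The sketch assumes that the cosmic error is added in quadrature to each measurement uncertainty (the text does not state the form of the addition) and that the caller supplies a function returning the minimum chi-squared per degree of freedom for a given set of error bars. Both the function name and its interface are hypothetical.

```python
import math

def tune_cosmic_error(fit_chi2_per_dof, sigmas, start=0.002, factor=1.5,
                      chi2_target=3.0, max_error=0.1):
    """Inflate error bars until chi2/dof < chi2_target.

    fit_chi2_per_dof: callable taking a list of inflated sigmas and
    returning the best-fit chi2 per degree of freedom.
    Returns (cosmic_error, chi2_per_dof) on success, or None if the
    cosmic error would exceed max_error (fit aborted).
    """
    cosmic = start
    while cosmic <= max_error:
        # Quadrature addition is an assumption of this sketch.
        inflated = [math.hypot(s, cosmic) for s in sigmas]
        chi2 = fit_chi2_per_dof(inflated)
        if chi2 < chi2_target:
            return cosmic, chi2
        cosmic *= factor
    return None
```

With these settings the cosmic error takes at most ten values (0.002, 0.003, 0.0045, and so on up to about 0.077 mag) before the next multiplication would exceed 0.1 mag and the fit is abandoned.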

Concurrently, each iteration includes a test for the presence of double peaks in the folded rotation curve (only if the fitted amplitude is at least 0.1 mag). In particular, if there exist two maxima and two minima in the folded light curve, we demand that the ratio of these peaks be greater than 0.2. Such a solution is preferred (see Section 3.1.2) given our ellipsoidal shape assumption, as described by Harris et al. (2014).

Denote as ${f}_{\mathrm{best}\_\mathrm{global}}$ the frequency yielding the absolute minimum ${\chi }^{2}$ per degree of freedom value, denoted ${\chi }_{\mathrm{min}\_\mathrm{global}}^{2}$ (after the cosmic error has been tuned). If the folded light curve is single-peaked (or has only a relatively small secondary peak), then another deep minimum usually exists at the harmonic frequency ${f}_{\mathrm{best}\_\mathrm{harmonic}}=0.5\times {f}_{\mathrm{best}\_\mathrm{global}}$ , the local minimum ${\chi }^{2}$ value of which we denote ${\chi }_{\mathrm{min}\_\mathrm{harmonic}}^{2}$ . For cases wherein ${\chi }_{\mathrm{min}\_\mathrm{harmonic}}^{2}\lt {\chi }_{\mathrm{min}\_\mathrm{global}}^{2}+\mathrm{inv}\text{-}{\chi }^{2}\text{-}\mathrm{cdf}(0.95,7)$ , where $\mathrm{inv}\text{-}{\chi }^{2}\text{-}\mathrm{cdf}(p,N)$ is the inverse of the ${\chi }^{2}$ cumulative distribution function for N free parameters evaluated at p, we choose ${f}_{\mathrm{best}\_\mathrm{harmonic}}$ rather than ${f}_{\mathrm{best}\_\mathrm{global}}$ . The 1σ uncertainty interval for the best-fit frequency is then found by computing the upper and lower intersections between ${\chi }_{\mathrm{min}}^{2}+\mathrm{inv}\text{-}{\chi }^{2}\text{-}\mathrm{cdf}(0.68,7)$ and the periodogram in the vicinity of ${f}_{\mathrm{best}}$ . Note that we used N = 7 free parameters in this case, i.e., the number of elements of X_j (Equation (18)).
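The harmonic-preference rule can be sketched in Python as follows. The inverse chi-squared cumulative distribution function is computed here from the series expansion of the regularized incomplete gamma function combined with bisection, as a standard-library stand-in for a statistics package routine. The function names are hypothetical, and the sketch assumes the caller has already established that the folded light curve at the global minimum is single-peaked.

```python
import math

def chi2_cdf(x, k):
    """Chi-squared CDF with k degrees of freedom (series for the lower incomplete gamma)."""
    if x <= 0.0:
        return 0.0
    a, z = k / 2.0, x / 2.0
    term = 1.0 / a
    total = term
    n = 1
    while abs(term) > 1e-15 * abs(total):
        term *= z / (a + n)
        total += term
        n += 1
    return total * math.exp(-z + a * math.log(z) - math.lgamma(a))

def inv_chi2_cdf(p, k):
    """Inverse of chi2_cdf by bisection."""
    lo, hi = 0.0, 1.0
    while chi2_cdf(hi, k) < p:
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if chi2_cdf(mid, k) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def choose_frequency(f_global, chi2_global, chi2_harmonic, n_par=7, p=0.95):
    """Prefer the half-frequency solution when its chi2 is statistically comparable."""
    if chi2_harmonic < chi2_global + inv_chi2_cdf(p, n_par):
        return 0.5 * f_global
    return f_global

print(inv_chi2_cdf(0.95, 7), inv_chi2_cdf(0.68, 7))
```

The same `inv_chi2_cdf(0.68, 7)` threshold, added to the minimum chi-squared, defines the level at which the periodogram is intersected to obtain the 1σ frequency interval.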

4.2. Nonlinear Phase-function Parameters

Modeling the phase function ϕ with either the G or G₁₂ model (Equations (11) and (12)) introduces a second nonlinear parameter (after the frequency f), and so we must modify the equations of the previous section accordingly. We sample ${N}_{\mathrm{pha}}=200$ evenly spaced phase-function parameter values. In particular, for G we test the interval $-0.3\leqslant G\leqslant 0.7$ in steps of ${\rm{\Delta }}G=0.005$ , and for G₁₂ we test the interval $0\leqslant {G}_{12}\leqslant 1$ in steps of ${\rm{\Delta }}{G}_{12}=0.005$ .

Our approach is to modify the left-hand side of Equation (16) by defining a new matrix ${m}_{{iq}}^{\prime }$ that contains all possible phase-function-corrected observed magnitudes:

where, e.g., for the case of the G-model (Equation (11)),

The linear-parameter-solution array X now has an extra index q, reflecting the fact that we are now solving for each linear parameter as a function of the two nonlinear parameters. The design matrix has the same number of indices as before (but fewer rows):

while the linear-parameter matrix X is now

The appeal in adopting the above approach is that the general solution is only slightly modified:

where the only differences between Equations (19) and (29) are the q indices appended to X and m (and the latter being redefined as m').

The fit's residuals R are now a function of frequency and phase-function parameter:

as is the fit's chi-squared:

As a function of any of the linear parameters, the fit's ${\chi }^{2}$ varies precisely quadratically, whereas as a function of frequency it has an intricate spectral structure with many local minima. As a function of a nonlinear phase parameter (G or G₁₂), the ${\chi }^{2}$ tends to have a single minimum (on the range we evaluate): in this sense G and G₁₂ are more similar to the linear parameters than they are to frequency. However, the generally asymmetric shape of the phase parameter's ${\chi }^{2}$ dependence necessitates its grid-based numerical treatment, particularly to ensure accurate estimation of the phase parameter's uncertainty.

The two-dimensional ${\chi }^{2}$ surface given by Equation (31), which is defined on an ${N}_{\mathrm{frq}}\times {N}_{\mathrm{pha}}$ grid, can be reduced to a one-dimensional ${\chi }^{2}$ function by choosing, for each frequency index j, the phase-parameter index q that minimizes the ${\chi }^{2}$ . The result is a one-dimensional periodogram, as in Equation (24). Once the fitted frequency is identified, we compute the uncertainty in the fitted f by the method described in the previous section using the inv- ${\chi }^{2}$ -cdf() function. We then likewise numerically compute the uncertainty in the phase parameter by again collapsing ${({\chi }^{2})}_{{jq}}$ to a one-dimensional vector, this time as a function of the phase parameter with the frequency fixed at the fitted value (j-index), and use the inv- ${\chi }^{2}$ -cdf() function to estimate the uncertainty in the phase parameter.
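The two collapsing operations described above amount to a row-wise minimum followed by a slice at the best-fit row. A minimal Python sketch follows, assuming the chi-squared surface is stored as a list of rows indexed by frequency with columns indexed by phase-parameter value; the function name and return layout are illustrative.

```python
def collapse_chi2(chi2_grid):
    """Reduce a chi2 surface chi2_grid[j][q] (frequency j, phase parameter q).

    Returns the periodogram (minimum over q at each j), the minimizing q
    at each j, the best-fit frequency index, and the chi2 profile over
    the phase parameter at that frequency.
    """
    best_q = [min(range(len(row)), key=row.__getitem__) for row in chi2_grid]
    periodogram = [row[q] for row, q in zip(chi2_grid, best_q)]
    j_best = min(range(len(periodogram)), key=periodogram.__getitem__)
    phase_profile = list(chi2_grid[j_best])
    return periodogram, best_q, j_best, phase_profile
```

The periodogram is used to locate the frequency and its uncertainty, and the phase profile is thresholded in the same way to obtain the uncertainty in G or G₁₂.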

As noted in Table 1, a total of 587,466 light curves exist in PTF, where each light curve by definition consists of all reliable observations of a unique asteroid observed in a single opposition in a single photometric band. Of these, only ∼10% (59,072 light curves) have at least 20 observations and therefore qualified for fitting with our algorithm. A total of 54,296 light curves actually produced a fit—the remaining ∼5000 light curves failed to produce a fit either because some observations were discarded and the total fell below 20 data points or because the fitted cosmic error grew to exceed 0.1 mag.

Figure 4 shows several examples of light curves fitted with the algorithm described in this section. In the third column (column C) of Figure 4, we show the periodograms of each light curve. Note that although the periodogram's horizontal axes are labeled with the period (for easier interpretation), the chi-squared (per degree of freedom) values are actually plotted linearly with respect to frequency. This is because, as described earlier, our sampling is uniform with respect to frequency, and the harmonics are more easily seen with constant frequency spacing. Column (D) shows the data sampling in rotational phase versus solar phase angle, a useful plot to ensure that there is no obvious correlation between the two (which could lead to an erroneous fit, e.g., for long periods, large amplitudes, and/or few data points).

**Figure 4.** Examples of light curves having both well-sampled rotation *and* phase-function components. Each row corresponds to a different asteroid. These example asteroids are sorted vertically by their physical diameter (assuming 7% albedo); the top object is ∼45 km, and the bottom object is ∼2 km. Column A shows the phase curve (corrected for rotation); Column B shows the rotation curve (corrected for phase function); Column C shows the periodogram; Column D shows the distribution of the observations in rotational phase vs. solar phase angle. Above each plot is additional information depending on the column: (A) the asteroid number, followed by (in square brackets) the opposition year (most are 2013) and filter (in all cases "r") followed by the fitted G₁₂ parameter; (B) the fitted absolute magnitude and amplitude; (C) the fitted period (in hours); (D) the number of data points included (and shown) in the fit.

4.3. Comments on Implementation

Each iteration in the fitting of each asteroid light curve involves evaluating the arrays and tensor-products in either Equation (19) or (29). This includes inverting the data-covariance matrix B (Equation (20)) and inverting the ${N}_{\mathrm{frq}}$ matrices s_j (Equation (22)). The arrays L, m', X, and R can have a relatively large number of elements, making them and their relevant products potentially taxing with respect to computational memory.

Our particular implementation of this algorithm leverages the efficient array-manipulation capabilities of MATLAB, especially its ability to perform fast matrix multiplication and matrix inversion utilizing BLAS calls¹⁷ and OpenMP multithreaded C loop code.¹⁸ Given typical numbers of observations and frequency sampling, each of our light-curve fits (including the multiple iterations) takes on average several tens of seconds to run on an eight-core machine (multithreading enabled) and typically consumes less than ∼4 GB of memory using single-precision computation.

In a tar.gz file we provide our custom MATLAB function used for fitting the G-parameter version of the light-curve model (asteroid_lc_fit_G.m). Analogous versions exist for the Shevchenko and G₁₂ models. This function takes as input an asteroid's apparent magnitudes, magnitude uncertainties, observed epochs, phase angles, and geocentric and heliocentric distances. Its outputs include the linear-parameter-solution array (Equation (28)), residuals (Equation (30)), chi-squared array (Equation (31)), and additional information about each light-curve solution such as the amplitude and peak ratios.

5. RELIABILITY OF FITTED ROTATION PERIODS

A primary concern in the quality assessment of our fitted light-curve parameters is the validity of our derived rotation periods. In this section we describe several methods of estimating the reliability of these periods, beginning with comparison to a ground-truth subsample of known-period asteroids and followed by a full vetting of our entire sample using a combination of machine-learning and manual classification.

The fitted period may differ (slightly or significantly) between the fits using the different phase-function models. In this section for simplicity we consider only the period value obtained when fitting with the G₁₂ phase-function model (Section 3.2.3). In subsequent sections we will again consider all three ϕ models.

5.1. Known-period Subsample

A total of 927 (∼2%) of our fitted light curves belong to 805 unique asteroids having a previously measured period listed in the LCDB of Warner et al. (2009). This includes only asteroids having a quality code of 3 (highest quality) in the LCDB.

Figure 5 shows that the distribution of relative errors on our fitted frequencies is bimodal, with the left mode corresponding to periods having better than ∼3% agreement with the reference period, and the right mode corresponding to periods in disagreement with the reference period. These disagreeing fits include light curves that differ from the reference value by a harmonic (half = relative error 0.5, double = relative error 1.0), as well as frequencies that do not differ by a factor of two or any integer multiple. About 1/3 of the light curves in Figure 5 fall into the right mode and are thus considered disagreeing fits.
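The accuracy criterion can be sketched in Python as follows. The sketch assumes the relative error is the absolute frequency difference normalized by the reference frequency, which is the definition consistent with the harmonic values quoted above (half the reference frequency gives 0.5, double gives 1.0). The function names are illustrative.

```python
def relative_frequency_error(p_fit_hr, p_ref_hr):
    """|f_fit - f_ref| / f_ref, with periods given in hours."""
    f_fit, f_ref = 24.0 / p_fit_hr, 24.0 / p_ref_hr
    return abs(f_fit - f_ref) / f_ref

def is_accurate(p_fit_hr, p_ref_hr, tol=0.03):
    """True if the fitted frequency agrees with the reference to within tol."""
    return relative_frequency_error(p_fit_hr, p_ref_hr) < tol

# A fitted period of 15.7 hr against a reference of 9.7 hr is a disagreeing fit.
print(relative_frequency_error(15.7, 9.7), is_accurate(15.7, 9.7))
```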

**Figure 5.** For the 927 light curves (805 unique asteroids) having a quality code 3 period in the Light Curve Database of Warner et al. (2009) *and* an original fit in this work, we plot the distribution of the relative error in our fitted rotation frequencies with respect to the literature-referenced frequencies. The distribution is bimodal, with the left-hand mode corresponding to those fits having better than ∼3% agreement.

Figure 6 shows some examples of these disagreeing fits. Row 1 shows an apparent low-amplitude rotator, whose fitted period of 15.7 hr differs from the reference value of 9.7 hr. Row 2 is an object whose periodogram contains a great deal of noise, divided into two broad forests of frequency minima. The left forest appears to have been selected by our fitting algorithm, while the right forest seems associated with the true period of ∼2.7 hr. Row 3 contains an object whose 12% relative frequency error exceeds the 3%-accuracy threshold we have defined, and so despite appearing to be a good fit it is formally categorized as inaccurate. Row 4 also looks like a reasonable fit at 6.4 hr, but disagrees with the reference period of 11.0 hr (though the latter does have a perceptible local minimum in the periodogram). Finally, Row 5 includes a likely example of the algorithm fitting noise in the photometry of a faint asteroid.

**Figure 6.** Examples of light curves whose fitted frequency differs from the reference frequency by more than 3%, so that they fall in the right mode in the histogram shown in Figure 5 and are formally defined as inaccurate fits. *Row 1*: low-amplitude rotator. *Row 2*: incorrect period (too few observations?). *Row 3*: a fitted frequency that differs from the reference frequency by 12%. *Row 4*: period that differs by a non-integer multiple, despite looking reasonable. *Row 5*: folded light curve appears to be fitting noise in the data.

In Figures 7 and 8 (top and middle rows) we detail the distribution of the accurately recovered-period and inaccurately recovered-period subgroups in terms of eight different light-curve parameters. Some basic observations from these histograms are:

1. fitted periods are far less reliable if longer than ∼1 day or shorter than ∼2.7 hr;
2. fitted amplitudes of less than 0.1 mag correspond to the least reliably fit periods;
3. light curves consisting of observations dimmer than ∼18.5 mag are much less reliable than brighter light curves (though they are also far less numerous in the known-period sample);
4. fit ${\chi }^{2}$ (per degree of freedom) values of less than ∼1.7 correlate with less reliable periods (though they are also far less numerous in the known-period sample); note that, in the fitting process, growth of the cosmic error term ceased once the ${\chi }^{2}$ (per degree of freedom) fell below 3 (see Figure 3);
5. the number of observations in a light curve is not directly correlated to the reliability of the fitted period;
6. the ratio of the folded light curve's two peaks, the signal-to-noise ratio of the periodogram's chosen minimum, and the uncertainty in the absolute magnitude parameter are all strong indicators of the reliability of the fitted period.

**Figure 7.** *Top row*: 927-light-curve known-period sample (black), divided into the accurately fitted (green) and inaccurately fitted (red) subgroups. *Middle row*: ratio of the green to black histograms. *Bottom row*: results of cross-validation of the machine classifier (see Section 5.2.2).

The above comments reflect consideration of the one-dimensional distributions in Figures 7 and 8; however, we can easily imagine that there are correlations in more dimensions not evident from these plots alone. An obvious example would be the two-dimensional distribution in amplitude versus median magnitude: reliability is presumably greater for bright asteroids having amplitudes $\lt 0.1$ mag than it is for dim asteroids having amplitudes $\lt 0.1$ mag. Period versus amplitude is also likely an insightful distribution (and was considered, for example, by Masiero et al. 2009). The number of observations possibly would correlate with reliability if we were to restrict another parameter or parameters to some specific interval.

**Figure 8.** *Top row*: 927-light-curve known-period sample (black), divided into the accurately fitted (green) and inaccurately fitted (red) subgroups. *Middle row*: ratio of the green to black histograms. *Bottom row*: results of cross-validation of the machine classifier (see Section 5.2.2).

Rather than manually examining the period-fitting reliability as a function of all possible multi-dimensional combinations of the eight light-curve parameters detailed in Figures 7 and 8, we can take a more general approach of considering the reliability to be a single function defined on the multi-dimensional parameter space in which all of the light curves reside. We hypothesize that accurately fit light curves and inaccurately fit light curves occupy distinct regions in this multi-dimensional volume. As these volumes can overlap to some extent, we can at least estimate the probability that a light curve with that particular vector of parameters corresponds to an accurately recovered (or inaccurately recovered) period when obtained by the fitting algorithm of Section 4.

There are two general ways of accomplishing this goal. One way is to produce a large number of synthetic light curves filling out the multidimensional light-curve-parameter space, subject these synthetic light curves to our fitting algorithm, and thereby map out, e.g., by binning and interpolation, the fit reliability throughout the multi-dimensional volume. This method requires us to accurately simulate all sorts of varying sampling cadence as well as measurement uncertainties, including contributions from both systematics and noise, and it requires significant extra computing time to actually subject the synthetic data to our fitting procedure. The second method—the approach we take in this work—uses a ground-truth sample (the known-period light curves already described in this section) to train a machine classifier to discriminate reliable versus unreliable fits within the multi-dimensional light-curve-parameter space.

5.2. Machine Learning

We adopt a supervised ensemble-method approach for classification, originally popularized by Breiman et al. (1984), specifically the random forest (RF) method (Breiman 2001). RF classification has extensive and diverse applications in many fields (e.g., economics, bioinformatics, sociology). Within astronomy in particular RF classification is one of the more widely employed methods of machine learning, though many alternatives exist. For example, Masci et al. (2014) use the RF method for variable-star light-curve classification, while others have approached this problem via the use of, e.g., support vector machines (Woźniak et al. 2004), Kohonen self-organizing maps (Brett et al. 2004), Bayesian networks and mixture-models (Mahabal et al. 2008), principal component analysis (Deb & Singh 2009), multivariate Bayesian and Gaussian mixture models (Blomme et al. 2011), and thick-pen transform methods (Park et al. 2013).

For general descriptions of RF training and classification, we refer the reader to Breiman (2001), Breiman & Cutler (2004), and the many references cited by Masci et al. (2014). Our use of an RF classifier is particularly motivated by its already-proven application to the discovery and classification of astrophysical transients in the same PTF survey data (Bloom et al. 2012), as well as streaking near-Earth asteroid discovery in PTF data (A. Waszczak et al. 2015, in preparation).

Machine-learning application generally consists of three stages: training, cross-validation, and classification. In the training stage of building a machine classifier, the multi-dimensional parameter space is hierarchically divided into subspaces called nodes; these nodes collectively comprise a decision tree. The smallest node—also known as a leaf—is simply an individual data point (in our case, a single light curve). Given a set of leaves with class labels, one can build an ensemble of trees (called a forest), each tree representing a unique partitioning of the feature space, wherein the nodes are split with respect to different randomly chosen subsets of the parameter list. Each node splitting attempts to maximize the separation of classes between the sub-nodes. Serving as a model, in the subsequent classification stage the forest allows one to assign a probability that a given vector of features belongs to a given class. During cross-validation (an essential early stage in this process), the training and classification steps are repeated many times, each time using different subsamples (of labeled data) as the training data and testing data. Cross-validation evaluates the classifier's performance and ensures that it is not overfitting the training data.

For our light curves, we are interested in a binary classification, i.e., whether the fitted period is accurate ("real") or inaccurate ("bogus"). Bloom et al. (2012) coined the term realBogus to describe this binary classification probability in the context of extragalactic transient identification. In the present work we are essentially adapting Bloom et al.'s realBogus concept to the problem of light-curve-period reliability assessment.

We employ a MATLAB-based Random Forest classifier,¹⁹ which is a port of the original RF software (originally implemented in R). This software includes two main functions, which perform the training and classification steps separately.

5.2.1. Classifier Training

Our training data consist of the known-period light curves (see the previous section) belonging to the two classes under consideration: 618 light curves having accurately fit rotation periods and 309 light curves having inaccurately fit periods. Membership in one class versus the other depends on our arguably arbitrary 3% relative accuracy threshold, though we claim that the clearly bimodal shape of the distribution in Figure 5 justifies this 3% criterion. We note also that the classifier ultimately only provides a probability that a given light curve belongs to one class or the other, so that objects very near to the 3% cutoff may conceivably correspond to classification probabilities close to 0.5.

An important point is that the "ground-truth" reference periods we have taken from the database of Warner et al. (2009) may include some number of inaccurate periods. Such periods may be the product of erroneous fitting on the part of any one of its many different contributors, each of whom may employ a different fitting procedure and/or adhere to different confidence criteria. For the sake of this work, however, we consider all quality code 3 periods to be accurate—any actual inaccuracy will contribute to decreased classifier performance.

Besides ground-truth periods that are simply inaccurate, we also, in principle, risk contamination from reference periods that are no longer accurate. We assume that the majority of asteroids' periods are not changing with time, at least not at levels measurable with our data. For instance, direct measurement of the YORP mechanism in at least one asteroid (Lowry et al. 2007) reveals a relative rotation period change of several parts per million over several years. Any measurable period changes would likely be due to recent collisional events. The case of asteroid 596 Scheila (Bodewits et al. 2011) demonstrates that detectable collisional events among main-belt asteroids do occur on a relatively regular basis, though even this robustly detected collision imparted no measurable change in the asteroid's spin rate (Shevchenko et al. 2013).

Although Figures 7 and 8 detail the period-fitting reliability as a function of only eight light-curve parameters, we construct our classifier using 12 additional parameters, for a total of 20 light-curve parameters. In the context of machine learning these parameters are known as features. The 20 features we use were chosen on the basis of their availability (most are output directly by the fitting process and do not require additional computation), as well as their actual importance (as computed during the cross-validation tests described in the next section).

Figure 9 visualizes the two-dimensional correlation coefficients for all possible pairs of the 20 light-curve features. Overall, the correlation structure of the training sample qualitatively resembles that of the full data set, implying that the training set fairly well represents the overall data set in terms of its feature-space structure. On the other hand, the distributions (e.g., median value, range of values) of individual features in the training set do not necessarily match the distributions in the full data set. An obvious example is that the full data set contains far more faint asteroids than does the training sample, even though in both cases the median magnitude (medMag) is positively correlated with quantities like rmsFit (due to Poisson noise) and hMagRef (since larger asteroids tend to be brighter).

Our 20 light-curve features are listed in Table 2, in order of decreasing importance. Most of these quantities we have discussed already in previous sections in the context of our model and fitting procedure. The list also includes two features characterizing the magnitude distribution of the folded light curve: (1) Stetson's K-index, a measure of the kurtosis borrowed from variable-star light-curve analysis (Stetson 1996), and (2) a "cusp index" that quantifies the extent to which the dimmest 10% of the data points in the folded light curve deviate from the best fit relative to the other 90% of the data points. We designed the cusp index to potentially identify eclipsing systems that are poorly fit by the two-term Fourier approximation but nonetheless may have accurately fit periods (examples of light curves with such cusp-like minima appear in Figure 10). Eclipsing binaries would be most properly treated with a different model entirely, as would tumbling asteroids (which we also did not systematically try to identify in the data, and which probably lack reliable light-curve solutions when subjected to this work's algorithm).

Table 2. Summary of the 20 Light-curve Parameters (Features) Used by Our Period-Quality Classifier

Feature	Importance (%)	Description
`peakRatio`	11.1	Ratio of the heights of the fitted light curve's two peaks (each height $=\;\mathrm{max}-\mathrm{min}$ ). Zero if only one peak, one if exactly the same height.
`amplitude`	10.2	Fitted amplitude of the folded light curve. Equivalent to the height ( $\mathrm{max}-\mathrm{min}$ ) of the larger of the two peaks.
`periodFit`	8.6	Rotation period value obtained using this work's data and fitting algorithm.
`freqSNR`	8.4	Signal-to-noise ratio of the fitted (minimum) frequency in periodogram = $2\times | \mathrm{min}-\mathrm{median}| /(84\mathrm{th}\;\mathrm{percentile}-16\mathrm{th}\;\mathrm{percentile})$
`hMagErr`	5.8	Uncertainty in the fitted H-magnitude (i.e., error in the fitted absolute magnitude)
`a12Coeff`	4.3	Fourier coefficient A₁₂
`a22Coeff`	4.2	Fourier coefficient A₂₂
`numObsFit`	4.1	Number of observations in the final fitted light curve, after discarding any bad observations
`medMag`	4.1	Median calibrated magnitude (in the photometric band specific to the light curve, either R or g)
`chisq`	4.1	Reduced chi-squared of the fit (i.e., ${\chi }^{2}$ per degree of freedom)
`a21Coeff`	4.0	Fourier coefficient A₂₁
`a11Coeff`	3.8	Fourier coefficient A₁₁
`rmsFit`	3.8	Root-mean-squared residual of the fit
`hMagRef`	3.8	Reference H-magnitude (i.e., absolute magnitude of the asteroid in V band as listed by the MPC)
`kIndex`	3.7	Stetson's K-index (a measure of kurtosis in the magnitude distribution of a folded light curve, introduced by Stetson 1996)
`freqResol`	3.7	Resolution of the periodogram: ${\rm{\Delta }}f=1/(4{\rm{\Delta }}t)$ , where ${\rm{\Delta }}t$ is the time between the first and last observations in the light curve
`hMagResid`	3.7	Difference between the reference absolute magnitude (`hMagRef`) and the fitted H-magnitude
`cuspIndex`	3.6	"Cusp index": Median squared residual of the dimmest 10% points divided by the median squared residual of all other points
`numObsRem`	2.9	Number of observations removed during the fitting process (due to >7σ residuals with respect to preliminary fits)
`cosmicErr`	2.1	Final "cosmic error" value at end of fitting process ( $\lt 0.1$ mag in all cases)

Note. See text for a discussion of the cross-validation-derived importance value (Section 5.2.2).
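
As a rough illustration of how two of the less familiar features in Table 2 could be computed, the following Python sketch implements the `freqSNR` and `cuspIndex` definitions literally. The function names, the linear percentile interpolation, the rounding used for the 10% cut, and the assumption that the periodogram is supplied as a list of reduced χ² values are ours; none of this is taken from the pipeline used in this work.

```python
import statistics

def percentile(values, q):
    """Linearly interpolated q-th percentile (0 <= q <= 100)."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)

def freq_snr(chisq_values):
    """2 * |min - median| / (84th percentile - 16th percentile)."""
    spread = percentile(chisq_values, 84) - percentile(chisq_values, 16)
    depth = abs(min(chisq_values) - statistics.median(chisq_values))
    return 2.0 * depth / spread

def cusp_index(mags, residuals):
    """Median squared residual of the dimmest 10% of points divided by
    the median squared residual of all other points."""
    # Larger magnitude means dimmer, so sort in descending magnitude.
    order = sorted(range(len(mags)), key=lambda i: mags[i], reverse=True)
    n_dim = max(1, round(0.1 * len(mags)))  # assumed rounding rule
    dim = [residuals[i] ** 2 for i in order[:n_dim]]
    rest = [residuals[i] ** 2 for i in order[n_dim:]]
    return statistics.median(dim) / statistics.median(rest)
```

A cusp index well above one would indicate that the faintest points are fit markedly worse than the rest of the folded light curve.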


**Figure 9.** Correlation matrices (Spearman's ρ coefficient) for the 20 light-curve features (Table 2) in the training sample (left) and in the full data set (right).

**Figure 10.** Examples of reliable light curves whose folded rotation curves include cusp-like minima (systematic negative deviation from the second-order Fourier fit at minimum brightness), suggestive of a binary system. Many more examples exist in our light curves; however, in this work we have not specifically flagged such light curves. Future works will more carefully label and analyze this particular class of objects.

5.2.2. Classifier Cross-validation

To ascertain the trained classifier's capabilities, and to ensure that the classifier is not overfitting the training data, we perform a series of 1000 cross-validation trials. In each trial we split each class (accurate fits and inaccurate fits) into a training subsample (a randomly chosen²⁰ 80% of the class) and a test subsample (the remaining 20% of the class). We then train a classifier using the combined training subsamples and subsequently employ the classifier on the combined test subsamples. In each of the trials, the classifier outputs a classification probability (score) for each object in the test sample, and we track the TPR (fraction of accurate period fits that are correctly classified above some threshold probability) as a function of the FPR (fraction of inaccurate period fits that are incorrectly classified above said threshold probability). See Figure 11 for a summary of these terms.
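
The bookkeeping of a single cross-validation trial can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the function names are hypothetical, labels are encoded as `True` (accurate fit) and `False` (inaccurate fit), and the classifier itself is left out, so `scores` stands in for whatever probabilities a trained classifier would return for the test subsample.

```python
import random

def stratified_split(labels, train_frac=0.8, rng=None):
    """Return (train_idx, test_idx), splitting each class separately."""
    rng = rng or random.Random()
    train_idx, test_idx = [], []
    for cls in sorted(set(labels)):
        members = [i for i, lab in enumerate(labels) if lab == cls]
        rng.shuffle(members)
        n_train = round(train_frac * len(members))
        train_idx.extend(members[:n_train])
        test_idx.extend(members[n_train:])
    return train_idx, test_idx

def tpr_fpr(scores, labels, threshold=0.5):
    """TPR and FPR when objects with score > threshold are called accurate."""
    n_pos = sum(1 for lab in labels if lab)
    n_neg = len(labels) - n_pos
    tp = sum(1 for s, lab in zip(scores, labels) if lab and s > threshold)
    fp = sum(1 for s, lab in zip(scores, labels) if not lab and s > threshold)
    return tp / n_pos, fp / n_neg
```

Repeating the split with fresh random draws, and evaluating `tpr_fpr` over a grid of thresholds in each trial, would give the mean and scatter of each point along a TPR-versus-FPR curve.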

**Figure 11.** Definitions of true vs. false and positive vs. negative labels. TPR is sometimes called the *completeness* or *sensitivity*, while FPR is otherwise known as the *false-alarm rate*, one minus the *reliability*, or one minus the *specificity*.

The results of the cross-validation are shown in Figure 12. By tuning the minimum classification probability used to threshold the classifier's output, one effectively moves along the hyperbola-shaped locus of points in TPR-versus-FPR space seen in the plot. Several points have labels (p = ...) indicating the corresponding threshold probability (adjacent points being separated by ${\rm{\Delta }}p=0.05$ ). The error bars in Figure 12 represent the standard deviation of the location of each point over all 1000 trials, while the point centers are the average locations.

**Figure 12.** TPRs vs. FPRs for the cross-validation trials. Such a plot is sometimes referred to as a *receiver operating characteristic* (ROC) curve. Each trial trains the classifier using a randomly chosen 80% of the known accurate fits and 80% of the known inaccurate fits among the 927 light curves that have reference periods. The remaining 20% of the light curves serve as the test sample. Moving along the hyperbolic locus of points in this plot is equivalent to tuning the classification probability threshold from zero (upper right of the plot, where every light curve is classified as accurate) to one (lower left of the plot). The error bars represent the scatter in the 1000 cross-validation trials.

A classification threshold of $p\gt 0.5$ is conventionally used when quoting single FPRs and TPRs. In our case, this gives FPR $=\;0.45\pm 0.07$ with TPR $=\;0.89\pm 0.03$ . The contamination of positively classified light curves in the cross-validation trials depends also on the actual class ratios in the sample being classified. In particular, since $\sim 1/3$ of our known-period light curves are inaccurate fits (Figure 5), it follows that among all light curves the classifier labels as accurate fits, the contaminated fraction is $(0.45\times 1/3)/(0.89\times 2/3+0.45\times 1/3)\approx 1/5$ . If instead of using the classifier we just randomly labeled some fraction of the light curves as accurate and the rest as inaccurate, the resulting contamination would be $1/3$ (i.e., worse than the $1/5$ afforded by the classifier, as expected).
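
The contamination arithmetic in the preceding paragraph can be checked with a short Python sketch. The function name and argument names are ours; the inputs are the TPR, FPR, and inaccurate-fit fraction quoted in the text.

```python
def contamination(tpr, fpr, frac_inaccurate):
    """Fraction of positively classified light curves that are in fact inaccurate."""
    false_pos = fpr * frac_inaccurate
    true_pos = tpr * (1.0 - frac_inaccurate)
    return false_pos / (true_pos + false_pos)

# Classifier at the p > 0.5 threshold, with ~1/3 of the sample inaccurate.
print(round(contamination(0.89, 0.45, 1 / 3), 3))

# Random labeling approves both classes at the same rate, so the
# contamination simply equals the inaccurate fraction of the sample.
print(round(contamination(0.5, 0.5, 1 / 3), 3))
```

Because the result depends on `frac_inaccurate`, the same TPR and FPR would imply a different contamination in a sample whose class ratio differs from that of the known-period light curves.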

Several parameters can be adjusted or tuned when training an RF classifier. First is the number of decision trees generated during the training stage. Classification accuracy typically increases with the number of trees and eventually plateaus. Most applications employ hundreds to thousands of trees; we here use 1000 trees. Another tunable parameter is the number of randomly selected features (out of the 20 total here considered) with respect to which nodes are split in building the decision trees. Breiman (2001) recommends using the square root of the number of features. We ran the cross-validation for all possible numbers of features with respect to which the nodes can be split (i.e., all numbers between 1 and 20). The results are in Figure 13. We chose n = 4 as the number of features to split, both because the classifier's performance plateaus after that value and because it follows the recommendation of Breiman (2001), i.e., $4\approx \sqrt{20}$ features.

**Figure 13.** Varying the number of features that are randomly split per node in the decision-tree-building process affects both the TPR and FPR. The values plotted here correspond to the $p\gt 0.5$ classification threshold; each point was generated by the exact same process for which the results in Figure 12 were generated, only varying the number of features with respect to which nodes are split. In the left plot, the first four points are labeled with the number of features for that trial (for $n\gt 4$ we omit the label). In our actual implemented model (Figure 12) we chose n = 4 features, the value after which the TPR/FPR ratio plateaus at approximately 2, and also the value Breiman (2001) recommends, i.e., the square root of the total number of features (in our case, $\sqrt{20}\approx 4$ ).

Other parameters that can be tweaked are the maximum depth of a tree, the minimum number of samples per leaf, the minimum number of samples used in a split, and the maximum number of leaf nodes. We do not constrain any of these parameters, meaning we allow trees of any depth, with any number of leaf nodes, leaf nodes consisting of a single sample, and splits based on the minimum of two samples. We note that as a result our model optimization is not comprehensive and it is possible that a better classifier could be trained. However, the relatively small training sample size here is likely the limiting factor; additional data are necessary to substantially improve the classifier performance.

In the bottom rows of Figures 7 and 8, we detail the dependence of the TPR and FPR on various light-curve parameters. Averaging (marginalizing) over any of the x-axis quantities in these bottom-row plots (while also weighting each bin by the number of light curves it contains; see the top row of plots in Figures 7 and 8) produces precisely the TPR and FPR values of the p = 0.5 data point in Figure 12.

In addition to the TPR and FPR estimates, cross-validation allows us to quantify the relative importance of the features by computing the average depth in the trees at which a split was performed with respect to each feature. Those features with respect to which the training sample is consistently divided early in the building of each tree are deemed more important (i.e., more discriminating) than those features that are split later, as the tree-building process tries to maximize the separation of the classes as early as possible by splitting features in an optimal sequence. Both Table 2 and Figure 9 list the features in order of importance.

Note that we had manually guessed several of the most important features—namely, peakRatio, freqSNR, and hMagErr—prior to any machine-learning work via inspection of the plots in Figure 8. The numerical importance values thus agree with these initial observations and also quantify the significance of features that would be difficult to ascertain manually. For instance, numObsFit appears (in Figure 8) not to be related to the fitting accuracy, while medMag (Figure 7) does appear related to accuracy (fainter light curves being less accurate), yet these two features evidently have equal importance in the classification process (see Table 2). Figure 9 indicates that numObsFit and medMag have quite different correlation relationships with respect to more important features. Hence, it would not be surprising if their one-dimensional distributions (in Figures 7 and 8) bear no resemblance to the multi-dimensional distributions on which the decision trees are defined and in which these two parameters apparently carry comparable weight.

5.2.3. Machine-vetted Light Curves

Having trained the machine classifier as described in Section 5.2.1, we use it to predict the validity of our remaining ∼53,000 fitted periods (of ∼48,000 unique asteroids) that lack quality code 3 reference periods in Warner et al. (2009). The automated classifier assigned positive reliability scores ( $p\geqslant 0.5$ ) to 19,112 of the light curves (35% of the total data set). Figure 14 details the distribution of the light curves (raw-fitted, machine-vetted, and other subsets) with respect to some of the most important light-curve features.

With respect to rotation period (Figure 14 panel (A)), the classifier rejects the largest fractions of light curves in the long-period (≳1 day) and short-period (≲2.7 hr) bins. From Figure 7 (bottom row, leftmost column), we know that the classifier's completeness does not drop significantly for these long- and short-period objects, nor is the FPR higher among them. Hence, we have reason to trust the classifier's heavy rejection of periods in these bins, and therefore we conclude that our fitting algorithm (Section 4) is prone to erroneously fitting periods in these period extremes (as was also suggested in the known-period sample in Figure 7).

Panel (C) shows that the mode of the apparent-magnitude (medMag) distribution for machine-approved light curves is ∼19 mag, as compared to the predominantly $V\lesssim 17$ mag known-period training sample. Comparing this to Figure 2 panel A shows that the limiting magnitude of reliable light curves is comparable to that of individual detections.

Panel (E) of Figure 14 shows that the raw output of our fitting process contains peak-ratio values that are uniformly distributed above 0.2, this particular value being a hard-coded threshold that double-peaked light curves (at least those with amplitudes >0.1 mag) output by our fitting algorithm must satisfy (see Figure 3 and Section 4.1). The classifier's output clearly indicates that reliability is linearly related to the peak ratio, as was also prominently seen in Figure 8. Because Figure 8 indicates that the classifier's TPRs and FPRs also relate linearly with peakRatio, we conclude that the slope of the peakRatio distribution for the machine-vetted light curves is likely an upper limit for the true slope.

5.3. Manual Screening

In addition to machine-based vetting, we manually inspected all 54,296 of the light curves that were output by our fitting process. A human screener first studies the ground-truth known-period examples (Section 5.1) in an effort to learn to distinguish between accurate and inaccurate fits. Only the G₁₂ fit is considered (as was the case with the automated classifier), and for each light curve the screener inspects precisely the amount of information included, for example, in Figures 4, 6, and 10 of this paper. Specifically, for each light curve the screener views a row of four plots: (1) the rotation-corrected phase curve, (2) the phase-function-corrected folded rotation curve, (3) the periodogram, i.e., the reduced ${\chi }^{2}$ plotted linearly against frequency (labeled however with the corresponding period), and (4) the rotational-phase versus phase-angle plot. A single screener is presented with these plots through a plain-formatted webpage, allowing for efficient scrolling through the light curves and rapid recording of either a "reliable" or "unreliable" rating for each fitted period. In addition, all light curves in the known-period sample were reinserted into the screening list, with their reference periods removed. These were thus blindly assessed by the screener, independent of their formal (3% accuracy) classification status.

The black lines in Figure 14 plot the results of the manual screening, in which a total of 10,059 light curves (19% of the total set) were deemed "reliable." Relative to the machine-approved sample, the human-rated sample is smaller by a factor of between ∼1 and ∼2 in every bin of the features examined in Figure 14. In general, the shapes of the machine-approved and human-approved distributions match fairly closely. Figure 15 shows examples of light curves for which the machine- and human-based classifiers differed in their rating (we focus on very short and very long fitted periods in Figure 15, but many examples exist for intermediate periods as well).

**Figure 14.** Distributions of PTF-fitted light curves (and various subsets thereof) in select features/parameters. These plots are histograms with the same binning as the top rows of Figures 7 and 8. For better readability we here use line-connected bin points (rather than the stair-plot format used in, e.g., Figure 5).

**Figure 15.** Example light curves for which the machine-based and human-based reliability scores differ. *Row 1*: human approved, machine rejected (p = 0.32). *Row 2*: human rejected, machine approved (p = 0.66). *Row 3*: human approved, machine rejected. For this object, the fitted period differs from the known reference period of 392 hr by 7%; hence, the machine rejects it by definition. *Row 4*: human rejected, machine approved (p = 0.70).

5.4. Asteroids with Multiple Fitted Periods

A total of 654 unique asteroids have more than one PTF light curve whose fitted period was labeled as reliable by the vetting process described in the previous sections. These 654 asteroids collectively have 1413 fits (so that the average multiplicity is ∼2.2 fits per asteroid) and include objects either observed in multiple oppositions and/or in both filters during one or more oppositions. Figure 16 plots the distribution of the relative error in the fitted frequencies of all such multiply fit asteroids, this error being defined as the range of the asteroid's fitted frequencies divided by the geometric mean of its fitted frequencies. Just as in Figure 5 (when we compared to literature-referenced frequencies), we see a prominent mode in the histogram peaking at ∼0.1% relative error, with some excess for errors greater than ∼3%. There are 63 asteroids in particular with relative errors greater than 3%; of these, only four asteroids have more than two fits. If we assume that, in the remaining 59 pairs of disagreeing periods, one of the periods is correct (so that each pair contributes one incorrect fit), then the contamination fraction of light curves based on the sample of multiply fit asteroids is $\sim 59/1413=4$ %.
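
The relative-error statistic defined above is simple to compute for any asteroid with two or more fitted frequencies. The Python sketch below uses hypothetical function names and treats the 3% value as the consistency cutoff; it is an illustration of the definition, not the code used in this work.

```python
import math

def relative_frequency_error(freqs):
    """Range of the fitted frequencies divided by their geometric mean."""
    geo_mean = math.exp(sum(math.log(f) for f in freqs) / len(freqs))
    return (max(freqs) - min(freqs)) / geo_mean

def is_consistent(freqs, threshold=0.03):
    """True if the fits agree to within the assumed 3% relative error."""
    return relative_frequency_error(freqs) <= threshold
```

For a pair of fits, this statistic takes the same value whether it is evaluated on frequencies or on periods, since inverting both values leaves the ratio of range to geometric mean unchanged.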

**Figure 16.** For the 654 unique asteroids having more than one reliable light-curve fit (either multiple oppositions and/or both R and g-band data) we plot the log of the relative frequency error, defined as the range of the asteroid's fitted periods divided by the geometric mean of its fitted periods. Comparison with Figure 5 suggests that we can deem all cases with error ≲3% as consistently recovered periods, and those with greater than 3% error as inconsistent fits.

6. PRELIMINARY LIGHT-CURVE-BASED DEMOGRAPHICS

In this section we perform a preliminary analysis and interpretation of the demographic trends evident in this work's fitted light-curve parameters. Forthcoming works and papers will more closely examine the population distributions of both rotation and phase-function parameters.

Throughout this section we repeatedly examine variation of light-curve-derived parameters as functions of color index and infrared-derived diameters. In the Appendix we describe the aggregation and characteristics of these two custom data sets (compiled from external sources). The color index quantifies an asteroid's probability of membership in the C-type (p = 0) versus S-type (p = 1) color-based clusters. Objects that in fact belong to neither C nor S groups (e.g., V types, D types) will have color indices near p = 0.5 provided that they are in fact separated from both the C-type and S-type clusters in the 2D color spaces considered (see Appendix).

There are many interesting demographic questions addressable with these light-curve data that—in the interest of space—we do not treat in this work. For example, one could examine relationships between light-curve parameters and orbital elements and/or family membership, proximity to resonances, and so on. We are making all of these light-curve data available electronically (Tables 4 and 5; see Appendix A.3) so that the community may use these data to help explore such science questions.

6.1. Disclaimer Regarding De-biasing

The preliminary demographic analyses that follow do not take into account fully de-biased distributions of, e.g., spin rates, amplitudes, or phase-function parameters. The TPRs and FPRs given in the bottom row of plots in Figures 7 and 8 (also, the blue and violet lines in Figure 14) constitute some of the necessary ingredients for producing a fully de-biased data set; however, in this work we do not attempt to compute the de-biased distributions.

6.2. Rotation Rates and Amplitudes

In Figure 17 we reproduce several of the plots appearing in Pravec et al. (2002 and references therein), using this work's much larger data set (characterized by at least an order of magnitude larger sample of small objects). Both spin rate and amplitude are examined for the 4040 objects having diameter data from infrared surveys. Unlike Pravec et al. (2002), we are not able to individually plot each light curve's data (the ∼4000 points would make the plot difficult to render, as well as difficult to read); hence, we plot these (and other relationships later in this section) using two-dimensional histograms where the intensity of each pixel corresponds to the number of objects in that bin (darker means more, with linear scaling). Additionally, 2D histograms for which the diameter is plotted on the horizontal axis have their pixel values column-normalized, i.e., all pixels in each column of the histogram sum to the same value. This facilitates the visual interpretation of period and amplitude variation with diameter, as the left-hand side (small-diameter end) of the plots would otherwise saturate the plot.
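
Column normalization of a two-dimensional histogram can be sketched as follows. The row/column layout (rows as spin-rate or amplitude bins, columns as diameter bins), the function name, and the choice to normalize each column to unit sum are our assumptions for illustration.

```python
def column_normalize(hist):
    """Return a copy of hist[row][col] in which each non-empty column sums to 1."""
    n_rows, n_cols = len(hist), len(hist[0])
    col_sums = [sum(hist[r][c] for r in range(n_rows)) for c in range(n_cols)]
    return [
        [hist[r][c] / col_sums[c] if col_sums[c] else 0.0 for c in range(n_cols)]
        for r in range(n_rows)
    ]

# A heavily populated small-diameter column next to a sparse large-diameter one.
counts = [[90, 1],
          [10, 3]]
print(column_normalize(counts))
```

After normalization both columns span the same intensity range, so a sparsely populated diameter bin is as legible as a densely populated one.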

**Figure 17.** *Panel (A)*: distribution of spin rate and amplitude as functions of infrared-derived diameters (see Appendix for diameter data sources), including data for 4040 of our light curves. The two-dimensional histograms (left side plots) are column normalized (see text for details). *Panel (B)*: comparison of the period vs. amplitude distribution (regular 2D histogram, *not* column normalized) with max-spin-rate vs. amplitude for a uniform density ellipsoid held together solely by self-gravity.

Following Pravec et al. (2002), we include the geometric mean rotation frequency as computed from a running bin centered on each object. The half-width of the bin centered on each object is either 250 (data points) or the object's distance from the top or bottom of the sorted diameter list, whichever is smallest. This ensures that the geometric mean is not contaminated at the edges of the plot by the interior values, though it also means that more noise exists in these edge statistics. The geometric mean is the more intuitive statistic for the rotation period as compared to the arithmetic mean, since the rotation periods tend to span several orders of magnitude. In addition to the geometric mean, we plot the 16th and 84th percentile values from each running bin.
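
A minimal Python sketch of such a running geometric mean is given below. It assumes the input values (e.g., rotation frequencies) are already sorted by diameter, and the function name is hypothetical. The 16th and 84th percentiles could be taken from the same windows.

```python
import math

def running_geometric_mean(values, max_half_width=250):
    """Geometric mean in a bin centered on each element of a diameter-sorted list.

    The half-width is max_half_width or the element's distance from either
    end of the list, whichever is smallest, so the bin is always symmetric.
    """
    n = len(values)
    logs = [math.log(v) for v in values]
    means = []
    for i in range(n):
        half = min(max_half_width, i, n - 1 - i)
        window = logs[i - half : i + half + 1]
        means.append(math.exp(sum(window) / len(window)))
    return means
```

At the two ends of the list the window shrinks to a single object, which is why the edge statistics are noisier than those in the interior.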

The basic observed trend regarding rotation rate is that smaller-diameter asteroids rotate faster on average. A slight increase in the rotation rate also appears for objects larger than ∼80 km. Binning the data into a coarser set of three diameter bins and normalizing each object's spin rate by the local geometric-mean rate, we see a progression from a near-Maxwellian distribution to a progressively non-Maxwellian distribution for smaller objects. The rotation rates of a collisionally equilibrated population of rotating particles are known to approach that of a Maxwellian distribution (e.g., Salo 1987), which for a population of N objects as a function of rotation frequency f is

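$$n(N,f,{f}_{\mathrm{peak}})=\frac{4N}{\sqrt{\pi }}\,\frac{{f}^{2}}{{f}_{\mathrm{peak}}^{3}}\,\mathrm{exp}\left(-\frac{{f}^{2}}{{f}_{\mathrm{peak}}^{2}}\right),\qquad (32)$$

where $n(N,f,{f}_{\mathrm{peak}}){df}$ is the number of objects in the interval $(f,f+{df})$ and ${f}_{\mathrm{peak}}$ is the peak frequency (i.e., the frequency corresponding to the distribution's maximum).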

One way of testing how well a Maxwellian actually fits the data is the two-sided Kolmogorov–Smirnov test (Massey 1951). This test compares an empirical distribution to a reference distribution (e.g., Gaussian, Maxwellian, or another empirical sample) via a bootstrap method. In particular, it computes a statistic quantifying the extent to which the cumulative distribution function differs in the two distributions being compared. In our case, we use Equation (32) to simulate a large sample (10⁵) randomly drawn from an ideal Maxwellian distribution and compare this simulated sample against the 99-asteroid sample (of $D\gt 40$ km) frequencies. Interestingly, this test indicates that our 99 large-asteroid normalized frequencies differ from a Maxwellian at nearly the 10σ confidence level, though this could be due in part to the lack of a proper de-biasing of the distribution (see Section 6.1).
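
The comparison described above can be sketched in Python using only the standard library. The function names are hypothetical, and drawing Maxwellian variates as the norm of three independent Gaussians is our choice of sampling method, not necessarily the one used in this work. The sketch computes only the Kolmogorov–Smirnov statistic; converting it to a significance level is left out.

```python
import bisect
import math
import random

def draw_maxwellian(n, f_peak=1.0, rng=None):
    """Draw n values from a Maxwellian whose maximum lies at f_peak."""
    rng = rng or random.Random()
    sigma = f_peak / math.sqrt(2.0)  # per-component width that puts the mode at f_peak
    return [
        math.sqrt(sum(rng.gauss(0.0, sigma) ** 2 for _ in range(3)))
        for _ in range(n)
    ]

def ks_statistic(sample_a, sample_b):
    """Largest absolute difference between the two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for x in a + b:  # the difference can only change at a data point
        cdf_a = bisect.bisect_right(a, x) / len(a)
        cdf_b = bisect.bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d
```

In use, `sample_a` would hold the normalized frequencies of the observed asteroids and `sample_b` a large simulated sample such as `draw_maxwellian(100000)`.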

All of these trends, including the qualitative resemblance to a Maxwellian despite formal disagreement with it, were noted previously by Pravec et al. (2002). At the time their $D\lt 10$ km size bin contained data on only 231 objects, as opposed to our sample of 2844 asteroids with $D\lt 10$ km. Conversely, our $D\gt 40$ km bin contains only 99 objects as compared to the ∼400 large asteroids they took into consideration in comparing to a Maxwellian.

Steinberg & Sari (2015) recently described how collisional evolution of large asteroids should actually lead to a Lévy distribution, which has a significantly longer tail than a Maxwellian distribution having the same peak. They compared their theory to spin rates of $D\geqslant 10$ km asteroids from the LCDB and found in general that the Lévy distribution fails to fit the spin distribution of large asteroids, suggesting that there may be a significant primordial component to the spin distribution. Potential primordial contributions to the angular momentum of asteroids were explored by Harris & Burns (1979) and later authors.

Our amplitude distribution contains an obvious observational bias (see Section 6.1) in that amplitudes less than ∼0.1–0.2 mag are generally ill fit by our modeling procedure (see Figure 7) and thus significantly underrepresented in our sample of reliable light curves considered here. Nonetheless, we see a clear trend of smaller asteroids exhibiting larger rotational amplitudes, consistent with the idea that larger bodies have sufficient surface gravity to redistribute any loose mass to a more spherical shape.

As we have done for the normalized frequency distribution, we plot diameter-binned normalized amplitudes against a Maxwellian distribution, this time merely to guide the eye as opposed to validating any hypothetical physical interpretation. The fact that the normalized amplitude distributions do not deviate too drastically from the Maxwellian shape at smaller diameters indicates that the spread in the amplitude distribution is proportional to its mean value, a basic property of the Maxwellian distribution. Carbognani (2010) provides a recent analysis of asteroid rotation amplitudes and highlights a similar increase in both the amplitude's mean and spread with decreasing diameter.

Panel (B) of Figure 17 shows the distribution in period-versus-amplitude space, in which we can plot all 9033 light curves, including those lacking a diameter estimate. Contours representing the maximal spin rate of a body held together solely by self-gravity of certain uniform densities are overplotted. Our data as a whole do not appear to populate the region beyond the ∼2 g cm⁻³ contour. Later in this section we will re-examine this behavior separately for the two major taxonomic classes.
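To give a feel for where such contours fall, the sketch below evaluates the critical period of a strengthless body. The spherical limit $P=\sqrt{3\pi /(G\rho )}$ follows from balancing gravity and centrifugal acceleration at the equator; the $\sqrt{1+{\rm{\Delta }}m}$ elongation factor is an approximation often quoted in the asteroid literature and is an assumption here, so the contours in Figure 17 may have been computed differently. The function name is hypothetical.

```python
import math

G_NEWTON = 6.674e-11  # gravitational constant, m^3 kg^-1 s^-2

def critical_period_hours(density_g_cm3, amplitude_mag=0.0):
    """Approximate shortest rotation period of a strengthless body.

    Sphere limit: P = sqrt(3*pi / (G*rho)). The sqrt(1 + amplitude) factor is an
    assumed elongation correction, not necessarily the one behind Figure 17.
    """
    rho = density_g_cm3 * 1000.0  # g cm^-3 -> kg m^-3
    p_sphere = math.sqrt(3.0 * math.pi / (G_NEWTON * rho))  # seconds
    return p_sphere * math.sqrt(1.0 + amplitude_mag) / 3600.0

for rho in (1.0, 2.0, 3.0):
    print(rho, [round(critical_period_hours(rho, a), 2) for a in (0.0, 0.5, 1.0)])
```

Under this approximation a denser body can spin faster at fixed amplitude, which is why an empty region beyond a given contour translates into an upper bound on bulk density.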

6.3. Phase Functions and Bond Albedos

We consider any of the 54,296 fitted PTF light curves to have a reliably fit phase function if both of the following conditions are satisfied:

1. The light curve is one of the 9033 having a reliable period fit, or its fitted amplitude (for the G₁₂ model) is less than 0.1 mag (the latter is true for 1939 light curves, only 39 of which have reliable periods).
2. The light curve is fit using data from at least five phase-angle bins of width ${\rm{\Delta }}\alpha =3^\circ$. These five bins need not be contiguous, and they need not include phase angles in the region where opposition surges are typically measured (i.e., $\alpha \lesssim 10^\circ$).

The above two criteria are met by 3902 out of the 54,296 PTF light curves. Of these, 1648 have an infrared-based diameter available, 651 have a color index available, and 361 have both a diameter and color index.
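The two criteria can be expressed compactly as follows. This is only a sketch: it assumes the 3° bins start at $\alpha =0^\circ$ (the bin origin is not stated above), and the function and argument names are hypothetical.

```python
def occupied_phase_bins(phase_angles_deg, bin_width_deg=3.0):
    """Return the set of phase-angle bin indices containing at least one observation."""
    # Assumption: bins are [0, 3), [3, 6), ... degrees.
    return {int(alpha // bin_width_deg) for alpha in phase_angles_deg}

def has_reliable_phase_function(phase_angles_deg, reliable_period, amplitude_mag,
                                min_bins=5, max_amplitude_mag=0.1):
    """Apply both criteria: (reliable period OR low amplitude) AND enough occupied bins."""
    rotation_ok = reliable_period or amplitude_mag < max_amplitude_mag
    coverage_ok = len(occupied_phase_bins(phase_angles_deg)) >= min_bins
    return rotation_ok and coverage_ok

# Five non-contiguous bins are sufficient; two bins are not.
print(has_reliable_phase_function([1, 10, 16, 22, 28], True, 0.3))
print(has_reliable_phase_function([1, 2, 2.5, 4], True, 0.3))
```

Passing `min_bins=4` gives the relaxed variant of the coverage requirement.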

Figure 18 details the distributions of the fitted phase parameters G₁₂, G, β, and C against the color index, bond albedo, and in 1D histograms with color-based taxonomic subsets. Though the phase parameters are all correlated with color index and with bond albedo, none of the 1D phase-parameter distributions (right column of plots) exhibit bimodality alone, whereas the bond albedo (bottom right plot) does show significant bimodality. The red and blue histograms consist of all asteroids having color metric either less than 0.25 (C types) or greater than 0.75 (S types). The G and $(\beta ,C)$ phase parameters are only plotted for those light curves that also have a G₁₂ solution. Not every light curve produced a solution for all three of the phase-function models; hence, the sample sizes for the G and $(\beta ,C)$ models include a slightly reduced number of data points.

**Figure 18.** Various fitted phase-function parameters plotted against color index and bond albedo (two-dimensional histograms; the total number of light curves in each plot is stated above it as $N=\ldots$). The right column of plots shows one-dimensional distributions, with the color-index-classified objects plotted separately; there, C and S types are defined as objects with color indices less than 0.25 and greater than 0.75, respectively.

We reiterate our statement from Section 3.2.1 that the bond albedo ${A}_{\mathrm{bond}}$ is a more fundamental (i.e., intensive rather than extensive) property than is the geometric albedo p_V, hence our focus on ${A}_{\mathrm{bond}}$ here. The bond albedo is computed using Equation (8) together with Equation (15) and makes use of our PTF-derived absolute magnitudes—H from the G₁₂ fit in particular—as well as the phase integral q of Equation (8), also computed directly from the G₁₂ fit for ϕ. In particular,
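For orientation, a computation of this kind can be sketched numerically under the standard definitions: a phase integral $q=2{\int }_{0}^{\pi }\phi (\alpha )\mathrm{sin}\alpha \,d\alpha$, a geometric albedo $p={(1329\,\mathrm{km}/D)}^{2}\,{10}^{-0.4H}$, and ${A}_{\mathrm{bond}}=q\,p$. These forms are assumptions standing in for Equations (8) and (15), which may differ in detail (e.g., in the photometric band to which the 1329 km constant applies). The Lambert-sphere phase function is used only as a sanity check, since its phase integral is known to be 3/2; in practice the fitted ϕ would be passed in. All function names are hypothetical.

```python
import math

def phase_integral(phase_function, steps=2000):
    """q = 2 * integral over [0, pi] of phi(alpha) * sin(alpha), by the trapezoid rule."""
    h = math.pi / steps
    # The endpoints contribute nothing because sin(alpha) vanishes there.
    total = sum(phase_function(i * h) * math.sin(i * h) for i in range(1, steps))
    return 2.0 * h * total

def geometric_albedo(h_mag, diameter_km):
    """Assumed standard relation p = (1329 km / D)^2 * 10^(-0.4 H)."""
    return (1329.0 / diameter_km) ** 2 * 10.0 ** (-0.4 * h_mag)

def bond_albedo(h_mag, diameter_km, phase_function):
    """Bond albedo as the product of the phase integral and the geometric albedo."""
    return phase_integral(phase_function) * geometric_albedo(h_mag, diameter_km)

def lambert_phase(alpha):
    """Phase function of a Lambertian sphere (sanity check only; alpha in radians)."""
    return (math.sin(alpha) + (math.pi - alpha) * math.cos(alpha)) / math.pi

print(phase_integral(lambert_phase))           # approximately 1.5
print(bond_albedo(12.0, 20.0, lambert_phase))  # illustrative H and D only
```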

6.3.1. Taxonomy from Light-curve Data

We use the distribution of bond albedo versus G₁₂ to define another taxonomic metric analogous to the color index. In particular, we apply the same clustering analysis to this distribution as we did for the seven 2D color distributions in the Appendix. This procedure assigns to every object in the ${A}_{\mathrm{bond}}$ -versus-G₁₂ diagram a probability of membership in each of two clusters (color-coded blue and orange in Figure 19). The cluster centers are fit by the algorithm, and the output class probability of a given data point relates to its distance from these cluster centers. Probabilities near 0 represent likely C-type class membership, while probabilities near 1 represent likely S-type membership. We refer to this new metric as the photometric index; it complements the color index as another proxy for taxonomy. There are 361 asteroids with both a photometric index and color index available (Figure 19, right plot); the two indices are clearly correlated ( ${\rho }_{\mathrm{Spearman}}=0.73$ , $\gt 10\sigma$ significance). Note that asteroids only have a defined photometric index if they have an infrared-derived diameter available, so that ${A}_{\mathrm{bond}}$ is defined.

**Figure 19.** *Left*: we perform the same clustering analysis used in defining the color index (see Appendix), this time on the G₁₂ vs. ${A}_{\mathrm{bond}}$ distribution, which contains 1631 PTF light curves, all of which have IR-derived diameters and reliable phase functions. The output of this clustering analysis is the *photometric index*, which, analogous to the color index, is a number between 0 (C type) and 1 (S type) quantifying the class membership of each constituent asteroid data point. *Right*: correlation between the color index and our photometric index, a comparison that can be made for 361 objects. Note that most data are in the lower left and upper right corners.

6.3.2. Wavelength Dependence

Observational evidence for the reddening of asteroid colors with increasing phase angle is discussed by Sanchez et al. (2012 and references therein). Color variation with phase angle can be equivalently stated as variation of the phase function with wavelength. Asteroids that have PTF light curves in both of the survey's filters (R and g band) allow us to investigate this phenomenon. We note, however, that Sanchez et al. (2012) describe phase reddening as being more pronounced at longer wavelengths (>0.9 μm) and larger phase angles ( $\alpha \gt 30^\circ$ ), such that a priori we should not expect a very pronounced effect (if any) in the visible-band PTF data.

Similar to the complication associated with comparing spin amplitudes from multiple oppositions (Section 3.1.1), an asteroid's mean color can potentially change if the spin axis varies with respect to our line of sight from year to year. Hence, we choose not to compare R-band and g-band phase-function fits from different oppositions. Aside from this constraint, we adopt the same two reliability selection criteria stated in Section 6.3, with a slight modification of requirement #2: here we allow four or more phase-angle bins of width ${\rm{\Delta }}\alpha =3^\circ$, as opposed to the previous section's five-bin requirement, because of the small sample size.

There are 92 asteroids with both R-band and g-band phase-function fits acquired during the same opposition that meet the above criteria. For each asteroid we difference the R-band G₁₂ value from the g-band G₁₂ value. The mean of this difference is $-{0.004}_{-0.14}^{+0.19}$ , indicating (for the whole sample) no significant non-zero difference between the two bands' G₁₂ values. Likewise, for β, we compute a difference of ${0.002}_{-0.003}^{+0.008}$ , also consistent with zero difference between the bands.

Since these fits provide absolute magnitudes in each band (i.e., H_g and H_R), we compute the color ${H}_{g}-{H}_{R}$ for the 92-asteroid sample. Figure 20 shows that the distribution of this color is bimodal, suggesting that it is a viable proxy for taxonomy. This is further supported by the strong correlation between ${H}_{g}-{H}_{R}$ and the R-band G₁₂ value. No correlation is seen, however, between ${H}_{g}-{H}_{R}$ and the difference between the two bands' G₁₂ or β values.

**Figure 20.** For the 92 asteroids with both R-band and g-band light-curve fits from the same opposition, we use the resulting difference in the absolute magnitudes ${H}_{g}-{H}_{R}$ as a proxy for taxonomy. This color distribution is qualitatively bimodal (top left), and the correlation with G₁₂ is very robust (top right). We detect no significant difference in the G₁₂ and/or β parameters between the two bands, both in the sample as a whole and as a function of the ${H}_{g}-{H}_{R}$ color.

6.4. Spins and Amplitudes versus Taxonomy

The union of the color-index data (see Appendix) and photometric-index data (Section 6.3.1) provides significantly better taxonomic coverage of the PTF light curves (Figure 21). With this composite taxonomic information in hand, we can repeat the spin-amplitude analyses of Section 6.2 (Figure 17), this time considering the C-type and S-type groups separately. We define objects with one or both of the indices less than 0.25 as C type and greater than 0.75 as S type. We detail the resulting 1795-object taxonomically classified sample in Figure 22. There were 20 asteroids with conflicting color-based and photometric-based classifications that are not included in this 1795-object sample.
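The classification rule just described can be written out explicitly. The sketch below is illustrative only; it assumes a missing index is represented by `None`, and the function names are hypothetical.

```python
def classify_index(value, c_max=0.25, s_min=0.75):
    """Map one index in [0, 1] to 'C', 'S', or None (missing or intermediate)."""
    if value is None:
        return None
    if value < c_max:
        return "C"
    if value > s_min:
        return "S"
    return None

def combined_taxonomy(color_index, photometric_index):
    """Union of the two classifications; conflicting labels yield None (excluded)."""
    labels = {classify_index(color_index), classify_index(photometric_index)} - {None}
    return labels.pop() if len(labels) == 1 else None

print(combined_taxonomy(0.1, None))  # C
print(combined_taxonomy(0.5, 0.9))   # S
print(combined_taxonomy(0.1, 0.9))   # None (conflicting classifications)
```

Taking the union rather than the intersection is what increases the sample size: an object needs only one decisive index, and only objects whose two indices disagree are dropped.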

**Figure 21.** Added completeness from supplementing the color index with the photometric index among asteroids having PTF light curves. Both indices are a proxy for the taxonomic type. The left- and right-hand plots apply separately to the subset labeled by the black line above each column.

**Figure 22.** Taxonomic dependence on spin rate and amplitude, also vs. diameter, using the union of the color-index- and photometric-index-based C/S taxonomy.

The one-dimensional histogram in Figure 22 indicates that S-type asteroids dominate the smallest objects with data in PTF while C types dominate the largest. This reflects the fact that the survey's upper and lower sensitivity limits are defined in terms of absolute magnitude H (affected by albedo) rather than physical diameter, i.e., S-type asteroids larger than ∼50 km will tend to saturate the PTF detector, while C-type asteroids smaller than ∼5 km will usually fall below the detection limit. Adding to this effect is the fact that S types mostly occupy the inner main belt, where they are brighter by virtue of smaller heliocentric and geocentric distances, as compared to the usually more distant C types. While the two classes have similar representation in the sample (882 S types versus 913 C types), their true population ratio also affects the relative numbers.

The right-hand side plots in Figure 22 show rotation rate and amplitude versus diameter separately for the two taxonomic groups. Rather than plot a two-dimensional histogram as was done in Figure 17, for readability we here just plot the geometric mean and percentiles, computed by exactly the same running-bin method described in Section 6.2. The most prominent trend is that among $5\lesssim (D/\mathrm{km})\lesssim 20$ asteroids, C types appear to rotate slower than S types and have larger amplitudes than S types. Assuming that both asteroid groups share the same mean angular momentum, the discrepancy could reflect the C types' ability to more efficiently redistribute material away from their spin axis, thereby increasing their moment of inertia (amplitude) while decreasing their angular rotation rate (i.e., a simple manifestation of conservation of angular momentum).

The above-stated assumption of a common mean angular momentum between C and S types is merely a simple case and is neither unique nor rigorously motivated. More careful consideration of, e.g., plausible ranges of internal tensile strengths of the two types could easily lead to more diverse scenarios wherein the two groups actually have different angular momenta and the observed spin-amplitude trends. As noted earlier (Section 6.2), large asteroids in general appear to have retained a significant primordial component in their spin distribution (Steinberg & Sari 2015); it is therefore important that differences in the origin of C types and S types (accretionary, temporal and/or spatial) be taken into account along with differences in collisional evolution and differing contributions from radiative forces like YORP. Simulations of the main belt's origin, such as the Grand Tack family of models (Walsh et al. 2011), should ultimately be modified to track particle spin evolution as well as orbits.

We also reproduce the period-versus-amplitude plot first shown in Figure 17, this time plotting separately the two taxonomic groups. The S types show a clearer cutoff at the 2 g cm⁻³ contour line, suggesting that they may in general be of greater bulk density than the C types, which show a softer boundary in this period-versus-amplitude space, the precise location of which appears to be somewhere between 1 and 2 g cm⁻³. Note that comparison to these density contours is only valid if the asteroids in consideration are held together mostly by self-gravity and approximated as fluids (as opposed to having significant internal cohesive or frictional resistance). These results are in general agreement with existing asteroid density estimates (Carry 2012 and references therein). Results from an independent analysis of a smaller, more densely sampled set of PTF asteroid light curves (Chang et al. in review; a study that follows closely the approach of Chang et al. 2014a) agree with the C type versus S type rotation rate discrepancy discussed here.

7. COMPARISON TO MPC-GENERATED MAGNITUDES

Absolute magnitudes available through the Minor Planet Center (MPC) and JPL Solar System Dynamics²¹ websites are fit using all available survey/observer-contributed photometry. These H magnitudes are used in various online ephemeris tools to compute predicted V magnitudes to accompany astrometric predictions. Their model assumes no rotational modulation, uses the Lumme–Bowell G-model (Section 3.2.2), and (with the exception of ∼100 large objects, nearly all with $D\gt 30$ km) assumes a constant G = 0.15 for all asteroids. Our results (Figure 18, second row of plots) show that the G = 0.15 approximation does indeed agree well with the peak of the distribution of fitted G values. The PTF-fitted G values do, however, show some spread and variation with taxonomy. In this section we explore the resulting differences in the absolute magnitudes H and in predicted magnitudes.
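A rotation-neglecting prediction of this kind can be sketched as below. The sketch assumes the widely used exponential approximation to the two-parameter (H, G) phase function and the usual reduction $V=H+5{\mathrm{log}}_{10}(r{\rm{\Delta }})-2.5{\mathrm{log}}_{10}\phi (\alpha )$, with heliocentric distance r and geocentric distance Δ in au; the exact expressions used by the online ephemeris tools are not given here and may differ. Function names are hypothetical.

```python
import math

def hg_phase_function(alpha_deg, g=0.15):
    """Assumed exponential approximation to the (H, G) phase function."""
    t = math.tan(math.radians(alpha_deg) / 2.0)
    phi1 = math.exp(-3.33 * t ** 0.63)
    phi2 = math.exp(-1.87 * t ** 1.22)
    return (1.0 - g) * phi1 + g * phi2

def predicted_magnitude(h_mag, r_au, delta_au, alpha_deg, g=0.15):
    """Apparent magnitude ignoring rotation: H + 5 log10(r * Delta) - 2.5 log10(phi)."""
    return (h_mag + 5.0 * math.log10(r_au * delta_au)
            - 2.5 * math.log10(hg_phase_function(alpha_deg, g)))

# Illustrative geometry only: H = 15, r = 2.5 au, Delta = 1.6 au, alpha = 12 deg.
print(round(predicted_magnitude(15.0, 2.5, 1.6, 12.0), 2))
```

Because G enters only through ϕ(α), fixing G = 0.15 leaves the prediction unchanged at zero phase angle and shifts it increasingly at larger phase angles for objects whose true G differs.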

7.1. Filter Transformations

In order to compare the MPC-listed ( ${H}_{\mathrm{MPC}}$ ) magnitudes, which are in V band, with PTF's absolute magnitudes ( ${H}_{\mathrm{PTF}}$ , corresponding to the G-model fit), which are in either R or g band, we must first compute an approximate transformation from V band to each PTF band. While some transformations are given by Ofek et al. (2012a), we here prefer to empirically estimate these using actual asteroid photometry from both PTF and the MPC, rather than generating them from the more general transformations of Ofek et al. (2012a).

Figure 23 plots ${H}_{\mathrm{PTF}}-{H}_{\mathrm{MPC}}$ for asteroids whose PTF-derived ${G}_{\mathrm{PTF}}$ is in the range $0.1\lt {G}_{\mathrm{PTF}}\lt 0.2$ . By restricting the comparison to objects with fitted ${G}_{\mathrm{PTF}}$ values close to 0.15, we in principle select ${H}_{\mathrm{MPC}}$ magnitudes for which the MPC's ${G}_{\mathrm{MPC}}=0.15$ assumption is actually valid (none of the asteroids in Figure 23 have MPC-listed G values other than the default 0.15). Furthermore, we only consider (in Figure 23) asteroids with PTF data in at least three phase-angle bins of ${\rm{\Delta }}\alpha =3^\circ$ and either a reliable period or fitted amplitude less than 0.1 mag.

**Figure 23.** *Left*: transformations between MPC V band and the PTF R and g bands for asteroids, based on the difference between MPC-fitted and PTF-fitted H magnitudes for asteroids whose PTF-fitted G values are in the range $0.10\lt G\lt 0.20$, as well as other PTF-coverage constraints (see text). *Right*: R-band data only, with S and C types defined with color and/or photometric indices (again using the <0.25 and >0.75 index thresholds).

Comparing the ${H}_{\mathrm{MPC}}$ and ${H}_{\mathrm{PTF}}$ magnitudes for this specific subset of asteroids, we obtain approximate transformations $R=V+(0.00\pm 0.10)$ and $g=V+(0.55\pm 0.16)$ . The $1\sigma$ uncertainties of 0.10 and 0.16 mag plausibly include a combination of the photometric calibration uncertainties of the MPC data (coming from a variety of surveys/observers), variation in H magnitude of a given asteroid between different oppositions (the MPC fits combine data possibly acquired at different viewing geometries), and the range of ${G}_{\mathrm{PTF}}$ used in selecting the asteroids in this sample. Consideration of a range of ${G}_{\mathrm{PTF}}$ values is equivalent to considering a range of asteroid colors (see the color-versus-G correlation seen in Figure 18). Hence, the uncertainties in these transformations also encompass the variation that might otherwise be formally fit in a color term for the transformations. Such a color term for R to V would almost certainly be less significant than that of g to V, as the former transformation is already zero within uncertainties. The larger uncertainty in the g to V transformation is likely attributable to both the smaller sample size and the fact that the V bandcenter is further displaced from g than from R, such that color variation has a more pronounced effect.
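An empirical offset of this kind amounts to a selection followed by a mean and scatter. The sketch below assumes a plain mean and sample standard deviation (the estimator actually used for the quoted values is not specified above), and the function name and toy numbers are hypothetical.

```python
import statistics

def band_offset(h_ptf, h_mpc, g_ptf, g_min=0.1, g_max=0.2):
    """Mean and sample standard deviation of H_PTF - H_MPC for g_min < G_PTF < g_max."""
    diffs = [hp - hm for hp, hm, g in zip(h_ptf, h_mpc, g_ptf) if g_min < g < g_max]
    return statistics.mean(diffs), statistics.stdev(diffs)  # needs >= 2 selected objects

# Toy values: the third object is excluded because its G lies outside the range.
offset, scatter = band_offset([15.5, 14.6, 13.0], [15.0, 14.0, 12.0], [0.15, 0.12, 0.4])
print(round(offset, 3), round(scatter, 3))
```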

Given the above-computed transformations, and the fact that 89% of our fitted light curves are in R band, we proceed using only R-band light-curve fits, which we compare directly against MPC magnitudes (or formally, after applying the transformation of zero). A detail of the color dependence of the R to V transformation appears in the right plot of Figure 23; the mean transformation differs slightly between S and C types but not at a level comparable to the uncertainty in either.

7.2. Absolute Magnitudes

In Figure 24 we show the relative error in the MPC absolute magnitudes as compared to the PTF magnitudes, for all 1630 light curves with sufficient phase-angle coverage in PTF (with the five-bin phase-angle criterion). These errors should reflect not only any discrepancy due to the different phase function models (PTF's G₁₂ versus MPC's G) but also variation in absolute photometric calibrations (within the MPC data internally and/or between the MPC and PTF data sets). The 0.1 mag uncertainty in the R to V band transformation has a prominent contribution to the errors shown here (the mean and 84th percentile of the errors expected from the 0.1 mag transformation uncertainty alone are shown as yellow dashed lines and assume p_V = 0.07). The green line (computed mean) and upper red line (84th percentile) indicate that the errors are ∼1% greater than those expected from the transformation uncertainty alone, though this increases slightly for the largest ( $D\gt 30$ km) objects. Note that many of these largest asteroids are more frequently observed by programs other than the major sky surveys; these smaller facilities tend to use smaller aperture telescopes and different absolute calibration standards, which would contribute to the error.

**Figure 24.** Errors in the MPC-listed absolute magnitudes relative to the PTF H values (in R band and using the G₁₂ fit's H value), only considering asteroids with IR-derived diameters. On the right is the corresponding geometric albedo relative error. Pixels in the 2D histograms shown here are column normalized. The running-bin geometric mean and 16th and 84th percentiles are shown as green and red lines. Yellow dashed lines are the mean and 84th percentile expected from the 0.1 mag transformation uncertainty alone (for 7% geometric albedo).

7.3. Predicted Apparent Magnitudes

Instead of comparing just the fitted H magnitudes, for every light curve with a reliable PTF-fitted phase function we also compare the rms residual of all PTF data in that light curve with respect to both our G₁₂-fit-predicted R magnitude and the MPC (G = 0.15) predicted V magnitude. Our model includes more fitted parameters and should therefore yield smaller residuals; Figure 25 shows that the PTF fit reduces the residuals by a factor of ∼3. Note that if the 0.1 mag R-to-V transformation uncertainty were the only significant contributor to the MPC residuals, then their peak would be at ∼0.07 mag rather than ∼0.25 mag. Ignored rotational modulation and inaccurate phase functions move the MPC residuals distribution to higher rms values.
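The effect of neglecting rotation on the rms residual can be seen in a toy example. The sketch below uses a noise-free synthetic light curve with a single sinusoidal rotation term; the period, amplitude, and mean magnitude are illustrative values and not drawn from the PTF data.

```python
import math

def rms_residual(observed, predicted):
    """Root-mean-square of observed minus predicted magnitudes."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted))
                     / len(observed))

# Synthetic light curve: constant mean magnitude plus a double-peaked rotation term.
times = [0.1 * i for i in range(180)]               # hours, spanning whole cycles
period, semi_amplitude, mean_mag = 6.0, 0.2, 18.0   # illustrative values only
observed = [mean_mag + semi_amplitude * math.sin(4.0 * math.pi * t / period)
            for t in times]

with_rotation = observed[:]                 # model that reproduces the rotation term
without_rotation = [mean_mag] * len(times)  # model that neglects rotation

print(rms_residual(observed, with_rotation))     # 0.0 in this noise-free toy case
print(rms_residual(observed, without_rotation))  # close to semi_amplitude / sqrt(2)
```

Even with a perfect phase function, a model that ignores a rotational term of semi-amplitude A therefore carries an rms floor of roughly A/√2.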

**Figure 25.** Comparison of the rms residuals, with respect to the PTF ($H,{G}_{12}$) plus rotation fit and the MPC ($G=0.15$) fit, for all light curves having a reliable R-band PTF phase-function fit.

The lower rms residuals afforded by the PTF light-curve model permit a more sensitive search for low-level transient activity (e.g., collisional events, cometary brightening) in these asteroids. For example, Cikota et al. (2014) perform a search for active main-belt asteroids using photometric residuals of all MPC data taken with respect to the MPC-predicted apparent V magnitudes. We are currently pursuing a similar analysis using these PTF light curves, as a follow-up to the morphology-based search already completed with PTF (Waszczak et al. 2013). A hybrid approach, wherein morphological measurements are made on stacked images of asteroids that have reliable light-curve fits, could further reveal this kind of subtle activity.

8. SUMMARY

From 5 yr of PTF survey data we have extracted over 4 million serendipitous detections of asteroids with known orbits. We fit a photometric model to ∼54,000 light curves, each consisting of at least 20 observations acquired within a given opposition in a single filter. We adopt a second-order (four-term) Fourier series for the rotation component and fit three distinct phase-function models. We assess the reliability of our retrieved rotation periods by subjecting them to both an automated classifier and manual review. Both vetting processes are trained on a sample of ∼800 asteroids with previously measured spin periods that also occur in our sample. We consider the intersection of the two screened samples for subsequent analysis.

Preliminary analysis (on distributions that are not de-biased) of the rotation period versus diameter confirms the previous finding that asteroids smaller than ∼40 km do not conform to a Maxwellian distribution in their normalized spin frequencies. Phase-function parameters are shown to correlate strongly with the bond albedo. None of the phase-function parameters display bimodality in their measured distributions, however. Together with the bond albedo, we use the phase-function data to define a new taxonomic metric based solely on single-band light-curve properties together with infrared-derived diameters (G₁₂ and ${A}_{\mathrm{bond}}$). This metric complements the color-based index established previously by many visible-color and spectroscopic surveys. Combining these color- and photometry-based taxonomic indices allows us to separately examine the spin and amplitude distributions of the C-type and S-type asteroids with the largest possible sample sizes. Doing so reveals that, among small objects (5 km $\lt \;D\;\lt$ 20 km), the C types show larger amplitudes and slower spin rates. If the two populations shared a common angular momentum distribution, this could be interpreted as the two compositional types' differing tendencies to redistribute mass away from their spin axes. Comparison of the spin-amplitude distribution with contours of maximal spin rates for cohesionless bodies suggests that almost all asteroids are less dense than ∼2 g cm⁻³, with C types displaying a potentially less dense upper limit of between 1 and 2 g cm⁻³.

Finally, our fitted absolute magnitudes differ from those generated by the Minor Planet Center's automated fitting procedures, though the precise discrepancy is difficult to ascertain given uncertainty in the transformation between PTF R band and the MPC's V band. The utility in using our model to predict asteroid apparent magnitudes is seen in the threefold reduction in rms scatter about our model relative to the fiducial G = 0.15 model that neglects rotation. This reduced scatter is an essential prerequisite for sensitive searches for cometary, collisional, and other transient activity in what would otherwise be regarded as quiescent asteroids—potentially even bright objects.

This work uses data obtained with the 1.2 m Samuel Oschin Telescope at Palomar Observatory as part of the PTF, a scientific collaboration between the California Institute of Technology (Caltech), Columbia University, Las Cumbres Observatory Global Telescope Network, Lawrence Berkeley National Laboratory, the National Energy Research Scientific Computing Center, the University of Oxford, and the Weizmann Institute of Science (WIS).

Some data in this work (also from the 1.2 m Oschin Telescope) were obtained as part of the iPTF project, a collaboration among Caltech, the Kavli Institute for the Physics and Mathematics of the Universe, Los Alamos National Laboratory, the Oskar Klein Centre, the University System of Taiwan, the University of Wisconsin Milwaukee, and WIS.

A. Waszczak has been supported in part by the W. M. Keck Institute for Space Studies (KISS) at Caltech. E.O.O. is incumbent of the Arye Dissentshik career development chair and is grateful for support by grants from the Willner Family Leadership Institute Ilan Gluzman (Secaucus NJ), Israeli Ministry of Science, Israel Science Foundation, Minerva and the I-CORE Program of the Planning and Budgeting Committee, and The Israel Science Foundation.

This work also makes use of data products from the Wide-Field Infrared Survey Explorer, a joint project of the University of California Los Angeles and the Jet Propulsion Laboratory (JPL)/Caltech, funded by NASA. This work also makes use of data from NEOWISE, which is a project of JPL/Caltech, funded by the Planetary Science Division of NASA.

This work also makes use of data from the SDSS, managed by the Astrophysical Research Consortium for the Participating Institutions and funded by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the US Department of Energy, NASA, the Japanese Monbukagakusho, the Max Planck Society, and the Higher Education Council for England.

Lastly, we thank an anonymous reviewer for helpful comments and feedback.

APPENDIX

A.1. Multi-survey Visible-band Color Index

The purpose of this Appendix section is to introduce a one-dimensional color metric, based on data from seven different colorimetric asteroid surveys, which quantifies an asteroid's first-order visible-band color-based taxonomy as a number between 0 (C-type endmember) and 1 (S-type endmember). Our primary motivation for doing this is to enable a uniform comparison of PTF light-curve-derived parameters with color spanning from the brightest/largest objects ( $H\approx 8$ –9 mag, or $D\approx 125$ –80 km diameters) down to PTF's detection limit for main-belt asteroids ( $H\approx 16$ mag, or $D\approx 2$ –4 km). Figure 26 panel A shows that the fraction of PTF light curves with color information increases by a factor of ∼3 among large asteroids when all seven surveys are considered, whereas for smaller objects the Sloan Digital Sky Survey's (SDSS; York et al. 2000; Ivezić et al. 2002; Parker et al. 2008) moving-object catalog provides essentially all of the color information.

**Figure 26.** *Panel A*: Fraction of PTF light curves with colorimetric data available, for both the reliable-period and reliable-period-plus-G₁₂ sets of light curves. *Panel B*: Two-dimensional color distributions for seven surveys, and correlations of FCM-clustering-derived classifications between all pairs of surveys.

The seven surveys we use are described in Table 3. All of these surveys contain at least two independent color measurements, and when plotting their data in these two-dimensional spaces (or 2D subspaces defined by properly chosen principal components or spectral slope parameters), the first-order C-type and S-type clusters are in all cases prominently seen (Figure 26, panel B). To each such 2D color distribution we apply a two-dimensional fuzzy c-means (FCM) clustering algorithm (Bezdec 1981; Chiu 1994). For each survey data set, FCM iteratively solves for a specified number of cluster centers (in our case, two) in N dimensions (in our case, two dimensions) by minimizing an objective function that adaptively weights each datum according to the robustness of its membership in a given cluster. The FCM output includes computed cluster centers and, for each datum, the probability that the datum belongs to each cluster (this being related to the datum's distance from each cluster center).
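For readers unfamiliar with FCM, the iteration can be sketched as follows. This is a textbook-style minimal version, not the implementation used in this work: the fuzzifier m = 2, the deterministic initialization from the extremes of the sorted data, the stopping tolerance, and the toy points are all assumptions made for illustration.

```python
import math

def fuzzy_c_means(points, n_clusters=2, m=2.0, n_iter=100, tol=1e-6):
    """Minimal fuzzy c-means (n_clusters >= 2); returns (centers, memberships).

    memberships[k][i] is the degree to which point k belongs to cluster i.
    """
    # Deterministic start (an assumption): spread initial centers along the sorted data.
    ordered = sorted(points)
    centers = [ordered[(len(ordered) - 1) * i // (n_clusters - 1)]
               for i in range(n_clusters)]
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(n_iter):
        # Membership update: u_i = 1 / sum_j (d_i / d_j)^(2 / (m - 1)).
        memberships = []
        for p in points:
            dists = [max(math.dist(p, c), 1e-12) for c in centers]
            memberships.append([1.0 / sum((d_i / d_j) ** exponent for d_j in dists)
                                for d_i in dists])
        # Center update: mean of the points weighted by u^m.
        new_centers = []
        for i in range(n_clusters):
            weights = [u[i] ** m for u in memberships]
            total = sum(weights)
            new_centers.append(tuple(
                sum(w * p[dim] for w, p in zip(weights, points)) / total
                for dim in range(len(points[0]))))
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships

# Toy 2D "color" data with two well-separated groups.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (1.0, 1.0), (1.1, 1.0), (1.0, 1.1)]
centers, memberships = fuzzy_c_means(points)
print(centers)
print([round(u[1], 3) for u in memberships])  # membership in the second cluster
```

The membership in one of the two clusters plays the role of a 0-to-1 index: points near one center receive values close to 0, points near the other receive values close to 1, and ambiguous points fall in between.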

Table 3. Asteroid Colorimetry Data Sets Used in Computing This Work's C/S Color Metric

Survey Name	References	Data Description	# Asteroids
UBV colors	Bowell et al. (1978); Tedesco (1995)	U, B, and V broadband photometry acquired mostly at Lowell Observatory in the 1970s with photomultiplier tubes.	902
Eight-Color Asteroid Survey (ECAS)	Zellner et al. (1985); Zellner et al. (2009)	Photometry in eight custom filters measured with photomultipliers at Catalina and Steward Observatories. We compute and use the principal component color index PC#. Excludes objects with PC#1 error >0.3 mag.	480
24-Color Asteroid Survey	Chapman & Gaffey (1979, p. 655); Chapman et al. (1993)	Photometry in 24 interference filters measured with photomultipliers at Mauna Kea. We compute and use the mean spectral reflectance slope and first principal component.	262
Small Main-belt Asteroid Spectroscopic Survey (SMASS)	Xu et al. (1995); Xu et al. (1996)	CCD spectroscopy (0.4–1.0 μm, $R\approx 100$ ) conducted mostly at Kitt Peak. We compute and use the mean spectral reflectance slope and first principal component.	305
Small Main-belt Asteroid Spectroscopic Survey II (SMASS-2)	Bus & Binzel (2002); Bus & Binzel (2003)	CCD spectroscopy (0.4–1.0 μm, $R\approx 100$ ) conducted at Kitt Peak. We compute and use the mean spectral reflectance slope and first principal component.	1313
Small Solar System Objects Spectroscopic Survey (S3OS2)	Lazzaro et al. (2004)	CCD spectroscopy (0.5–0.9 μm, $R\approx 500$ ) conducted at ESO (La Silla). We compute and use the mean spectral reflectance slope and first principal component.	730
Sloan Digital Sky Survey (SDSS) griz colors	Ivezić et al. (2002); Parker et al. (2008); Ivezić et al. (2010)	g, r, i, and z broadband CCD photometry acquired by SDSS from 1998–2009. Includes data in the Moving Object Catalog v4, supplemented with post-2007 detections from SDSS DR10. We use the first principal component a* defined in the references. Excludes objects with a* error >0.05 mag or ( $i-z$ ) error >0.1 mag.	30,518

Note. These data sets are visualized in Figure 26.


Table 4. Parameters Describing PTF Light Curves with a Reliable Period or Phase Function. Byte-by-byte Description of file: ptf_asteroid_lc_parameters.txt

Bytes	Format	Units	Label	Explanations
1–4	I4	⋯	⋯	Light-curve ID number^a
6–11	I6	⋯	⋯	Asteroid number (IAU designation)
13–14	I2	yr	⋯	Last two digits of opposition year
16	I1	⋯	⋯	Photometric band: 1 = Gunn-g, 2 = Mould-R
18–20	I3	⋯	⋯	Number of observations in the light curve
22–26	F5.2	mag	⋯	Median apparent magnitude
28–37	F10.5	day	${t}_{\mathrm{min}}$	Time (MJD) of first observation
39–48	F10.5	day	${t}_{\mathrm{max}}$	Time (MJD) of final observation
50–54	F5.2	deg	${\alpha }_{\mathrm{min}}$	Minimum-observed phase angle
56–60	F5.2	deg	${\alpha }_{\mathrm{max}}$	Maximum-observed phase angle
62–63	I2	⋯	⋯	Number of sampled phase-angle bins of 3-deg width
65–68	F4.2	⋯	p	Reliability score from machine classifier: 0 = bad, 1 = good
70	I1	⋯	⋯	Manually assigned reliability flag: 0 = bad, 1 = good
72	I1	⋯	⋯	Period reliability flag: 0 = bad, 1 = good (product of two previous columns)
74–79	F6.3	mag	H	Absolute magnitude from G₁₂ fit
81–85	F5.3	mag	⋯	Uncertainty in absolute magnitude from G₁₂ fit
87–91	F5.3	⋯	G₁₂	Phase-function parameter G₁₂
93–98	F6.3	⋯	⋯	Uncertainty in G₁₂^b
100–105	F6.3	⋯	G	Phase-function parameter G
107–113	F7.4	mag/deg	β	Phase-function parameter β
115–119	F5.3	mag	C	Phase-function parameter C
121–124	F4.2	mag	⋯	Amplitude from G₁₂ fit (max − min)
126–134	F9.4	hr	P	Period from G₁₂ fit
136–144	F9.4	hr	⋯	Period uncertainty from G₁₂ fit
146–152	F7.4	mag	A₁₁	Fourier coefficient ${A}_{\mathrm{1,1}}$ from G₁₂ fit
154–160	F7.4	mag	A₁₂	Fourier coefficient ${A}_{\mathrm{1,2}}$ from G₁₂ fit
162–168	F7.4	mag	A₂₁	Fourier coefficient ${A}_{\mathrm{2,1}}$ from G₁₂ fit
170–176	F7.4	mag	A₂₂	Fourier coefficient ${A}_{\mathrm{2,2}}$ from G₁₂ fit
178–181	F4.2	⋯	⋯	Ratio of the two peak heights in folded rotation curve^c
183–186	F4.2	⋯	${\chi }_{\mathrm{red}}^{2}$	Reduced chi-squared of the fit
188–192	F5.3	mag	⋯	"Cosmic error" (see Section 4.1)
194–198	F5.3	mag	⋯	Root-mean-square residual of observations with respect to the fit
200–206	F7.3	hr	⋯	Reference period (from http://sbn.psi.edu/pds/resource/lc)
208–213	F6.2	km	D	Diameter derived from thermal IR data^d
215–218	F4.2	km	⋯	Uncertainty in diameter
220–224	F5.3	⋯	${A}_{\mathrm{bond}}$	Bond albedo^e
226–231	F6.4	⋯	⋯	Uncertainty in bond albedo
233–236	F4.2	⋯	⋯	Color-based taxonomic index: 0 = C-type, 1 = S-type
238–241	F4.2	⋯	⋯	Photometry-based taxonomic index: 0 = C-type, 1 = S-type

Notes. ^aID number labels individual observations in Table 5. ^bSet to −1 if larger than the interval tested in grid search. ^cSet to 0 if there is only one maximum in the folded light curve. ^dReferences for the IR diameters are given in the text (Appendix). ^eBond albedo only computed for objects with reliable G₁₂ and available diameter.

Only a portion of this table is shown here to demonstrate its form and content. Machine-readable and Virtual Observatory (VOT) versions of the full table are available.


Table 5. Parameters Describing PTF Light Curves with a Reliable Period or Phase Function. Byte-by-byte Description of File: ptf_asteroid_lc_observations.txt

Bytes	Format	Units	Label	Explanations
1–4	I4	⋯	⋯	Light curve ID number^a
6–15	F10.5	day	τ	Light-time-corrected observation epoch
17–26	F10.7	AU	r	Heliocentric distance
28–37	F10.7	AU	Δ	Geocentric distance
39–43	F5.2	deg	α	Solar phase angle
45–50	F6.3	mag	R or g	Apparent magnitude^b
52–56	F5.3	mag	⋯	Uncertainty in apparent magnitude
58–62	F5.3	mag	⋯	Uncertainty in apparent magnitude with cosmic-error
64–69	F6.3	mag	⋯	Magnitude corrected for distance and G₁₂ phase function
71–76	F6.3	mag	⋯	Magnitude corrected for distance and rotation (G₁₂ fit)
78–83	F6.3	mag	⋯	Residual with respect to the G₁₂ fit
85–89	F4.1	⋯	⋯	Rotational phase from 0 to 1 (G₁₂ fit)

^aID number also corresponds to the line number in Table 4. ^bFilter/band is specified in Table 4.

Only a portion of this table is shown here to demonstrate its form and content. Machine-readable and Virtual Observatory (VOT) versions of the full table are available.


In the color-distribution plots of Figure 26, panel B (the plots with black backgrounds arranged diagonally), each pixel/bin is colorized according to the average cluster-membership probability of asteroids in that pixel. Blue indicates high probability of membership in cluster 1, while orange represents high probability of membership in cluster 2.

Our color index provides a more quantitative label than that offered by popular letter-based taxonomic systems (e.g., Bus et al. 2002 and references therein). Several such letter-based nomenclatures were in fact defined on the basis of one or more of these seven surveys, oftentimes using a method similar to the clustering technique we use here. We identify our blue cluster with C-type asteroids and our orange cluster with S-type asteroids, though we make this association purely for connection/compatibility with the literature. This is because our computed clusters have their own unique identity/definition, formally distinct from that given in any other work. Our clusters' definitions are nonetheless completely specified/reproducible by the FCM algorithm we used to compute them.

In reducing the taxonomic classification to a single number defined by the two most prominent groups (C and S types), we lose the ability to distinguish secondary classes like V types, D types, and so on. If such a sub-group is separated from both of the two main clusters, its members will be assigned membership scores close to 0.5. For example, in the SDSS a* versus $i-z$ complex, the clearly seen V-type "tail" protruding down from the S-type cluster appears mostly green in color, reflecting its intermediate classification. The same holds for the less clearly seen D types, which in the SDSS plot lie above the S types and to the right of the C types (again in a green-colored region). Because such secondary types are orders of magnitude less numerous than C and S types, they are mostly irrelevant for the purpose of this analysis.

We compute the numerical uncertainty (variance) of a given asteroid's cluster-membership score in a particular survey by performing many bootstrapped trials wherein we first randomly perturb all data points by random numbers drawn from Gaussian distributions whose widths are the quoted $1\sigma$ measurement (i.e., photometric) uncertainties in each of the two dimensions, and then repeat the FCM analysis on the perturbed data. The variance in each object's reported cluster probability is then computed after a large number of such trials.
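The perturb-and-recluster procedure can be sketched as follows. This is a minimal example under stated assumptions: the compact two-cluster FCM helper, its deterministic initialization, the fuzzifier m = 2, the number of trials, and the synthetic colors and uncertainties are all hypothetical choices for illustration, not the settings used in this work. Because cluster labels can swap between trials, the sketch always reports membership in the cluster whose center has the larger first coordinate.

```python
import numpy as np

def fcm_scores(x, m=2.0, n_iter=50):
    """Two-cluster FCM; returns each datum's membership in the cluster
    whose center has the larger first coordinate (keeps labels stable)."""
    # Deterministic start: centers at the extremes of the first coordinate.
    centers = x[[x[:, 0].argmin(), x[:, 0].argmax()]].astype(float)
    for _ in range(n_iter):
        d = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        inv = np.maximum(d, 1e-12) ** (-2.0 / (m - 1.0))
        u = inv / inv.sum(axis=1, keepdims=True)
        w = u ** m
        centers = (w.T @ x) / w.sum(axis=0)[:, None]
    order = np.argsort(centers[:, 0])
    return u[:, order[1]]

def score_variance(x, sigma, n_trials=100, seed=0):
    """Variance of each datum's membership score over perturbed trials."""
    rng = np.random.default_rng(seed)
    trials = np.empty((n_trials, x.shape[0]))
    for t in range(n_trials):
        # Perturb every point by its own 1-sigma uncertainty in each dimension.
        perturbed = x + rng.normal(0.0, sigma)
        trials[t] = fcm_scores(perturbed)
    return trials.var(axis=0)

# Synthetic data: two blobs plus one point halfway between them.
rng = np.random.default_rng(42)
blob_a = rng.normal([-0.10, 0.05], 0.03, size=(100, 2))
blob_b = rng.normal([0.15, -0.05], 0.03, size=(100, 2))
midpoint = np.array([[0.025, 0.0]])
colors = np.vstack([blob_a, blob_b, midpoint])
sigma = np.full(colors.shape, 0.02)  # assumed uniform photometric errors
variances = score_variance(colors, sigma)
```

In this toy setup the intermediate point, whose score is near 0.5, ends up with a much larger score variance than a typical blob member, which is the qualitative behavior one would expect for objects lying between the two main clusters.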

Some asteroids appear in only one of the seven surveys; for such objects the color index is simply its cluster-membership score in that particular survey. For asteroids appearing in multiple surveys, we take the variance-weighted average of the multiple membership scores (and compute that composite score's variance by summing the component variances in inverse quadrature, as usual).
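A short sketch of this combination step, reading "inverse quadrature" as the usual inverse-variance rule (the composite variance is the reciprocal of the summed reciprocal variances). The function name and the input values are hypothetical.

```python
import numpy as np

def combine_scores(scores, variances):
    """Inverse-variance-weighted mean and its variance."""
    scores = np.asarray(scores, dtype=float)
    weights = 1.0 / np.asarray(variances, dtype=float)
    mean = np.sum(weights * scores) / np.sum(weights)
    # 1 / var_combined = sum_i (1 / var_i)
    var = 1.0 / np.sum(weights)
    return mean, var

# Two surveys reporting scores 0.90 and 0.80 with variances 0.01 and 0.04.
combined, combined_var = combine_scores([0.90, 0.80], [0.01, 0.04])
```

For these made-up inputs the weights are 100 and 25, so the combined score is approximately 0.88 with a variance of 0.008, smaller than either input variance.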

The many off-diagonal plots in Figure 26, panel B compare the cluster-membership scores of all asteroids appearing in all possible survey intersections. The number of asteroids in each survey (and in the intersection of each survey pair) appears above each plot ( $N=\ldots$ ). The survey-pair distributions are 2D histograms where higher densities of data points correspond to black pixels/bins and low density or lack of data points is white. Evidently all possible survey combinations contain at least some asteroids (several share hundreds), and in all cases the individual taxonomic indices (on the horizontal and vertical axes) correlate strongly, confirming the consistency of the cluster membership between surveys.

In Figure 27 we illustrate some useful applications of this color index by comparing it with various asteroid surface observations. One of these quantities (SDSS a* color) was used in computing the color index, so its correlation with the clustering index is expected and thus confirmed.

**Figure 27.** Relationship between various asteroid surface measurements (from the UV to near-IR) and this work's visible-color-derived C/S color index. See text for descriptions of the data sets used here, and accompanying references.

**Figure 28.** Relationship between proper orbital elements and this work's visible-color-derived C/S color index for 30,508 asteroids.

**Figure 29.** Fraction of PTF light curves with thermal-IR-based diameter estimates available, for both the reliable-period and reliable-G₁₂ sets of light curves.

In the leftmost plot of Figure 27, asteroid photometry from GALEX²² (NUV band), compiled by A. Waszczak et al. (2015, in preparation), is normalized by the nominal G = 0.15 phase-model (Section 3.2.2) predicted brightness at the time of the GALEX observations, and the resulting NUV–V color evidently correlates with the visible color index. This indicates that asteroid reflectance slopes in the visible persist into the UV.

Figure 27 also plots our color index against the $W1$ -band geometric albedo derived from WISE²³ observations obtained during its fully cryogenic mission. We only include asteroids that were detected in both of the thermal bands ( $W3$ and $W4$ ) and that therefore have a reliable diameter estimate. Use of this diameter in Equation (15) then permits estimation of the albedo, where the $W1$ -band albedos use the corresponding WISE photometry (H in Equation (15) being replaced with the appropriate $W1$ -band absolute magnitude).

The rightmost plot in Figure 27 shows our color index's relationship to a near-infrared color from the ground-based 2MASS (Skrutskie et al. 2006). Serendipitous asteroid detections were extracted from 2MASS by Sykes (2000, 2010) and include fluxes in J band (1.25 μm), H band (1.65 μm—not to be confused with the absolute visible magnitude H, used elsewhere in this work), and K band (2.17 μm).

Figure 28 plots our color index against proper orbital elements retrieved from the Asteroids Dynamic Site (AstDyS; Knežević & Milani 2012), revealing the distinct colors of dynamical families and the overall transition from S to C types with increasing semimajor axis. These are similar to the plots of Parker et al. (2008), which is not surprising given that the majority of the asteroids' color indices are based on SDSS data alone. Of the 32,5023 asteroids with a defined color index, there are 30,508 with proper orbital elements which are represented in Figure 28.

A.2. Compilation of IR-derived Diameters

Similar to how we combined several surveys' colorimetric data in the previous section, here we compile thermal-infrared-derived diameter estimates from four surveys. Our aim is again to provide the largest possible sample for comparison with PTF-derived light-curve data. Just as SDSS is the main contributor of colorimetry overall but suffers from incompleteness for large/bright asteroids, WISE provides the vast majority of IR-based diameter measurements but levels off at ∼80% completeness at the bright end (Figure 29). We thus supplement WISE with diameter data from the Infrared Astronomical Satellite (IRAS; Matson et al. 1986; Tedesco et al. 2002), the Mid-Course Space Experiment (MSX; Tedesco et al. 2002), and AKARI (Usui et al. 2011). Usui et al. (2014) compare several of these different data sets in terms of coverage and accuracy. As we did when defining the color index, asteroids occurring in multiple IR surveys are assigned the variance-weighted average diameter.

Regarding the WISE data in particular, we again use only those diameters that resulted from a thermal fit constrained by fluxes in all four WISE bands during the cryogenic mission. Furthermore, we use the latest (revised) diameter estimates published by Masiero et al. (2014), which adopted an improved thermal modeling technique first discussed by Grav et al. (2012).

A.3. Light-curve Data Tables

The online version of this article includes two electronic tables containing the derived light-curve parameters and the individual photometric observations in each light curve. Tables 4 and 5 describe the columns and formatting of these tables, which include data on all reliable-period light curves, as well as those having amplitudes less than 0.1 mag and sampling in five or more 3-deg-wide phase-angle bins (which have reliable G₁₂ fits). Using these tables, one can produce plots of the PTF light curves we have analyzed in this work.
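As a starting point for working with these files, the sketch below reads a few fixed-width columns using the byte ranges listed in Table 4 (bytes are 1-indexed and inclusive). The column names, the helper functions, and the fake record are hypothetical and exist only to make the example self-contained; the real file should be read line by line in the same way.

```python
# (start_byte, end_byte, converter) for a few Table 4 columns.
COLUMNS = {
    "lc_id": (1, 4, int),              # light-curve ID number
    "asteroid_number": (6, 11, int),   # asteroid number
    "H_mag": (74, 79, float),          # absolute magnitude from G12 fit
    "period_hr": (126, 134, float),    # period from G12 fit
}

def parse_line(line, columns=COLUMNS):
    """Slice one fixed-width record into a dict of typed values."""
    record = {}
    for name, (start, end, convert) in columns.items():
        field = line[start - 1:end].strip()
        record[name] = convert(field) if field else None
    return record

# Build a fake 241-byte record (made-up values) to exercise the parser.
fake = [" "] * 241

def put(start, end, text):
    """Right-justify text into bytes start..end of the fake record."""
    fake[start - 1:end] = text.rjust(end - start + 1)

put(1, 4, "12")
put(6, 11, "1234")
put(74, 79, "13.250")
put(126, 134, "5.4321")
record = parse_line("".join(fake))
```

Records from the observations file described in Table 5 can be sliced the same way and grouped by light-curve ID to rebuild individual light curves.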

Footnotes

7
http://ptf.caltech.edu
8
Bus et al. (2002) review these and other asteroid taxonomic classes, which are defined on the basis of low-resolution ( $R\approx 100$ ) visible reflectance spectra.
9
IAU Minor Planet Center, http://minorplanetcenter.net.
10
The Mould-R filter is very similar to the SDSS-r filter; see Ofek et al. (2012a) for its transmission curve.
11
Principal-axis rotation (a stable equilibrium state) is assumed for most planetary bodies. Burns & Safronov (1973) discuss the relevant timescales of spin evolution.
12
In Shevchenko's original notation, β is denoted b and C is denoted a. Moreover, in the original notation, $\phi (0)=-a$ ; we here added a constant term to make $\phi (0)=1$ , following convention with other phase functions.
13
Also known as the visible albedo or the physical albedo.
14
B&S actually stated the correlation in terms of $\mathrm{log}{p}_{V}$ versus β, though the range in β is sufficiently small that p_V versus β is essentially valid as well.
15
More accurately, the single-scattering albedo w, which is the analog of ${A}_{\mathrm{bond}}$ for a "point-source" particle, more fundamentally embodies this bulk-compositional attribute. Hapke (2012) details how ${A}_{\mathrm{bond}}$ is solely a function of w for an asteroid whose surface consists of isotropic scatterers; we here use ${A}_{\mathrm{bond}}$ as a proxy for w.
16
Rather than attributing it to any specific author(s), we note that Equation (15) may be derived directly using Equation (8) and the following definition of the bond albedo, which we stated in words immediately after Equation (8):

where $V(\alpha )=H-2.5{\mathrm{log}}_{10}\phi (\alpha )$ is Equation (1) evaluated at $\delta =0$ and $r={\rm{\Delta }}=1$ AU.
17
http://www.netlib.org/blas
18
http://openmp.org
19
https://code.google.com/p/randomforest-matlab
20
Another standard, slightly different approach is to evenly split the training data into k disjoint sets (a process called k-folding). Also, our choice to separately partition the two classes into training and test subsamples could be omitted.
21
http://ssd.jpl.nasa.gov
22
The Galaxy Evolution Explorer (GALEX) is a NASA Small Explorer-class space telescope that from 2003 to 2012 conducted an imaging survey in a far-UV band (FUV, 130–190 nm) and a near-UV band (NUV, 180–280 nm). Martin et al. (2005) discuss the extragalactic science program; Morrissey et al. (2005, 2007) discuss the on-orbit performance, survey calibration, and data products. The A. Waszczak et al. (2015, in preparation) NUV data shown here are derived from data available at http://galex.stsci.edu.
23
The Wide-field Infrared Survey Explorer (WISE) is a NASA Medium Explorer-class space telescope that in 2010 conducted a cryogenic IR imaging survey in four bands: $W1$, $W2$, $W3$, and $W4$, centered at 3.4, 4.6, 12, and 22 μm, respectively. Wright et al. (2010) detail mission/performance; Masiero et al. (2011 and references therein) present preliminary asteroid data.
