I. Introduction

In Mapp v. Ohio,1 the U.S. Supreme Court extended the due process protections of the exclusionary rule to include all “constitutionally unreasonable searches” that were done without a basis of probable cause.2 In the seven years after Mapp, when homicide rates in the U.S. nearly doubled,3 riots broke out in at least forty-seven U.S. cities.4 During the same era, a heroin epidemic gripped the nation’s urban centers,5 giving rise to street drug markets and associated violence and pressures on law enforcement to curb those markets.6 As violence increased, a turn in the nation’s political culture questioned Mapp’s restraints on police discretion to stop and search criminal suspects.7 Indeed, some writers wondered if the Mapp standard, with its reliance on the exclusionary rule to deter violations of Fourth Amendment rights, had inflicted social costs on the public through over-deterrence of police, leading to elevated crime rates.8

It was no surprise, then, that after those seven years the Supreme Court in Terry v. Ohio9 “uncoupled . . . the two clauses of the Fourth Amendment” that regulated temporary detentions and searches by police.10 Terry dealt with a different “rubric of police conduct”: the beat officer stopping and patting down an individual on the street, more commonly known as an “investigative stop.”11 The Terry test was (and is) thus to balance the scope of the intrusion against the “specific and articulable facts which, taken together with rational inferences from those facts, reasonably warrant [the] intrusion.”12 Justice Douglas, in dissent, labeled this “reasonable suspicion.”13 Although intended to be a narrow departure from Mapp’s standard, it was in fact a big break from Mapp.14 The Court said that the Mapp rule simply did not fit the realities of street policing in an era of rising crime rates.

Under Terry, the police must articulate specific and individualized indicia of suspicion, and those indicia must be salient enough to justify police action. Hunches by police worried the Terry Court.15 The standards then and now do not really tell a police officer doing modern police work how much suspicion is enough to satisfy constitutional standards, or when the quantity of suspicion reaches a threshold of “reasonableness” to justify the intrusion. That question became even more challenging as a series of opinions inflated the scope of “reasonable suspicion” to include pretextual probable cause stops—often minor traffic violations—that open the door to investigations of other crimes,16 or stops where a suspect’s presence in a “high crime area” multiplies less salient factors into actionable suspicion,17 or facially subjective rationales such as “furtive movements” or other “criminal appearances.”

As the fiftieth anniversary of the Terry opinion approaches, it is more than reasonable to ask whether Terry’s move away from probable cause was original sin—whether the dilution and expansion of standards for an investigative stop over time compromised or advanced the very law enforcement interests that animated the Terry opinion. Two criteria of (constitutional) focus are wrapped up in the “law enforcement interest” doctrine: catching offenders and seizing contraband (hit rates), and controlling crime (crime rates). Whether contemporary and expanded Terry standards can achieve or undermine these interests is the primary question for this paper.

Do these sins pay? Stops in which officers temporarily detain, question, and possibly frisk a person based on the person’s vague or subjectively perceived actions (appearances, movements) may be less efficient in locating contraband or suppressing crime than stops based on actuarial characteristics (locations). But both may be less efficient than stops based on behavioral indicia of crime. In other words, this article asks empirically whether stops based on indicia that approximate probable cause (behavioral indicia that are unambiguously indicative of crime) advance law enforcement interests significantly more than stops based on the more subjective and vague standards that have become commonplace features of contemporary investigative stop programs.

Perhaps these sins pay for only certain types of crime. Terry’s ruling came in the midst of a violent crime spike in the late 1960s through the early 1970s.18 But Terry is now applied broadly for violent and other serious crimes as well as for drug and weapon offenses. And in an era of proactive and “broken windows” policing, minor misdemeanors are theorized as predicates of crime and therefore are indicia of suspicion in and of themselves.19 This leads to the second question for this paper: whether the dilution of standards has differential effects by crime seriousness.

The answers to these questions follow. The first section assesses the doctrinal progression from Mapp to Terry, showing that the original officer safety rationale was eclipsed over time by Terry’s crime control agenda. The second section presents the details of the empirical inquiry on the two research questions. Data on crimes, stops, and arrests from 2004 to 2012 from the Floyd v. City of New York20 litigation are analyzed to address these questions. The data include the bases of suspicion for each stop, and stops are assigned as probable cause or suspicion stops based on the articulated rationale. The third section presents the empirical results. The analyses show significant reductions in crime in neighborhoods (census block groups) with greater numbers of probable cause stops, and ratios of probable cause stops to other stops. The opposite, however, is not evident. Crime neither increases nor decreases in places with higher numbers of non-probable cause stops; those stops simply have no effect on local crime rates. This is an empirical argument about what kinds of police observations of suspicion are indicative of criminal activity, and how acting on those indicia can advance Terry’s public safety agenda.

The final section discusses a set of potential regulatory and doctrinal responses to these results that suggest the application of harm principles to inform the practice of Terry stops that raise privacy and positive liberty interests. This section presents a functional, institutional argument about what kinds of observations of supposedly suspicious activity are susceptible to meaningful review and oversight. It turns out that redemption for Terry’s sins may be close at hand: fleshing out the Terry standard by setting clear rules about what constitutes “reasonable suspicion,” and concretely linking the Terry standard to specific actions indicative of criminal activity will reduce errors in suspicion and better prevent crime.

II. Background

A.    From Mapp to Terry

Bad timing is one factor that led to the shift in standards for investigative stops and searches from Mapp to Terry. Rising violent crime rates through much of the 1960s,21 together with riots in dozens of American cities,22 helped create new social tensions and a legal and policy context in which law enforcement interests eclipsed the restraint on Fourth Amendment violations that was Mapp’s inspiration. As crime rates continued to rise through the next two decades, the focus of Terry’s jurisprudence—and the law enforcement interests that it embodied—shifted from officer safety to public safety. The standards regulating reasonable suspicion, the foundation of Terry, also shifted over time as the public safety interests of Terry hardened and expanded. In this section, the trajectory of this subtle jurisprudential shift is examined, laying the foundation of contemporary Fourth Amendment jurisprudence on the limits of police stop-and-search power.

1. What Mapp did and did not do.

In Mapp v. Ohio, the Supreme Court held that the exclusionary rule applies to state prosecutions.23 The Court had previously held that the exclusionary rule applies to unconstitutionally seized evidence in federal prosecutions in Weeks v. United States.24 But when the Court applied the Fourth Amendment’s warrant clause to the states in Wolf v. Colorado,25 the Court held that the exclusionary rule was not a necessary component of the Fourth Amendment’s protection.26

Mapp thus stands for the proposition that the probable cause requirement is toothless if not backed by a consequence that “remov[es] the incentive to disregard it.”27 The Weeks Court more fully discussed the details of the probable cause requirement.28 In Weeks, the Court held for the first time that the Constitution requires the exclusion from federal criminal prosecutions of evidence obtained without a warrant, issued by a judge, supported by probable cause, and describing the object of the search. Implicit in the Weeks decision is the proposition that the “reasonableness” of a given search under the first clause of the Fourth Amendment is defined by the warrant requirement in the second clause of the amendment.29 More specifically, Weeks said the Fourth Amendment protects individuals against “all unreasonable searches and seizures,” which it then defined as those searches done without a “warrant issued as required by the Constitution.”30

Similarly, Mapp described itself as “extending the substantive protections of due process,” that is, the exclusionary rule, “to all constitutionally unreasonable searches”—those done without a warrant issued on a showing of probable cause.31 The meaning of an unreasonable search was a search conducted without a warrant; the two clauses of the Fourth Amendment were linked.32

2. The limits of exclusion.

Earl Dudley notes that neither the meaning of reasonable suspicion sufficient to justify a frisk nor the meaning of a Terry stop sufficient to justify minor “physical intrusions” is readily apparent from the text of the Terry opinion.33 In Adams v. Williams,34 the first post-Terry decision on stop and frisk, Justice Rehnquist doubled down on the Warren Court’s reasonableness standard by conflating “stops” with protective “frisks,” and applying the same vague reasonableness standard to both levels of intrusion.35

The phrase “reasonable suspicion” comes from Justice Douglas’s dissenting opinion, when he criticized the Court’s departure from the “certainty” and historical grounding of the probable cause requirement.36 Scott Sundby similarly notes, “Chief Justice Warren’s cautious opinion suggests that the use of the reasonableness balancing test was meant to be viewed as a narrow departure from the norm of probable cause.”37 And Stephen Saltzburg argues that, taking the opinion at face value, “the Court would appear to have decided little.”38 Therefore, for courts in the aftermath of Terry, it was not at all clear how to regulate investigative stops in a novel framework of reasonable suspicion.

However, before the Court reached the merits of Terry’s claim, it discussed the exclusionary rule in a highly suggestive way that highlights the break the Court was making from Mapp.39 In effect, the Court said that the exclusionary rule was powerless and irrelevant to the realities of contemporary beat policing.40 First, the Court noted that, for many interactions, “the police either have no interest in prosecuting or are willing to forgo successful prosecution in the interest of serving some other goal,” rendering the exclusion of evidence useless.41 Second, the Court mentioned the risk of “wholesale harassment” of minority groups by police, but again stated that this “will not be stopped by the exclusion of . . . evidence from any criminal trial.”42

Third and most important, the Court stated that applying the exclusionary rule where it is incapable of stopping police abuse “may exact a high toll in human injury and frustration of efforts to prevent crime.”43 This mention of the crime control objective is muted here, but grows in importance in subsequent rulings, and occupies center stage in the Floyd litigation and in replications of the Terry regime in a swath of U.S. cities and across several countries.44 In effect, the Court took a lesser of evils approach where abuses—both of minorities and the boundaries of investigative stops—are tolerated in return for enhanced crime control, a matter that is challenged in the empirics here.

The focus on the limits of the exclusionary rule at the beginning of the Terry opinion reads as a justification for applying a standard other than probable cause to the types of investigative stops upheld in Terry. The Court says that the Mapp rule does not fit the realities of beat policing because it is too slow to account for public safety concerns and because the remedy, exclusion of evidence, fails to correct police misconduct. In effect, the Court minimized the possibility that the suppression of evidence would deter future police misconduct. The Terry standard of reasonable suspicion is thus a vindication of the Court’s public-safety concerns over the trial-focused exclusionary rule of Mapp. Perhaps the Court had a regulatory purpose in mind instead of a deterrence purpose. The Court was optimistic that police could lean heavily on the internal processes and self-discipline of their institutions to do the work that would have fallen within Mapp’s litigation domain. But the Court also discounted the prospect of an inevitable parade of suppression hearings that would follow the shift from the more demanding Mapp standard to the subjective, if not inchoate, standard in Terry.

B.    The Terry Standard and the Regulation of Police: Defining—or Failing to Define—Reasonable Suspicion

The new Terry standards did trial courts no favor: the standards for street stops that could be challenged before them were defined in highly subjective terms. But as the analysis of those standards in this section shows, the Court may have (whether by design or not) mitigated that risk by advancing a framework in which subjectivity was subordinated to a highly proceduralized standard.

1. Terry, investigative stops, and a new set of state intrusions.

By requiring reasonable suspicion for stop and frisks, Terry extended the Fourth Amendment to seizures less intrusive than arrests. In effect, the Supreme Court “uncoupled . . . the two clauses of the Fourth Amendment.”45 The Terry Court did not overrule Mapp: the warrant requirement, backed by the exclusionary rule, still applies to the traditional police-at-the-front-door search of an individual’s dwelling.46 But it is hard not to see Terry as a victory for police because it recovers much of the discretion and, to be frank, the power that had been revoked in Mapp,47 and later in Papachristou v. City of Jacksonville.48

Terry dealt with a different “rubric of police conduct” than did Mapp: the beat officer stopping and patting down an individual on the street.49 In footnote sixteen, the Terry majority went further: it refused to consider the question of whether an “investigative ‘seizure’ upon less than probable cause for purposes of ‘detention’ and/or interrogation” violates the Fourth Amendment.50 Then the Court narrowly defined the notion of “seizure” as instances “when the officer, by means of physical force or show of authority, has in some way restrained the liberty of a citizen.”51

Still, even as the Court “emphatically reject[ed]” suggestions that the stop-interrogate-and-frisk interaction is not subject to the Fourth Amendment,52 the Court just as clearly rejected the notion that probable cause was required for the “limited search for weapons” at issue in the case.53 It is not hard to see the standard for such limited searches metastasizing over time into the Floyd regime in New York, with millions of street searches over a decade producing few guns or contraband.54 Instead, the Court adopted a rule from a case involving administrative searches of homes, Camara v. Municipal Court,55 requiring courts to “balanc[e] the need to search (or seize) against the invasion which the search (or seizure) entails.”56

The Terry test, in turn, balanced the scope of the intrusion (e.g., whether the police officer “patted down the outer clothing” or “conduct[ed] a general exploratory search”) against the “specific and articulable facts which, taken together with rational inferences from those facts, reasonably warrant the intrusion.”57 This move in effect distinguishes Terry from Mapp and all cases involving actual arrests. Because “[a]n arrest is the initial stage of criminal prosecution,” it serves very different social interests than a Terry search, which was designed to protect “the police officer, where he has reason to believe he is dealing with an armed and dangerous individual.”58

Although the majority opinion does not address the stop apart from the frisk or pat-down, a Terry frisk requires reasonable suspicion that the individual being frisked presents a danger to the officer or others at the time.59 This specificity requirement is in contrast with the “inchoate and unparticularized suspicion or ‘hunch,’” which is insufficient to justify the intrusion on an individual’s liberty.60 On the other hand, the Court’s disapproval of “hunches” was tempered by its tolerance of “the specific reasonable inferences which [the police officer] is entitled to draw from the facts in light of his experience.”61 Ultimately, the standard is characterized by the Terry Court as “objective,” and must hold up to the “more detached, neutral scrutiny of a judge” who is to ask whether “the facts available to the officer at the moment of the seizure or the search ‘warrant a man of reasonable caution in the belief’ that the action taken was appropriate.” This approach requires balancing the level of suspicion of danger against the invasion of privacy and autonomy, something that an officer in the moment of an encounter with a citizen may be hard pressed to do.62

After nearly five decades, the Terry standard remains rather opaque.63 In describing the government’s burden, William Stuntz analogizes to a statistical determination:

The threshold is not defined mathematically, but one could easily enough think of it that way, and courts and lawyers basically do think of it that way . . . . (Though, I should quickly add, there is no clear agreement on what the right mathematical line is. Probable cause officially means “a fair probability;” in practice, it means, roughly, more-likely-than-not. Reasonable suspicion plainly requires less than probable cause. A good approximation, then, might be something like a one-in-five or one-in-four chance.)64

Thus, the burden of proof is quite low. If reasonable suspicion requires only an estimated twenty or twenty-five percent chance that a crime is about to occur or has occurred, then a “hunch” (the type of suspicion, with its capacious tolerance for error, that Terry condemns) must rest on a very low probability indeed.
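Stuntz’s approximation can be made concrete with a minimal Python sketch. The numeric cutoffs below are illustrative readings of the quoted passage (“more-likely-than-not” and “one-in-five”), and the function name classify_suspicion is hypothetical; no court has adopted numeric lines of this kind.

```python
# Illustrative thresholds only: an assumed reading of Stuntz's
# approximation, not a rule adopted by any court.
PROBABLE_CAUSE = 0.50        # "roughly, more-likely-than-not"
REASONABLE_SUSPICION = 0.20  # "a one-in-five or one-in-four chance"


def classify_suspicion(probability: float) -> str:
    """Map an estimated probability of criminal activity to a doctrinal label."""
    if not 0.0 <= probability <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    if probability > PROBABLE_CAUSE:
        # Strictly more likely than not.
        return "probable cause"
    if probability >= REASONABLE_SUSPICION:
        # At least the lower, one-in-five bound.
        return "reasonable suspicion"
    # Anything below the assumed one-in-five line.
    return "hunch"
```

On these assumed cutoffs, an officer whose estimate is one in ten is acting on a hunch, while an estimate of one in four clears the Terry threshold without approaching probable cause.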

2. Regulating reasonableness.

In the same part of the opinion that explains the shortcomings of the exclusionary rule as a check on street policing, Terry appears to suggest that the rule remains the primary judicial check. For instance, the Court says that, when applying Terry, if courts identify “over-bearing or harassing” conduct, “it must be condemned by the judiciary and its fruits must be excluded from evidence in criminal trials.”65 Still, the Court here is directing lower courts to focus on conduct during the stop, not the indicia of suspicion that motivated the stop.

The Court in fact excluded evidence on these grounds in Sibron v. New York,66 a companion case to Terry. Both Terry and Sibron came to the Supreme Court as appeals from trial court denials of motions to suppress.67 While the weapon uncovered by the frisk in Terry was held admissible, the Sibron evidence was excluded on grounds that the search was not justified by the protective interest that motivated the officer in Terry. Sibron involved a police officer who searched Sibron after observing him “talking to a number of known addicts.”68 The Court noted that the Terry rule only justified limited frisks where particular facts support an inference of danger to the officer, and that merely speaking with known addicts supports no such inference.69 The Court went further, distinguishing the Sibron search from the Terry search, which initially “consisted solely of a limited patting of the outer clothing of the suspect for concealed objects,” with the officer “plac[ing] his hands in [Terry’s] pockets” only after “discover[ing] [a concealed] object.”70

The Terry-Sibron comparison illuminates the Court’s concerns about individual dignity. In Terry, the Court justified its application of the Fourth Amendment to frisks (but not stops) because it found a frisk to be “a serious intrusion upon the sanctity of the person.”71 The Court addressed this intrusion by requiring a balancing between the level of intrusion and the level of suspicion. The harm to a person’s dignity from the intrusion of a frisk may in fact be greater than the harm to dignity resulting from a full search incident to arrest. The reason may lie in the difference in the standard for a frisk versus a search: a full search incident to arrest requires probable cause, which suggests specific behavioral and inherently concrete indicia of suspicion. A frisk, in contrast, need only be justified by the more subjective and inchoate standards of reasonable suspicion.

3. How much reasonable suspicion? What indicia?

Both in the run-up to Terry and in the decades after, courts never developed a constitutional consensus as to how much suspicion is needed to give rise to reasonable suspicion.72 Nor are there substantive indicia to prioritize or weigh which behaviors or factors matter; the courts have said only that these indicia must be reasonable. Some courts have argued for a test based on the efficacy of stops in detecting crime or locating contraband, but here too, there is no agreement on what constitutes an acceptable “hit rate” that satisfies the reasonableness standard across cases. In Navarette v. California,73 for example, Justice Antonin Scalia, dissenting, suggested that at least five, if not ten, percent of the entire universe of incidents would need to be an accurate “hit” to be indicative of reasonable suspicion. According to Scalia, absent such a showing, the basis of suspicion is not reasonable without further information.74 A similar outcomes test was considered in Floyd to claim that the police were so often wrong in the bases of suspicion for their stops that those bases were categorically faulty.75
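The arithmetic of such an outcomes test is simple, as the following Python sketch suggests. The five and ten percent floors are taken from the figures attributed to Justice Scalia above; the function names and any counts supplied to them are hypothetical illustrations, not data from Floyd or any other source.

```python
def hit_rate(stops: int, hits: int) -> float:
    """Share of stops that produced an arrest or a seizure of contraband."""
    if stops <= 0:
        raise ValueError("stops must be a positive count")
    if not 0 <= hits <= stops:
        raise ValueError("hits must lie between 0 and the number of stops")
    return hits / stops


def meets_floor(stops: int, hits: int, floor: float = 0.05) -> bool:
    """Check a basis of suspicion against an assumed minimum hit rate.

    The default floor of 0.05 and the alternative of 0.10 mirror the
    five and ten percent figures discussed in the text; neither is a
    settled legal rule.
    """
    return hit_rate(stops, hits) >= floor
```

Under these assumptions, a hypothetical basis of suspicion that yields 30 hits in 1,000 stops (three percent) fails even the lower floor, while one that yields 120 hits in 1,000 stops (twelve percent) clears both.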

But after nearly five decades of Terry, courts have rejected a substantive review of the criteria of “reasonable suspicion.” Instead, courts have consistently decided cases based on some rendering of the reasoning of the officers at the scene (based on a post-hoc account) pursuant to a specific fact, and whether that reasoning was, well, reasonable to an experienced officer.76 But it gets worse. Until recently, a series of cases required that the basis of the information on which reasonable suspicion was determined be reliable.77 But in Navarette, the Court largely abandoned the reliability doctrine by holding that an anonymous 911 call without any corroboration meets a test of reasonable suspicion to justify a stop and seizure.78

Under the real-time demands of police work, and with little oversight to correct misapplication of the perceptual and reasoning processes, the articulation of suspicion often defaults to behavioral scripts that are matched to fill in the empty cognitive spaces in the actual bases of suspicion.79 In three out of four street stops in New York City, for example, police observe a suspect for less than two minutes before proceeding to what New York state law80 defines as an “intrusion.”81 The stop requires officers to perform a quick perceptual and cognitive sorting of complicated and highly contextualized information that shapes the initial evaluation of suspicion. As the interaction unfolds, this sorting is modified and narrowed through interactions and exchanges between the suspect and the officer(s). After all this, the officer then retreats to an unspecified location under uncertain conditions to record the reasons for the encounter, reasons that may have taken place and been cognitively encoded an hour or more before.

It is no wonder that police officers may default to a script. But even with the handy crutch of a script, the cognitive burden to both articulate the reasons for the suspicion, and how those reasons got beyond a (not well articulated) threshold to take action, leaves a wide space for error in perceptions, weighing, and decision-making.

The configuration of Terry and its progeny simply leaves open the question of what factors meet the test of articulable and individualized suspicion. These cases continue, as did Terry itself, the fiction that there is a threshold of suspicion that renders police action constitutionally permissible. Suspicion in this formulation thus becomes a hurdle model, or a binary category, in which the stop is either constitutional or not.82 Courts worry more than the police about whether there is enough suspicion to get over that hurdle and satisfy the “individualized” suspicion test. And the elasticity of the Terry standards complicates the job of courts to regulate those decisions.83 Officers are left to the extremes of roll call training on the one hand and litigation challenges on the other to define a space in which their actions comport with the shifting territory of the Fourth Amendment.84

C.    Terry’s Crime Control Agenda

Terry’s original sin took two forms. First, the majority created the reasonable suspicion standard that allowed subjective assessments of suspects’ behavior to substitute for the more demanding standard of probable cause. This was done, as discussed earlier, in the interest of protecting officers from harm. The Terry Court declined to articulate clear standards of suspicion, defaulting to the professional “experience” and judgment of the officer.85 The second sinful act was the doctrinal shift over time from the original officer safety rationale to permitting reasonable suspicion stops in the interest of crime control. This section examines the evolution of this second sin, and describes the rationale for modern Terry practice.

1. Terry’s hidden crime control agenda.

A first reading of the majority opinion in Terry suggests that it had little to do with crime control, and everything to do with the safety of police officers in conducting investigative stops or field interrogations.86 The Court seemed to be well aware that it was making a trade-off: in allowing “something less than a ‘full’ search” at a new and relaxed standard of “reasonable suspicion,” the Court held that the Terry stop “must be limited to that which is necessary for the discovery of weapons which might be used to harm the officer or others nearby.”87

At first glance then, the Terry Court’s concern seemed to be less about public safety generally than about the safety of the officer when approaching and questioning individuals like John Terry. The President’s 1965 Commission on Law Enforcement and Administration of Justice reported that “[c]ommission observers of police streetwork in high-crime neighborhoods of some large cities report that 10 percent of those frisked were found to be carrying guns, and another 10 percent were carrying knives.”88 But the report did not mention officer injuries or deaths in routine contacts as posing the danger that drove the Terry ruling.

In fact, the only evidence the Terry Court cites about the dangers of policing is a reference to the same 1965 Presidential Commission and is contained in a footnote, worrying that frisks often exacerbate tensions between the police and minority groups.89 In the end, the Court does not tie the crime-control or officer-safety aspects of the opinion to any evidence. In the years between Mapp and Terry, officer deaths rose as overall rates of violent crime rose, but the Terry Court made no note of this.90

But in fact, crime control and public safety were as much on the minds of the justices as was officer safety. Two concurring justices and one dissenting justice alluded more directly to a general crime-prevention rationale for the Terry holding than did the Court’s majority opinion. First, Justice Harlan, who rejected the Court’s effort to decouple the “frisk” issue from the “stop” issue, concurred to make clear that, to him, the stop was constitutional “only because circumstances warranted [the officer] forcing an encounter with Terry in an effort to prevent or investigate a crime.”91 Thus, for Justice Harlan, the intrusion of the stop and frisk needed the crime-prevention rationale to survive constitutional scrutiny. Although Justice Harlan described his opinion as merely “fill[ing] in a few gaps” in the majority opinion, the reference to preventing crime more broadly than the potential harm to the police officer conducting the frisk nowhere appears in the Court’s opinion.92

Second, Justice White, in the second paragraph of his two-paragraph concurrence, provides “an additional word . . . concerning the matter of interrogation during an investigative stop.”93 Like Justice Harlan, Justice White emphasizes the link between “temporary detention” and the frisk that the majority opinion sought to avoid. But he went further than Justice Harlan to speculate about a possible crime-prevention benefit of frisks that fail to uncover any weapons: “Perhaps the frisk itself, where proper, will have beneficial results whether questions are asked or not. If weapons are found, an arrest will follow. If none are found, the frisk may nevertheless serve preventive ends because of its unmistakable message that suspicion has been aroused.”94 The two concurrences bookend two different theoretical supports for investigative stops: while Justice Harlan appears to have identified the crime-prevention rationale based on police intervention before a crime is committed, Justice White apparently saw a general crime-prevention effect in the failure to uncover weapons as a way to educate citizens about what police officers find suspicious.

Justice Douglas dissented alone in Terry. He was implacable about the necessity of the probable cause requirement for a temporary detention and a frisk or pat down of a suspect.95 In passing, however, he acknowledged—noting the escalating crime rates in the years after Mapp—that “[p]erhaps [the Terry rule] is desirable to cope with modern forms of lawlessness.”96 This seems an oblique reference to a general crime-prevention rationale that goes beyond the narrower interest in officer safety enunciated by the majority.97

In the few states that developed doctrine that differed from Terry, the controlling opinions also incorporated both crime control and officer safety prongs.98 In People v. De Bour,99 officer safety was a less pressing concern than was the broader public safety impetus for the pursuit and frisk of the suspect. The New York State Court of Appeals upheld the introduction into evidence of a gun discovered when police officers asked a man to unzip his jacket.100 The court held the encounter was a “legitimate . . . inquir[y] as to [De Bour’s] identity” because it was without “harassment or intimidation,” “brief,” involved prevention of the “serious crime” of narcotics, “occurred after midnight in an area known for its high incidence of drug activity,” and because “De Bour had conspicuously crossed the street.”101

The crime control function of Terry stops is one of a number of governmental interests that could potentially authorize a stop, but case law after Terry focused increasingly narrowly on violations of criminal law as the primary government interest.102 For example, in United States v. Brignoni-Ponce,103 the Court extended the rationale and basis of Terry stops to a broader government interest: immigration control by roving patrol “to prevent the illegal entry of aliens at the Mexican border” such that warrantless seizures, based on reasonable suspicion, could be used.104 In United States v. Martinez-Fuerte,105 the Court found checkpoints within 100 miles of the border to be reasonable even without an element of suspicion in the stopping of cars.106 These border-control searches could comfortably be described as crime control measures, and the Supreme Court accepts the governmental interest of preventing illegal immigration.107

2. In plain sight: Terry’s explicit crime control strategy.

After Adams v. Williams,108 which affirmed Terry while blurring the line between stops and protective frisks, the Court proceeded to incrementally extend the constitutionality of Terry stops beyond the narrow governmental interests of officer safety or border control, and, in so doing, affirmed its crime control rationale. In Michigan v. Summers,109 the Court extended Terry’s reach to investigative stops incident to a search warrant.110 Here, a division opened within the Court between those who would restrict Terry stops to the already authorized safety (or border control) exceptions and those who sought to extend Terry in favor of police action more closely in tune with a crime control perspective. The dissenters in Summers favored the narrower view. Justice Stewart, joined by Justices Brennan and Marshall, explained that “some governmental interest independent of the ordinary interest in investigating crime and apprehending suspects” must be important enough “to overcome the presumptive constitutional restraints on police conduct.”111 The majority, however, used the border-control-search cases to demonstrate that the exception for limited intrusions that may be justified by special law enforcement interests is not confined to the momentary, on-the-street detention accompanied by a frisk for weapons involved in Terry and Adams. . . . Most obvious is the legitimate law enforcement interest in preventing flight in the event that incriminating evidence is found. Less obvious, but sometimes of greater importance, is the interest in minimizing the risk of harm to the officers.112

Over time, the Court further extended Terry’s subtext authorizing investigative stops as a crime-fighting tool, each time increasing the scope of their permissible contexts. For example, in Michigan v. Long,113 the Supreme Court held that seizure of non-weapon contraband during a weapons search of a vehicle did not violate the Fourth Amendment.114 Minnesota v. Dickerson115 extended Long to contraband found by touch during a pat down for weapons.116 And a further extension of the weapons search rationale came in Maryland v. Buie,117 where the Court authorized a “protective sweep” of an individual’s house where he was arrested pursuant to a warrant. In Hayes v. Florida,118 the Court authorized, in principle, officers in the field to take fingerprints incidental to Terry stops if the officers had a reasonable suspicion that the individual had committed a crime.119

The final stages of Terry’s expansion to crime control are evident in the pretextual stop authorized in Whren v. United States,120 which permits an investigative stop and search once an officer has probable cause to believe that any crime has occurred, no matter how trivial.121 In that case, the Court not only refused to take into account the subjective motivation of the narcotics officers (who had used a traffic infraction as a pretext to stop suspected drug traffickers); it also refused to consider the argument that an objectively reasonable officer, “acting reasonably,” would not have made the stop “for the reason given.”122 Nor did the Whren Court consider the racial lopsidedness of the incorporation of Terry stops in the practices cited in Whren.123 And in Illinois v. Wardlow,124 a suspect’s presence in a “high crime area” was validated as a multiplier of less salient factors into actionable suspicion, including facially subjective rationales such as furtive movements or other criminal appearances.125 Yet neither the Wardlow majority nor any subsequent case attempted to standardize the parameters of a “high crime area,” completing the subjectivization of what Terry had launched three decades earlier.126 Judge Alex Kozinski, dissenting in United States v. Montero-Camargo,127 summed up the Wardlow challenge in the same year: “Just as a man with a hammer sees every problem as a nail, so a man with a badge may see every corner of his beat as a high crime area.”128

3. Modern Terry doctrine.

This trajectory of cases suggests that, from the initially delimited weapons search in Terry, intended to protect police, the Court has extended the reasons for which Terry stops may be conducted well beyond their original boundary: law enforcement can use Terry stops to investigate future, ongoing, and past crimes; to check identity, even through fingerprinting; to search a car for identification; to search residences for contraband and weapons; and to search luggage, regardless of the presence of the owner.129 By 1986, the Court had stopped discussing the government’s justification for a stop-and-search in terms of broader government interest or of officer safety. By that year, for example, the most detailed articulations of the crime control functions of a Terry stop were general statements such as that in Terry itself: “effective crime prevention and detection.”130 Or, as in United States v. Hensley:131 “solving crimes and bringing offenders to justice.”132 Or, as in Florida v. Royer:133 questioning related to “the suppression of illegal transactions in drugs or of any other serious crime.”134 Finally, in Floyd v. City of New York, the trial court noted that the conduct of Terry stops as part of a crime control “program,”135 suggestive of what Justice Marshall in Florida v. Bostick136 called a “dragnet,”137 violated the original intent of Terry: to conduct investigative stops to identify imminent or ongoing crimes based on articulable bases of suspicion.

The most recent expansion of Terry’s doctrine was actually not about the parameters of suspicion, but addressed the Fourth Amendment regulation of those boundaries, and whether a violation of reasonable suspicion can even trigger Fourth Amendment relief. In Utah v. Strieff,138 the Court held that the exclusionary rule did not apply to evidence discovered after an unlawful stop that turned up an outstanding arrest warrant. The most important feature of the Court’s opinion was its admission of evidence that was obtained by plainly unconstitutional conduct. Officer James Fackrell stopped Edward Strieff as he was leaving a residence that Fackrell believed was a drug selling location.139 “Over the course of about a week, Officer Fackrell conducted intermittent surveillance of the home. He observed visitors who left a few minutes after arriving there. These visits were sufficiently frequent to raise his suspicion that the occupants were dealing drugs.”140 Fackrell’s conclusions about the illegal activity at that spot were based on an anonymous call to a “drug-tip line” and Fackrell’s own personal experience.141 Once he had stopped Strieff, Fackrell discovered that Strieff had a “small” outstanding arrest warrant for a traffic violation.142 Conducting a search incident to arrest, Fackrell discovered drug paraphernalia and amphetamine in Strieff’s pockets.

The Strieff Court recognized that the stop was unconstitutional.143 So did the Utah Supreme Court, which had nullified the arrest on the drug charges.144 But, because that conduct was (in the eyes of the Court) neither intentional nor flagrant, the evidence was admitted. The Court applied an attenuation doctrine that severed the police conduct from the causal chain between the stop and the seizure, and so allowed the evidence to stand. The decision seems to go in two directions at once. The Court recognized that the discovery of the warrant was unforeseeable: there are no behavioral indicia that someone may have an outstanding warrant, nor was that condition noted in prior cases as a sign, as the Terry Court required, that “crime is afoot.”145 But the Court also wanted to allow the reasonableness of the stop and warrant check, despite the fact that the discovery of an outstanding warrant was unforeseeable. It is rare, except in extraordinary circumstances as in the Ferguson investigation,146 to discover an outstanding warrant during a routine pedestrian or traffic stop.147 In dissent, Justice Sotomayor characterized the warrant check as “part and parcel of the officer’s illegal ‘expedition for evidence in the hope that something might turn up.’”148 Perhaps most important, the attenuation doctrine applied by the Strieff Court essentially scrubs out reasonableness from the Terry formula.

Justice Sotomayor goes further, arguing that this is hardly the “isolated” event that the Strieff majority claims.149 She describes the same decades of expansion of the Terry logic to justify widespread investigative stops of both pedestrians and vehicles,150 and the risks of humiliating intrusions and abuses during these now routine contacts.151 She goes on to describe the racial skew in the risks of these contacts, describing a “double consciousness” of race and criminality that is instantiated in black and Latino youths.152

D.    Gains and Losses After Terry

The majority of opinions of the courts in the stop-and-frisk cases that followed Terry, as well as recent legal scholarship, argue that “Terry’s regime of stop-and-frisk may well be critical to the fight against violent crime. For that reason, the law enforcement benefits of Terry seem substantial, and the intrusion on liberty that it authorizes seems relatively limited.”153 Yet there has been remarkably little empirical analysis of Terry’s crime control contributions. This crime control agenda, and its claims of efficacy in reducing crime, provide the rationale to test Terry’s effects on crime.

This essay starts with the notion that, searching for a crime control rationale to justify a broad standard for police intrusions via street stops, Terry’s original sin was forgoing a probable cause standard for investigative stops and substituting an inchoate standard, a standard that is inherently subjective and prone to cognitive distortion, bias and error. Somewhere between that elastic Terry standard in practice today—a practice that often instantiates into policy and program the hunches that so worried the Terry Court—and Mapp’s probable cause standard, lies a threshold of suspicion that can do three things: avoid the petty indignities that have become commonplace in the “new policing,”154 avoid the burdens on the innocents of inefficient stops and intrusions that consume both police resources and citizen trust,155 and contribute substantially to Terry’s crime control agenda. The empirical data in this paper seek out that threshold.

III. Empirical Details

A.    Data and Measures

Data produced in the litigation in Floyd v. City of New York were re-analyzed to address these questions.156 The study period was 2004 through 2012, a lengthy interval to examine trends by month that were sensitive to changes in police stop-and-frisk practices. The data included geocoded records of each stop, which were aggregated to generate counts of stops within police precincts and census block groups for each month.157 The stop data also included police reports of the crime suspected in each stop. These included 133 codes that were reduced to seven categories that reflected the crime categories of interest in the policy debate in New York on the stop regime.158 Crime counts were estimated from crimes reported to the police and geocoded to the nearest street block. These crime reports were then aggregated to generate counts of suspected crimes within precincts and census block groups for each month. The rationales for these units of analysis are discussed infra. The classification categories are shown in Appendix A.

Census data from the 2008 American Community Survey (the midpoint of the time series) were used to generate an empirical description of the social, economic, and demographic conditions for each census block. Although the use of a single time point omits changes in the economic and demographic characteristics of these census blocks during the time-period, only a small portion of the 6475 census block groups were changing dynamically during this interval. I address the effects that temporal trends in areas could have on my estimates by including a linear time trend for each police precinct-month. Precincts are administrative units encompassing census block groups and are substantively important, as this is the spatial unit where uniformed police officers are assigned, and crime control strategies are implemented and managed.

The nine law-defined categories of suspicion that police marked on each stop form were used to state the bases of reasonable suspicion for each stop.159 The boxes included affirmative stop rationales plus an option to check “other” and record the specifics by hand.160 The nine rationales incorporated a set of behavioral categories based on both state and federal case law that would survive a Fourth Amendment test for the individualized stop rationales.161 Officers could check as many boxes as needed to express the basis for the stop. Table 1 lists the categories available for officers to mark the bases of suspicion. In about ninety-five percent of the stops from 2004–2012, officers checked from one to six factors, creating 60,459 possible combinations that express the bases of suspicion for this subset.

Table 1. Specific Stop Circumstances and Percent Based on Each Factor

Furtive Movements
Other Stop Circumstance
Evasive Actions
Fits Description
Carrying Crime Objects in Plain View
Drug Transaction
Suspicious Bulge
Actions Indicate Violent Crime

N = 4,575,787

Note: The total exceeds 100 percent due to multiple stop factors indicated per incident.

Source: NYPD Stop-and-Frisk Database, various years.


Three of the nine factors describe observable suspect behaviors that approximate criminal activity: (1) actions indicative of engaging in drug transaction, (2) actions indicative of violent crimes, or (3) “casing” victim or location.162 These factors on their face approximated a probable cause basis for a Terry stop. Each factor is narrow and behaviorally specific, avoiding the vagueness and subjectivity that worried the Terry Court163 and that has translated into recurring constitutional challenges based on Fourth Amendment violations.164 We have only vague ideas about how police discretion is managed in deciding who to stop, and even less information on what exactly they are looking for when they think an action or person looks suspicious.165 While there may be no algorithm to explain how police determinations of suspicious behavior are formed, there are at least observable patterns. The worry in this regime is about unconscious patterns, often racialized, that shape the formation of suspicion based on archetypes such as the “symbolic assailant” and other processes that shape cognition and interpretation of behavioral cues.166 Symbolic cues are clearly problematic, as they have no legal justification.

Judicial opinions make clear that stops based on observations of actions indicative of criminal behavior are constitutional. Actions indicative of a drug transaction that can survive a prima facie claim of probable cause include observed exchange of currency or an object that might contain drugs.167 Some case law suggests that these actions are indicia of criminal drug transactions only if they take place in a “drug-prone” location, although courts have never clarified the meaning of a “drug-prone” location.168 For example, the Strieff Court questioned the formation of suspicion by Officer Fackrell that a particular residence was a “drug location” based on an anonymous tip to a “drug-tip” line about “narcotics activity at a particular residence.”169 Other courts have ruled that a suspected drug transaction with specific behavioral indicia may justify a field interrogation but, absent other factors, cannot justify a frisk.170

Although “casing” can describe a number of different and potentially innocuous behaviors, actions legitimately indicative of casing either a victim or a location can justify a stop and frisk.171 Reasonable suspicion that a person may have been involved in a violent crime can support a stop and frisk, even without other evidence of actual violent or otherwise dangerous behavior.172 So too can threats of violence.173 Still, actions short of behavioral indicia of imminent or ongoing violence run the risk of vague and subjective interpretation by police contemplating a stop, and courts have urged caution in making the leap from “furtive movements” or “evasive actions” to violent crime.174

The behavioral grounding of these three categories provides little room for cognitive error or perceptual distortion, and is consistent with state and federal case law on probable cause.175 In addition, courts have said that observed criminal behavior is sufficient on its own to justify a police stop.176 In contrast, the other six categories of suspicion in these data require subjective judgments and attributions of intent: (1) furtive movements, (2) fits descriptions, (3) carrying objects in plain view, (4) suspicious bulge, (5) evasive actions, or (6) “other.”177 In contrast to observations of specific criminal activity, these factors are vulnerable to cognitive bias and error, as well as racialized attributions of suspicion or criminality.178

By hiving off the three categories of suspicion that are closer in meaning to a probable cause standard, the empirical strategy here is to determine how the use of these three categories of stops influences crime rates, net of other social and crime conditions. I estimate the number of probable and non-probable cause stops in each census block group each month to assess their separate and combined effects on crimes in later months.
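The classification and aggregation step described above can be illustrated with a short Python sketch. It is only a sketch under stated assumptions: the field names (`block_group`, `month`, `factors`), the factor labels, and the rule that a stop counts as a “probable cause” stop when at least one of the three behavioral factors is checked are hypothetical choices made for illustration, and they are not taken from the Floyd data files.

```python
from collections import defaultdict

# Hypothetical labels for the three behaviorally specific factors.
PC_FACTORS = {"drug_transaction", "violent_crime", "casing"}


def summarize_stops(stops):
    """Count total and PC stops per (block group, month) and compute the PC share.

    Assumption: a stop is treated as a PC stop if ANY of the three
    behavioral factors is among the boxes the officer checked.
    """
    counts = defaultdict(lambda: {"total": 0, "pc": 0})
    for stop in stops:
        key = (stop["block_group"], stop["month"])
        counts[key]["total"] += 1
        if PC_FACTORS & set(stop["factors"]):
            counts[key]["pc"] += 1
    return {
        key: {**c, "pc_share": c["pc"] / c["total"]}
        for key, c in counts.items()
    }


# Made-up records, for illustration only.
sample_stops = [
    {"block_group": "A", "month": "2010-01", "factors": ["furtive_movements"]},
    {"block_group": "A", "month": "2010-01", "factors": ["casing", "furtive_movements"]},
    {"block_group": "A", "month": "2010-02", "factors": ["drug_transaction"]},
    {"block_group": "B", "month": "2010-01", "factors": ["suspicious_bulge", "evasive_actions"]},
]

summary = summarize_stops(sample_stops)
for key in sorted(summary):
    print(key, summary[key])
```

Keying the counts on the block group and month pair mirrors the unit of observation used in the regressions; a different classification rule (for example, requiring that only behavioral factors be checked) would change the PC counts.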

B.   Empirical Strategy

All statistical models were estimated as Poisson regressions with standard errors clustered by block groups to control for unmeasured variation and correlation within block groups.179 The regressions include a measure of the total stop, question, and frisk activity (SQF) per month and a measure of the subset of stops based on “probable cause” justifications. A separate parameter for each month-precinct is included to control for separate trends within the larger precinct units in which block groups are nested. Precincts are relevant as the management unit to supervise officers and deploy them to locales within the precincts. Assignments of officers change each month within precincts, if not more frequently, based on decisions made by precinct commanders.

The model takes the form:

$$Y_{i,p,t} = \mu_i + \lambda_{p(i),t} + \beta_1 D_{i,p,t} + \beta_2 P_{i,p,t} + \beta_3 S_{i,p,t} + \beta_4 (D \times P)_{i,p,t} + \beta_5 (D \times S)_{i,p,t} + \beta_6 X_i + \varepsilon_{i,p,t} \tag{1}$$

where $Y_{i,p,t}$ is the number of crimes in block group $i$ located in precinct $p$ in month $t$, $\lambda_{p(i),t}$ is a measure of the crime rate in the block group the month before, $D$ measures the number of stop factors indicated in stops, $P$ measures the percent of probable cause stops, and $S$ measures the total number of stops made in the block group in a month. For this model the parameter ($\beta_3$) for $S$ is constrained to equal one, so that $P$ and $D$ become rates per overall stop in each block group-month.180 The regression model also includes a time trend for the month-precinct. An interaction ($D \times P$) between the number of probable cause stops and the average total of stop factors indicated in each stop is also included.
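To make the mechanics of a Poisson regression with a constrained stop-count term more concrete, the following Python sketch fits a deliberately reduced model by Newton-Raphson. It rests on several assumptions that are not drawn from the article: the constraint on the stops coefficient is read as an exposure offset (the logarithm of stops enters with a coefficient fixed at one), only one predictor is included, and the data are synthetic and noise-free so that the known coefficient is recovered. The fixed effects, controls, interaction terms, and clustered standard errors of the full model are omitted, and every name here is hypothetical.

```python
import math


def fit_poisson_with_exposure(counts, x, exposure, max_iter=50, tol=1e-12):
    """Fit log E[count] = b0 + b1 * x + log(exposure) by Newton-Raphson.

    The exposure term has its coefficient fixed at one, so b1 describes
    the change in the rate per unit of exposure.
    """
    b0 = math.log(sum(counts) / sum(exposure))  # start at the overall rate
    b1 = 0.0
    for _ in range(max_iter):
        mu = [e * math.exp(b0 + b1 * xi) for xi, e in zip(x, exposure)]
        # Score vector (gradient of the Poisson log-likelihood).
        g0 = sum(y - m for y, m in zip(counts, mu))
        g1 = sum((y - m) * xi for y, m, xi in zip(counts, mu, x))
        # Information matrix (negative Hessian), a symmetric 2 x 2 matrix.
        h00 = sum(mu)
        h01 = sum(m * xi for m, xi in zip(mu, x))
        h11 = sum(m * xi * xi for m, xi in zip(mu, x))
        det = h00 * h11 - h01 * h01
        # Newton step: inverse information matrix times the score.
        step0 = (h11 * g0 - h01 * g1) / det
        step1 = (h00 * g1 - h01 * g0) / det
        b0 += step0
        b1 += step1
        if abs(step0) + abs(step1) < tol:
            break
    return b0, b1


# Synthetic, noise-free illustration: expected counts generated from known values.
TRUE_B0, TRUE_B1 = math.log(0.5), -0.3
pc_share = [i / 10 for i in range(10)]    # hypothetical share of PC stops
stops = [5, 8, 3, 10, 4, 6, 7, 2, 9, 5]   # hypothetical exposure (total stops)
expected_crimes = [
    s * math.exp(TRUE_B0 + TRUE_B1 * p) for p, s in zip(pc_share, stops)
]

b0_hat, b1_hat = fit_poisson_with_exposure(expected_crimes, pc_share, stops)
irr = math.exp(b1_hat)  # exponentiated coefficient = incidence rate ratio
print(f"b1 = {b1_hat:.4f}, IRR = {irr:.4f}")
```

Exponentiating the fitted coefficient gives the incidence rate ratio, which is the quantity reported in the results tables. In applied work a statistical package would normally be used, since it also supplies the clustered standard errors that this sketch leaves out.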

In this model, X represents a set of control variables measuring local social conditions, including racial composition (percent black, percent Hispanic), poverty, age structure, immigrant concentration, average educational attainment, and the housing vacancy rate. These are measured for the midpoint of the time series, 2009, using census data from the American Community Survey’s five-year estimates.181 I also control for block groups in low-crime and low-population business districts, where cues of suspicion leading to stops may be more likely to be formed based on observations of individuals’ behaviors and not priming from the local neighborhood context.182

The initial specifications estimate effects with lags of two months and leads of two months. This empirical strategy allows me to estimate the effects of stops net of the threats of reverse causation. The forward lag, or lead, tests the sensitivity of the results against spuriousness owing to temporal order183 or the possibility of regression to the mean.184 Blocks may receive an increase in overall stops due to a recent crime spike, so that mean reversion would lead to upwardly biased estimates in the regressions of monthly crime rates. To test for residual effects of stops on crime over a longer time-period the models are also estimated with six-month leads and lags.
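The lag and lead construction described above can be sketched in a few lines of Python. This is an illustration under assumptions rather than the procedure actually used: the panel layout, the variable names, and the choice to drop months where either the lag or the lead is unavailable are all hypothetical.

```python
def shift(values, k):
    """Shift a monthly series by k positions.

    k > 0 gives a lag (position t holds the value from month t - k);
    k < 0 gives a lead (position t holds the value from month t + |k|).
    Positions with no available value are filled with None.
    """
    n = len(values)
    return [values[t - k] if 0 <= t - k < n else None for t in range(n)]


def build_rows(panel, k=2):
    """Pair each month's crime count with PC stops k months earlier and later."""
    rows = []
    for block, series in panel.items():
        lagged = shift(series["pc_stops"], k)
        led = shift(series["pc_stops"], -k)
        for t, crimes in enumerate(series["crimes"]):
            if lagged[t] is None or led[t] is None:
                continue  # edge months lack either the lag or the lead
            rows.append({
                "block_group": block,
                "month_index": t,
                "crimes": crimes,
                "pc_stops_lag": lagged[t],
                "pc_stops_lead": led[t],
            })
    return rows


# Made-up six-month series for one block group.
panel = {"A": {"pc_stops": [1, 2, 3, 4, 5, 6], "crimes": [9, 8, 7, 6, 5, 4]}}
rows = build_rows(panel, k=2)
for row in rows:
    print(row)
```

The lagged term carries the substantive estimate, since stops precede the crimes they are meant to affect. The lead term works as a check: stops made in a later month cannot have caused earlier crime, so a strong lead coefficient would point to reverse causation or mean reversion.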

IV. Results

There are 6,495 census block groups in New York City, as of the 2010 decennial census.185 Table 2 shows descriptive statistics for the city’s block groups. The average residential population is 1,348.9 persons located in land areas averaging 0.047 square miles.186 The racial and ethnic population characteristics suggest the diversity of the city’s population, with percent Non-Hispanic white, Non-Hispanic black, and Hispanic populations nearly equally distributed. Non-Hispanic whites are a plurality at 33.3%. Asians comprise 12.6% of the city’s population, the majority of the “Other Race” group of 15.3%. More than one in three adults over the age of twenty-five has a college degree or post-graduate study, and about one in five (22.4%) did not graduate from high school. One in five (21.6%) households live below the federally defined poverty threshold. Median per-capita income is $30,718 per year. The housing vacancy rate, a correlate of crime,187 averages 9.1% of total housing units in the block group.

Table 2. Block Group Descriptive Statistics

Std. Dev.

Total Population
Racial and Ethnic Composition
     % White NH
     % Black NH
     % Hispanic
     % Other NH
Highest Education
     % > High School Grads
     % College Degree +
     Vacancy Rate
Income and Poverty
     % Public Assistance
     % Below Poverty
     Per Capita Income ($)
     Stops per Month
     PC Stops per Month
Crime per Month
     Violent - Felony
     Property - Felony
     Drug Crimes
     Other - Felony
N of Block Groups

Source: American Community Survey, 2006-10 Estimates;
New York City Police Department, Crime Complaints, various years;
New York City Police Department, Stop and Frisk Data, various years.

Indicia of crime and enforcement also show considerable range and skew by census block group. Monthly crime counts appear low at first glance, but when aggregated across census block groups each month, the crime counts add up. The standard deviations again show the skew in these crime counts. Terry stops per month average 4.1 stops, with a standard deviation of 9.3. Of those stops, about half (46.1%) fit the definition of “probable cause” stops (hereinafter PC stops), with a standard deviation of 4.4. Figure 1 shows the distribution of PC and “non-probable cause” stops (hereafter NPC stops) per month over the nine-year period. PC stops were less frequent than NPC stops for nearly every month in the time series until May 2012. In that month, the counts of PC and NPC stops evened out, and the total number of stops declined sharply. The onset of the decline coincided with a class certification ruling in the Floyd litigation that allowed the litigation to proceed to trial.188



The specific stop circumstances that officers mark down for each stop were shown earlier in Table 1. Of the 46.1% that are classified as PC stops, more than half are based on suspicion of “casing,” the same circumstance that animated the 1963 stop of John Terry.189 A judgment that a suspect is casing a person or a location requires a subjective assessment and interpretations of specific behaviors that may be precursors of a crime. Of the three categories of PC stops, this is the most subjective. Suspicion under this category that rises to the level of action by an officer should require a lengthy period of police observation of the suspect or suspects in order to rule out innocent or casual actions and to show that the behavior is sustained over more than just a few minutes. The judgment requires more cognitive work than do judgments based on the other categories, where the behaviors may be more repetitive across events and circumstances, and where the actions and gestures are less ambiguous.190 And that cognitive work also can offset implicit biases in perception that can infect instantaneous or snap judgments of suspect actions, biases based on place, race, or archetypes such as the symbolic assailant.191 In contrast, “furtive movements,” marked as a basis of suspicion in about half of the police stops in 2004–12, represent the most vague and subjective indicia of suspicion.192

Next, Table 3 reports the results of regressions showing the effects of PC stops on crime. Two different regression specifications were estimated. The first analyzed the effects of PC stops alone on six different types of crime, with total number of both PC and NPC stops as a control variable. The second version analyzed the effects of PC stops controlling not only for the total number of stops, but also including a measure of the average number of indicia of suspicion marked in the stops conducted in each block group-month observation. In other words, this second estimate represents the effects of PC stops controlling for the totality of suspicion applied by officers in making PC stops in each block group-month. The results are reported as Incidence Rate Ratios (IRR). An IRR expresses the change in the dependent variable given a change in the value of the predictor, with a value of 1.0 indicating no change.193

The upper panel of Table 3 shows the effects of PC stops alone on crime. These estimates examine crimes with a lag and lead of two months. Each model is significant, although the large number of observations (6490 block groups for 108 months) reduces the importance of significance as a measure of model strength. More important are the effect sizes. The IRR estimates range from .924 for violent crimes to .969 for weapons offenses. Interpreting the IRR estimates as rates of change, these models show that for every increase of one PC stop, the various crime types will decline in each block group by anywhere from roughly three percent to seven percent. These are average effects across the block groups of the city over the 108 months of the study interval.

The lower panel of Table 3 shows the effects of PC stops interacted with the total number of stop factors marked by an officer in those stops, or the total quantity of suspicion in each case. The results are again significant in all models, and the IRR estimates this time are larger, ranging from .836 for weapons offenses to .680 for property crimes. Translating this into effect sizes, these estimates show reductions for each increase in PC stops from 16.2% for weapons offenses to approximately 32% for property and violent crimes. Again, these are average monthly effects across the block groups. In an era of declining crime rates in the city,194 these effects based on stop type are quite large. The implication as well is that higher concentrations of NPC stops are unproductive and add nothing to the crime control efforts of law enforcement.
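The conversion from an IRR to a percent change used in the preceding paragraphs is simple arithmetic: an IRR below 1.0 implies a reduction of (1 − IRR) × 100 percent in the expected count for each one-unit increase in the predictor. The short Python sketch below applies that arithmetic to three of the IRR values reported in the text. The function names are illustrative only, and the compounding helper assumes the multiplicative interpretation that is standard for Poisson models.

```python
def irr_to_percent_change(irr):
    """Percent change in the expected count per one-unit increase in the predictor."""
    return (irr - 1.0) * 100.0


def compounded_irr(irr, units):
    """Rate ratio implied by an increase of `units` in the predictor.

    Assumes effects multiply across units, as in a log-link count model.
    """
    return irr ** units


# IRR values quoted in the text for the two-month lag models.
reported = {
    "violent crime, PC stops only": 0.924,
    "weapons offenses, PC stops only": 0.969,
    "property crime, PC stops with total suspicion": 0.680,
}

for label, irr in reported.items():
    print(f"{label}: IRR {irr:.3f} -> {irr_to_percent_change(irr):+.1f}%")
```

Because the model is multiplicative, the effect of two additional units is the square of the IRR rather than twice the percent change. Reductions of this kind should therefore not be added linearly across larger increases in stops.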

The concentration of PC stops varied in each block-group-month. The range of PC stop concentration raises the question of threshold effects. At what point do crime rates deflect downward as the concentration of PC stops increases? Figures 2.1 and 2.2 show the marginal effects of PC stops at ten percent intervals in the distribution of PC stops. Marginal effects are the effects of a predictor variable on the dependent variable in a regression, estimated at specific or fixed values of that predictor with the other predictors held constant at their means or averages.195 In this case, marginal effects are estimated on total crime for each ten percent increment of the percentage of PC stops in a block group-month. Each figure corresponds to one of the two regression strategies reported in Table 3: PC stops as a predictor (Figure 2.1), and PC stops plus total suspicion as a predictor (Figure 2.2).



Each figure shows that the crime reduction effects of PC stop concentration increase beginning when these stops exceed fifty percent of all stops in a block-group-month. The effects are stable and modest up to a fifty percent concentration of PC stops, and then increase at successive increments. The sharpest increase in crime reduction is at the highest concentrations of PC stops. Figure 2.1 shows that the marginal effects increase from about 1.75 fewer crimes at fifty percent to over two crimes per block group-month with a sharp increase from eighty percent to ninety percent. These are the effects attributable to PC stops and do not reflect other factors related to crime reductions, which the marginal effects model averages over the full range of PC stops. Figure 2.2 shows the same pattern. Estimates of the effects of PC stops together with total suspicion across those stops are stable up to a fifty percent concentration, and increase at each successive increment. The largest increase in marginal effects is at the last increment, from eighty percent to ninety percent.

These analyses show the short-term effects of PC stop concentration at two month projections. This form of residual effect could decay over time as would-be offenders adjust to the increased risk of being stopped by the police, or as police officers rotate into and out of patrol assignments which can interrupt their learning and updating of their practices. An important question, then, is what are the residual effects of PC stop concentrations over longer intervals?196 To test for residual effects, the models in Table 3 were re-estimated in Table 4. Table 4 shows only the IRR for total crime and six specific categories of crime using a six-month lag and lead time parameter. The panel on the left of the table shows the IRR for PC stops only after six months; the panel on the right shows the effects of PC stops plus the totality of suspicion in those stops.

Table 4 and Figures 3.1 and 3.2 show that for both PC stops and PC stops with total suspicion, the crime reduction effects at six months are similar to the effects at two months. There are small and negligible differences in the effect sizes from two to six months for both sets of models. PC stops produce a 6.6% decline in total crime, and crime-specific reductions ranging from 7.7% for violent crime to 3.1% for weapons offenses. The reductions in total crime for PC stops with total suspicion are nearly 30%, with crime-specific reductions ranging from 16.5% for weapons offenses to 32.3% for property crimes. These percentages are based on generally low offense counts, but these monthly reductions aggregate over time to produce important and sizable safety benefits.




The comparative advantage in crime reduction benefits of focusing stops on indicia of suspicion that are more closely aligned with probable cause and behavioral markers is evident in these analyses. It would be important to identify the underlying mechanisms for these effects, but that would require a very different and ethically challenging research enterprise, including strong identification strategies that account for concurrent sources of deterrence, as well as testing the specific underlying mechanisms of deterrence.197 For now, it is not hard to imagine that by narrowing the scope of suspicion to behaviors more closely aligned with criminal activities, the emphasis on accuracy allows the signals of deterrence to be aimed more directly and less speculatively or subjectively at persons who may be deciding about a possible crime. Contemporary theories of deterrence agree that the risks of detection and apprehension are essential to effective deterrence.198

V. Redeeming the Original Sin

There are tradeoffs in crime returns (shown here and in other studies assessing the effects of probable cause stops199) when stop regimes lean heavily on subjective or inchoate indicia of suspicion over more objective behavioral markers. This tradeoff is one part of Terry’s original sin. Officers can play hunches, but at a price. The original sin, then, was less a question of moving away from Mapp’s probable cause standard than of inviting police to use their authority to conduct temporary detentions and investigations based on the very hunches that worried the Terry Court.200

The Terry Court, and subsequent Fourth Amendment opinions, chose to define neither the substantive criteria of reasonable suspicion, nor how much suspicion is required for police officers to conduct an investigative stop.201 Even when courts exercised caution in expressing what is reasonable suspicion, the standards often were simply an elongated expression of Terry’s binary approach distinguishing reasonable suspicion from probable cause.202 For example, New York State standards for investigative stops are more demanding than both the standards in other states203 and the Terry standard, yet the De Bour and Holman courts in New York defaulted to a subjective cascade of four increasing levels of intrusion.204 Redemption for Terry’s sins can come in two forms.

A.    Terry at Fifty

Whether in the binary or a more detailed articulation of reasonable suspicion such as De Bour, the Terry component of the new policing seems, as practiced, to have failed at least the crime control prong of Terry’s balancing test.205 Along the way, Terry’s failure to provide substance or quantity to the concept of actionable “suspicion” created a subjective terrain that invited police to use broad assessments of suspicion (at times, hunches) that have raised a steady stream of Fourth Amendment problems.206 While this was not unknown or unanticipated at the outset of the Terry era, the inherent subjectivity of reasonable suspicion became problematic decades later as the “new policing” unfolded and investigative stops became less a practice of individual discretion than a systematic policy-driven program.207 Investigative stops were an essential element of modern proactive policing, and those often were pursued aggressively both in quantity and interaction quality. Yet, these analyses show in New York City that stops based on general categories of suspicion that are not tied to a particular behavior have no crime reduction benefit, even though they were encouraged in an effort to reduce crime.

At the outset of the Terry era, in the midst of spiking crime rates and civil unrest,208 the Terry Court’s comments on reasonable suspicion were framed as making sure that an officer’s actions were reviewable both within police agencies and in the courts.209 Neither the Terry Court, nor later courts reviewing Terry’s standard, ever articulated sufficient detail to allow the police to know whether their actions were constitutional. Substance was almost never part of that discussion. While the reviewability prong of Terry’s doctrine no doubt anticipated a period of sorting out by appellate courts,210 what was a small caseload burden on the courts grew over the years into a long string of contentious decisions, nearly all of which expanded the scope of reasonable suspicion.211 So, one of Terry’s sins was placing a substantial burden of review on federal trial and appellate courts in a succession of suppression motions and constitutional challenges.212 That may well have been the opposite of what the Terry Court sought: to create a procedural rule that would relieve courts of that burden. Asking courts to perform a regulatory function in sorting out constitutional violations in everyday policing is one thing, but asking those same courts to do so in the absence of an articulable standard of conduct is asking too much.

The story becomes more complicated by the fact that some stops, however subjective they may be, will be constitutional under the current case law. Even when a lawful stop proceeds with verbal or physical aggression by the officer, the stop can be lawful under Terry’s expansive view of reasonable suspicion.213 Stops are constitutional so long as officers can articulate facts that are reasonable to other officers given their knowledge and circumstances. But the story is complicated simply by the numbers: when suspicion becomes so inflated as to challenge the boundaries of legality, then it is the practice itself that becomes a contested constitutional matter. The boundary between lawful and unlawful policing is not easy to draw,214 but courts as well as government have done so now on several occasions.215 Still, when courts act as regulators of reasonable suspicion, police officers who are able and willing to spin their behavior in a way that will satisfy judges who reflexively defer to police “expertise” are rewarded, while officers who are less verbally facile or who are transparent about their subjective assessment and motivations are more likely to be penalized.216

The lesson here is that some bases of suspicion are both constitutional and productive, while others may be constitutional but unproductive, and still others are neither constitutional nor productive. But since all stops have costs that are borne by innocents as well as the guilty, communities and those stopped should not have to pay for those costs that are not worth it. Those stopped shouldn’t have to bear the burdens of police hunches or stops otherwise based on thin suspicion if they in fact are not guilty. This was one part of the “impossibility” that Professor Stuntz wrote about in 1998,217 shortly after the downside of the new policing became a focus of legal, political, and social conflict.

B.    Moving to Regulation

A reset to more concrete indicia of suspicion suggested by the empirical results here holds promise to reverse those sins. Put simply, the Terry standard should be pushed toward a narrower and objective standard in light of this research. The reset involves two domains. One is a constitutional story that also raises regulatory issues. The constitutional story asks whether the procedure and subjective standards in Terry and later cases lead to violations of individual rights, and whether citizens can be asked to sacrifice those rights to the social welfare interests of criminal justice. The regulatory story is an institutional story: how to design models of oversight and assessment by accountable agents to ensure that the practices remain within the diffuse boundaries of reasonable suspicion, and to emphasize the use of stops that maximize social welfare goals of crime deterrence. The regulatory challenge is to tether that practice both to the constitutional parameters and to practices that pay.

The prospects for regulation of Terry stops through a more narrowly tailored schema of suspicion are good. At the least, shifting stops toward probable cause or behavioral indicia will shrink the stop circumstances that might otherwise be legally contested, reducing the burdens on trial and appellate courts. A shift in emphasis also creates a vocabulary and logic for internal audit, supervision, and regulation. Officers can be required to answer for what they do, not what they say.218 In contrast, it is not hard to imagine the difficulty of internally auditing the indicia of suspicion for the vague categories of “furtive movements” that were sharply criticized in the Floyd opinion.219

Instead, the process of auditing a claim of “violent crime” or “drug transaction” or even the more subjective marker of “casing a store” can involve a perceptually shared set of behavioral categories that might be more amenable to training on substantive criteria in lieu of procedural ones. Auditing of officers’ expressions of suspicion can promote learning and updating, which should be visible when officers’ actions are viewed across a range of citizen contacts. Auditing internally also creates a context of observational data that can inform collaboration among officers, and democratic participation by politically accountable agents with police and citizens. The positive returns of collaboration have been observed in studies of police institutional reform in Cincinnati and other smaller departments.220

C.    Harm Reduction

Hewing closer to objective and behaviorally specific markers of suspicion will narrow the circumstances where stops are conducted. A likely result will be a reduction in the scope and magnitude of false positives (low seizure and arrest rates, weak crime control returns) observed in the Floyd litigation and reported by monitors in the Bailey v. City of Philadelphia221 litigation. Even though the Terry Court was careful to state that reasonable suspicion was necessary to authorize a frisk, not necessarily the stop itself, its crime control agenda moved reasonable suspicion from the background of the original opinion to the forefront of contemporary case law.222 And in linking Terry stops to a crime control agenda, it was not hard to explain how reasonable suspicion became the basis to justify an investigative stop.223

Beyond the costs of a wrong guess by police that leads to a temporary street detention, the Terry Court worried about a variety of “petty indignities.”224 The indignities of this form of order maintenance in effect piled up from the accumulation of stops, not simply from publicly visible frisks.225 As Terry’s crime control agenda took root, the exposure of citizens, both innocents and those engaged in crime, to a new form of street stops grew exponentially. The indignity problem arises not from the indignity of the frisk or the search, but from the context of the stop itself. And the dignity problem also arises not from the sheer prevalence of unproductive stops and the burden on innocents (although that itself is a concern), but from the ways those stops often are conducted.226 Even the most neutral of stops carries emotional freight and the threat of indignities. The concern here is what happens before, during, and after these stops, or how encounters with the police take place and then unfold, rather than simply the regulatory questions of whether, where, and how often they occur.227

Professor Stuntz identified four types of harm from inchoate and unproductive stops: (1) privacy incursions, or the coercive invasion of one’s property or body; (2) targeting harm, being singled out in public by the police and treated like a criminal suspect;228 (3) the harm of using racial bias to justify these incursions on liberty, or using race as a signal of suspicion if not criminality on black citizens simply by virtue of being black or moving about in a black neighborhood;229 and (4) the risks of verbal and physical force that accompany stops and searches.230

These psychological and perhaps physical harms risk not only indignities to the person, but also legitimacy costs to the larger community. Intrusive stops that are not based on actual perceived criminal behavior reduce the perceived legitimacy of the police, and risk the harm of withdrawal of citizens from the co-production (with police) of security and public safety.231 The harms accrue from both direct and vicarious interactions: young persons observing the police are as likely to report lower police legitimacy as are those who have direct experience.232 They are less likely to serve on juries or cooperate with police in investigations of crime reports.233

Assume that legitimacy is produced through the aggregation of interactions between individuals and authorities, and that it is a language and shared currency of social exchange whose signals acknowledge the individual’s dignity, respect, worth, and belonging and are essential to democratic participation.234 If that were right, then simply reducing the scope of stops and narrowing the bases of suspicion to objective indicia of criminal activity would narrow the scope of indignities and harms. Targeting harms would be reduced by reining in “hunches” or actuarial suspicion based on police officers’ use of collective suspicion or Bayesian attributions of criminal intent.235 And narrowing would also reduce the emotional and psychological toll of these unwarranted intrusions.236 “Why me?” would no longer be a salient question for the residents of many urban neighborhoods who bear much of the burden for the contemporary practice of Terry stops.

The racial component of targeting harms in particular could be addressed through narrowing. Racial tensions are inextricably linked to the drift in reasonable suspicion toward both subjectivity and programmatic overreach.237 Resetting to narrowly defined legal categories may not cure racial disparities, but would likely reduce the disparate exposure to policing by race by narrowing the circumstances for permissible stops. The heaping of indignities from law enforcement and other legal actors on African Americans has special meaning for that community. Glenn Loury explains how the pervasive societal stigmatization of African Americans marginalizes their community from the institutions and norms that “mainstream” society purports to value.238

For African Americans, the sense of gain or loss suffered through the aggregation of social interactions with the state is an important part of collective or shared experiences.239 Conferring respect before the law means conferring social and democratic belonging, a form of social recognition that conveys the shared moral and legal norms between citizens and those who enforce the law. This sense of belonging and social recognition is described in rich detail by Charles Epp and his colleagues in Pulled Over,240 and suggests another, and perhaps more important, potential dignitary benefit of a narrower basis for stops. Epp et al. make an important distinction between being treated respectfully (whether or not lawfully) and being treated legally.241 The respondents in their survey were more concerned with being treated legally than with politeness or other procedural qualities.242 In this form of social recognition in law, we imagine ourselves as other people see us, and we understand who we are in and through our relationships with others. This form of recognition by legal actors, through applying law equally, affects the ties of groups to legal authority, and to the moral norms that legal actors both express and enforce. In other words, being nice is one thing, perhaps a low cost and generous if not patronizing act on the part of a legal actor holding considerable power, but expressively acknowledging the rights of a person before the law is quite another. Undoing dignitary harms leans strongly toward the latter view.

D.    Regulatory Algebra

Substantive laws that criminalize relatively harmless or benign acts can animate the use of police power to extend Terry stops to a broad spectrum of behaviors and actions. But it was never clear that the use of Terry’s stop power was aimed at trivial crimes, or as a pretext for conducting field investigations as a fishing exercise based on hunches.243 Those hunches often are instantiated in current practice,244 and they seem to be artifacts of the capacious and inchoate indicia of reasonable suspicion. One remedy is to recalibrate suspicion in the context of Terry stops to move away from subjective criteria and reliance on officers’ experience-based judgments to a regime of objective, behaviorally-defined indicia of suspicion.

The move toward more objective and behavioral bases of suspicion does not mean that the police should abandon the practice of Terry stops. What it does imply is that there is a tradeoff to using this power too broadly, and the regulatory response requires adjusting the thresholds for police contact. As a matter of policy, the broad use of Terry’s stop power is encouraged by the new policing, especially in the context of robust order maintenance regimes.245 Therein lies the trouble. Terry stops should require a higher level of suspicion than an officer’s hunch or subjective appraisal that “crime is afoot.” The Terry Court never said which crimes had to be “afoot” to justify a stop, only that the act was criminal. When the criminal law is so broadly enforced, and when non-criminal violations or local ordinances are integrated with the overall mission of street policing to detect weapons and control violence, the likelihood increases that both benign and serious crimes will be part of the umbrella of suspicion. The burden of proof for administrative violations or low-level misdemeanor offenses is intrinsically lower than for felony offenses and places Terry’s fundamental rules at risk.

Imagine that we have two types of acts: a benign act and a harmful one. Intervening in the relatively benign act, such as a violation of an administrative code, seems to benefit almost no one: there are few public benefits to crime control in that instance, since the range of harm is largely private or de minimis. Even if one accepts that some aspects of disorder may be criminogenic, itself a heavily contested notion, the argument here is that the treatment through criminal enforcement may have iatrogenic effects on legitimacy and cooperation. That is, we may stop someone from smoking in public, drinking from an open container, playing loud music in a residential area, or jumping turnstiles on public transit. We may signal “order” by enforcing these laws, but their relationship to public safety is path dependent on the questionable relationship between theories of social or physical disorder and crime.246 Worse, such enforcement may engender withdrawal if not resistance to cooperation with the police.247 This may seem like an efficient use of a scarce public good (policing) because “hit rates” may be high, but the yield for public safety is low if these low-level crimes are not gateways to violence or major property crimes.


Figure 4. Probability Distributions for Strength of Evidence

Source: Louis Kaplow, Burden of Proof, 121 Yale L. J. 738, 757 (2011).

More important, the burden of proof in these instances is intrinsically low.248 Policing benign acts may satisfy the demand for metrics of police productivity, but its contribution to crime control is dubious.249 Aggressive enforcement of benign acts also has efficiency costs by distracting police from intervening in the more harmful ones. It is only in the shared space of benign and harmful acts that it makes sense to intervene in the benign act at a lower standard of proof, and the size of that shared space is part of a contentious debate.250 The social harms from undetected harmful acts will outweigh any private or small-scale benefits from intervening in the benign acts. Figure 4 illustrates how the social good in the form of public safety seems to be greater when we focus our attention on the more serious acts. In other words, do not sweat the little stuff, and focus on more serious acts with more consequential public harms. This is simple regulatory algebra.

VI. Conclusion

Both law enforcement and citizen interests are better served by a recalibration of Terry standards to move them closer to Mapp’s more exacting probable cause standard. A more workable and easily understood standard for regulating police use of the stop power would create a more comfortable space internally for police to monitor, audit, and regulate compliance with constitutional law as well as internal policy. And it also can provide a standard that moves away from the subjective criteria that Terry invited and toward criteria that are less vulnerable to cognitive error, perceptual distortions, and social harms. Secondary benefits for legitimacy may well follow. Penance for Terry’s original sin is within reach.


Appendix A

Aggregate Category: Suspected Offenses

Violent Crime: Aggravated Assault; Aggravated Harassment; Aggravated Sexual Abuse
Minor Violent Crime: Reckless Endangerment; Resisting Arrest; Unlawful Imprisonment; Vehicular Assault
Drug Crime: Criminal Possession of Controlled Substances; Criminal Sale of Controlled Substances; Criminal Possession of Drug Paraphernalia; Other Drug Offenses
Marijuana Possession: Criminal Possession of Marijuana
Marijuana Sale: Criminal Sale of Marijuana
Part I Property Crime: Grand Larceny; Grand Larceny Auto
Minor Property Crime: Auto Stripping; Computer Trespass; Criminal Possession of Stolen Property; Criminal Mischief; Criminal Possession of Computer Materials; Criminal Possession of Forged Instruments; Criminal Tampering; Misapplication of Property; Petit Larceny; Possession of Burglar Tools; Reckless Endangerment of Property; Theft of Services; Unauthorized Use of a Vehicle
Fraud and Related: Falsifying Business Records; Forgery of a VIN; Fraudulent Accosting; Insurance Fraud; Tampering with a Public Record; Unlawful Use of Credit Card, Debit
Criminal Trespass
Prostitution and Related
Quality of Life/Disorder: Fortune Telling; Making Graffiti; Obstructing Firefighting Operations; Obstructing Governmental Administration; Possession of Graffiti Instruments; Trademark Counterfeiting; Unlawfully Dealing with Fireworks; Unauthorized Recording; Unlawful Assembly; Disorderly Conduct; Quality of Life; Riding Bike on the Sidewalk; Alcohol Violation
Sex Crimes and Related: Course of Sexual Conduct; Public Display of Offensive Sexual Material; Public Lewdness; Sexual Abuse; Sexual Misconduct; Forcible Touching; Other Minor Sex Crimes