-
Notifications
You must be signed in to change notification settings - Fork 0
/
Where-s-dinner-coming-from--A-utility-based-investigation-of-access-to-nutrition-in-Utah..tex
1855 lines (1553 loc) · 81.2 KB
/
Where-s-dinner-coming-from--A-utility-based-investigation-of-access-to-nutrition-in-Utah..tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
% Options for packages loaded elsewhere
\PassOptionsToPackage{unicode}{hyperref}
\PassOptionsToPackage{hyphens}{url}
\PassOptionsToPackage{dvipsnames,svgnames,x11names}{xcolor}
%
\documentclass[
letterpaper,
number,
review,
3p]{elsarticle}
\usepackage{amsmath,amssymb}
\usepackage{iftex}
\ifPDFTeX
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}
\usepackage{textcomp} % provide euro and other symbols
\else % if luatex or xetex
\usepackage{unicode-math}
\defaultfontfeatures{Scale=MatchLowercase}
\defaultfontfeatures[\rmfamily]{Ligatures=TeX,Scale=1}
\fi
\usepackage{lmodern}
\ifPDFTeX\else
% xetex/luatex font selection
\fi
% Use upquote if available, for straight quotes in verbatim environments
\IfFileExists{upquote.sty}{\usepackage{upquote}}{}
\IfFileExists{microtype.sty}{% use microtype if available
\usepackage[]{microtype}
\UseMicrotypeSet[protrusion]{basicmath} % disable protrusion for tt fonts
}{}
\makeatletter
\@ifundefined{KOMAClassName}{% if non-KOMA class
\IfFileExists{parskip.sty}{%
\usepackage{parskip}
}{% else
\setlength{\parindent}{0pt}
\setlength{\parskip}{6pt plus 2pt minus 1pt}}
}{% if KOMA class
\KOMAoptions{parskip=half}}
\makeatother
\usepackage{xcolor}
\setlength{\emergencystretch}{3em} % prevent overfull lines
\setcounter{secnumdepth}{5}
% Make \paragraph and \subparagraph free-standing
\ifx\paragraph\undefined\else
\let\oldparagraph\paragraph
\renewcommand{\paragraph}[1]{\oldparagraph{#1}\mbox{}}
\fi
\ifx\subparagraph\undefined\else
\let\oldsubparagraph\subparagraph
\renewcommand{\subparagraph}[1]{\oldsubparagraph{#1}\mbox{}}
\fi
\providecommand{\tightlist}{%
\setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}}\usepackage{longtable,booktabs,array}
\usepackage{calc} % for calculating minipage widths
% Correct order of tables after \paragraph or \subparagraph
\usepackage{etoolbox}
\makeatletter
\patchcmd\longtable{\par}{\if@noskipsec\mbox{}\fi\par}{}{}
\makeatother
% Allow footnotes in longtable head/foot
\IfFileExists{footnotehyper.sty}{\usepackage{footnotehyper}}{\usepackage{footnote}}
\makesavenoteenv{longtable}
\usepackage{graphicx}
\makeatletter
\def\maxwidth{\ifdim\Gin@nat@width>\linewidth\linewidth\else\Gin@nat@width\fi}
\def\maxheight{\ifdim\Gin@nat@height>\textheight\textheight\else\Gin@nat@height\fi}
\makeatother
% Scale images if necessary, so that they will not overflow the page
% margins by default, and it is still possible to overwrite the defaults
% using explicit options in \includegraphics[width, height, ...]{}
\setkeys{Gin}{width=\maxwidth,height=\maxheight,keepaspectratio}
% Set default figure placement to htbp
\makeatletter
\def\fps@figure{htbp}
\makeatother
% definitions for citeproc citations
\NewDocumentCommand\citeproctext{}{}
\NewDocumentCommand\citeproc{mm}{%
\begingroup\def\citeproctext{#2}\cite{#1}\endgroup}
\makeatletter
% allow citations to break across lines
\let\@cite@ofmt\@firstofone
% avoid brackets around text for \cite:
\def\@biblabel#1{}
\def\@cite#1#2{{#1\if@tempswa , #2\fi}}
\makeatother
\newlength{\cslhangindent}
\setlength{\cslhangindent}{1.5em}
\newlength{\csllabelwidth}
\setlength{\csllabelwidth}{3em}
\newenvironment{CSLReferences}[2] % #1 hanging-indent, #2 entry-spacing
{\begin{list}{}{%
\setlength{\itemindent}{0pt}
\setlength{\leftmargin}{0pt}
\setlength{\parsep}{0pt}
% turn on hanging indent if param 1 is 1
\ifodd #1
\setlength{\leftmargin}{\cslhangindent}
\setlength{\itemindent}{-1\cslhangindent}
\fi
% set entry spacing
\setlength{\itemsep}{#2\baselineskip}}}
{\end{list}}
\usepackage{calc}
\newcommand{\CSLBlock}[1]{\hfill\break\parbox[t]{\linewidth}{\strut\ignorespaces#1\strut}}
\newcommand{\CSLLeftMargin}[1]{\parbox[t]{\csllabelwidth}{\strut#1\strut}}
\newcommand{\CSLRightInline}[1]{\parbox[t]{\linewidth - \csllabelwidth}{\strut#1\strut}}
\newcommand{\CSLIndent}[1]{\hspace{\cslhangindent}#1}
\usepackage{rotating}
\makeatletter
\@ifpackageloaded{bookmark}{}{\usepackage{bookmark}}
\makeatother
\makeatletter
\@ifpackageloaded{caption}{}{\usepackage{caption}}
\AtBeginDocument{%
\ifdefined\contentsname
\renewcommand*\contentsname{Table of contents}
\else
\newcommand\contentsname{Table of contents}
\fi
\ifdefined\listfigurename
\renewcommand*\listfigurename{List of Figures}
\else
\newcommand\listfigurename{List of Figures}
\fi
\ifdefined\listtablename
\renewcommand*\listtablename{List of Tables}
\else
\newcommand\listtablename{List of Tables}
\fi
\ifdefined\figurename
\renewcommand*\figurename{Figure}
\else
\newcommand\figurename{Figure}
\fi
\ifdefined\tablename
\renewcommand*\tablename{Table}
\else
\newcommand\tablename{Table}
\fi
}
\@ifpackageloaded{float}{}{\usepackage{float}}
\floatstyle{ruled}
\@ifundefined{c@chapter}{\newfloat{codelisting}{h}{lop}}{\newfloat{codelisting}{h}{lop}[chapter]}
\floatname{codelisting}{Listing}
\newcommand*\listoflistings{\listof{codelisting}{List of Listings}}
\makeatother
\makeatletter
\makeatother
\makeatletter
\@ifpackageloaded{caption}{}{\usepackage{caption}}
\@ifpackageloaded{subcaption}{}{\usepackage{subcaption}}
\makeatother
\journal{WSTLUR Symposium 2024}
\ifLuaTeX
\usepackage{selnolig} % disable illegal ligatures
\fi
\usepackage{bookmark}
\IfFileExists{xurl.sty}{\usepackage{xurl}}{} % add URL line breaks if available
\urlstyle{same} % disable monospaced font for URLs
\hypersetup{
pdftitle={Where's dinner coming from? A utility-based investigation of access to nutrition in Utah.},
pdfauthor={Gregory S. Macfarlane; Emma Stucki; Myrranda Salmon; Alisha H. Redelfs; Lori A. Spruance},
pdfkeywords={Accessibility, Utility-based access, Access to
nutrition, Passive location data},
colorlinks=true,
linkcolor={blue},
filecolor={Maroon},
citecolor={Blue},
urlcolor={Blue},
pdfcreator={LaTeX via pandoc}}
\setlength{\parindent}{6pt}
\begin{document}
\begin{frontmatter}
\title{Where's dinner coming from? A utility-based investigation of
access to nutrition in Utah.}
\author[1]{Gregory S. Macfarlane%
\corref{cor1}%
}
\ead{gregmacfarlane@byu.edu}
\author[1]{Emma Stucki%
%
}
\author[1]{Myrranda Salmon%
%
}
\author[2]{Alisha H. Redelfs%
%
}
\author[2]{Lori A. Spruance%
%
}
\affiliation[1]{organization={Civil and Construction Engineering
Department, Brigham Young University},city={Provo, Utah
USA},postcode={84602},postcodesep={}}
\affiliation[2]{organization={Public Health Department, Brigham Young
University},city={Provo, Utah USA},postcode={84602},postcodesep={}}
\cortext[cor1]{Corresponding author}
\begin{abstract}
Convenient access to high-quality nutrition is a critical element of
public health as well as an important interface between communities and
the transportation system. In this research, we seek to construct a
detailed picture of the nutrition environment in three communities in
Utah, alongside the community members' ability to access that
environment through multiple transportation modes. In doing so we
construct a utility-based accessiblity model enabled by modern mobility
device data. This model reveals the tradeoffs between the quality and
price of goods on one hand and the distance traveled to reach them on
the other. We then apply this model to a series of potential
access-improving policies: building a new store, improving an existing
store, and improving the non-automobile transport network between
residents and existing stores. The results show that new or improved
store locations bring substantially higher benefits than improvements to
the transportation system, at likely lower costs. The findings suggest
that transportation agencies work to increase the availability of
community-sized grocery stores in low-access areas, and consider
activity-based methods of measuring resource access.
\end{abstract}
\begin{keyword}
Accessibility \sep Utility-based access \sep Access to
nutrition \sep
Passive location data
\end{keyword}
\end{frontmatter}
\bookmarksetup{startatroot}
\section{Introduction}\label{introduction}
The ability of people to access quality nutrition has been studied at
length in public health and urban geography for decades (Beaulac et al.,
2009; Walker et al., 2010). This interest is motivated in large part by
an observed spatial disparity in nutrition access in many communities
--- though this issue may be particularly pronounced in the United
States (Beaulac et al., 2009). The spatial disparity has been linked at
an aggregate level with negative public health outcomes (Chen et al.,
2016; Cooksey-Stowers et al., 2017), though other complicating factors
including prices and habits may be present as well (Ghosh-Dastidar et
al., 2014).
At the same time, access to nutrition and to community resources in
general is frustratingly hard to define. ``Accessibility'' is an
abstract concept without a specific quantitative definition (Handy \&
Niemeier, 1997). However, using accessibility as a policy measure
requires comparative quantification, and transportation and public
health researchers have constructed several quantitative measures, such
as presence of a store within a travel time buffer, or the distance to
the nearest store. These measures are relatively easy to calculate using
readily available GIS software, but elide much useful information (Dong
et al., 2006; Logan et al., 2019). These types of measures require the
researcher to make a series of assumptions and assertions: why is 30
minutes chosen instead of 40? Is that time by transit or highway or
walking? Should these definitions change for individuals in different
socioeconomic groups? And people do not always go to the closest grocery
store to begin with (Clifton, 2004; Hillier et al., 2011); how much
further are people willing to travel to go to a store that is cheaper or
that has a wider variety of goods? And perhaps the home location isn't
the only spatial point of reference (Liu et al., 2022). A measure that
potentially combines many of these different considerations is
desirable.
In this research, we develop and explore an accessibility measure based
on destination choice models estimated for three distinct communities in
Utah. This methodology is based on a unique dataset made by linking
between three extensive data sources:
\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\tightlist
\item
A detailed survey of the nutrition market in three Utah communities.
\item
Location-based services data derived from mobile phone records
revealing which grocery stores are frequented by residents of
different neighborhoods.
\item
Multi-modal network data providing detailed mobility data by car,
walking, and public transit.
\end{enumerate}
These data will be combined in order to develop accurate logit models
that demonstrate the variables that are significant to grocery store
choice in Utah. These models could then be used to find accessibility to
stores and impact transportation policy to improve quality of life for
all communities in Utah.
This paper is organized in a typical manner. A methodology for data
collection and modeling is described in Section~\ref{sec-methods} and a
description of the nutrition environment and choice models estimates
follows in Section~\ref{sec-results}. Section~\ref{sec-scenarios}
presents a series of scenarios to which we apply the models estimated in
Section~\ref{sec-results}, illustrating the interrelated elements of
nutrition quality and transportation infrastructure in developing more
complete access to nutrition. Section~\ref{sec-conclude} places the
findings of this research in context, while discussing limitations to
the methodology and associated future research opportunities.
\bookmarksetup{startatroot}
\section{Methods}\label{sec-methods}
The nutrition access literature is long and has been approached from
numerous angles including public health, urban science and economics,
and social justice. In general, researchers have sought to link spatial
access to nutrition with health outcomes including obesity, caloric
intake, and the like. Complete --- though somewhat dated --- reviews of
this literature can be had from (Beaulac et al., 2009) and (Walker et
al., 2010). More recent work has extended the description and refinement
of the measures used to evaluate food access and control for confounding
variables. (Widener \& Shannon, 2014) considered that temporal access to
quality food is as important as spatial access. (Aggarwal et al., 2014)
suggested that spatial access was not as important as store choice,
given that most people were not observed to shop at the nearest vendor.
By contrast, (Chen et al., 2016) compared spatial access to quality food
vendors with observed food expenditures and showed poor access explained
obesity even when controlling for consumption. (Cooksey-Stowers et al.,
2017) jointly pursued spatial proximity with quality of offerings and
showed the latter might be more predictive of obesity rates.
What has not been frequently attempted in the nutrition access
literature, however, is a serious comparison of multiple alternative
policies to address the problem, which would require a multi-dimensional
analysis of spatial access, store quality, and observed tradeoffs
between the two. (Macfarlane et al., 2021) illustrated the potential for
a utility-based model of access to establish relationships between urban
green space access and health, and then continued that methodology into
a policy analysis of park space during Covid-19
(\textbf{macfarlane2022a?}). The potential for application of this
methodology to the nutrition literature is well-motivated by the
previous attempts as well as the lack of clear policy solutions (Wright
et al., 2016).
This section describes how we construct a model of access to grocery
stores in communities in Utah. We first describe the theoretical model,
and then describe data collection efforts to estimate this model and
apply it.
\subsection{Model}\label{model}
A typical model of destination choice (Recker \& Kostyniuk, 1978) can be
described as a random utility maximization model where the utility of an
individual \(i\) choosing a particular destination \(j\) is
\begin{equation}\phantomsection\label{eq-utility}{ U_{ij} = \beta_{s}f(k_{ij}) + \beta_{x}(X_j) }\end{equation}
where \(f(k_{ij})\) is a function of the travel impedance or costs from
\(i\) to \(j\) and \(X_{j}\) represents the location attributes of
\(j\). The coefficients \(\beta\) can be estimated given sufficient data
revealing the choices of individuals. The probability that individual at
location \(i\) will choose alternative \(j\) from a choice set \(J\) can
be estimated with a multinomial logit model (MNL) (McFadden, 1974),
\begin{equation}\phantomsection\label{eq-mnl}{ P_i(j) = \frac{\exp(U_{ij})}{\sum_{j' \in J}{\exp(U_{ij'})}}}\end{equation}
The overall fit of the model can be described with the Akaike
Information Criterion (AIC) --- which should be minimized --- or by the
McFadden likelihood ratio
\(\rho^2_0 = 1 - \ln\mathcal{L} / \ln\mathcal{L}_0\). In this ratio
\(\ln{\mathcal{L}}\) is the model log-likelihood and
\(\ln{\mathcal{L}_0}\) the log-likelihood of an alternative model where
all destinations are equally likely; a higher \(\rho^2_0\) value
indicates more explanatory power relative to this null, random chance
only model.
The idea of using destination choice logsums as accessibility terms is
not new, and the theory for doing so is described in (Ben-Akiva \&
Lerman, 1985, p. 301). Effectively, the natural logarithm of the
denominator in Equation~\ref{eq-mnl} represents the consumer surplus ---
or total benefit --- available to person \(i\):
\begin{equation}\phantomsection\label{eq-cs}{ CS_i = \ln\left(\sum_{j \in J} \exp(U_{ij})\right)}\end{equation}
A difference in logsum measures may exist for a number of reasons that
affect the utility functions described in Equation~\ref{eq-utility}. For
example, individuals at different locations or with different mobility
will see different impedance values \(k_{ij}\) and therefore affected
utility. Changes to the attributes of the destinations \(X_j\) will
likewise affect the utility.
Despite the relative maturity of this theory, applications of
utility-based access in the literature are still rare, outside of public
transport forecasting analyses (Geurs et al., 2010). The rarity is
likely explained by an unfamiliarity with destination choice models and
the ready availability of simpler methods on one hand (Logan et al.,
2019), and the difficulty in obtaining a suitable estimation dataset for
particular land uses on the other (Kaczynski et al., 2016). This second
limitation has been somewhat improved by a new methodology developed by
(\textbf{macfarlane2022a?}), relying on commercial location-based
services data to estimate the affinity for simulated agents to visit
destinations of varying attributes and distances.
\subsection{Data}\label{data}
In this research, we develop a unique dataset to estimate the
destination choice utility coefficients for grocery store choice in
three different communities in Utah. The three communities were selected
to maximize potential observed differences in utility between community
residents. The three communities are Utah County, West Salt Lake County,
and San Juan County. Note that in this document we refer to the second
community as ``Salt Lake'' even though this does not refer to the entire
Salt Lake County nor to Salt Lake City, rather, we focus on communities
in the western part of the valley, such as Magna, Kearns, and West
Valley City. The communities are shown in a wider context in
Figure~\ref{fig-communities}.
\begin{figure}
\centering{
\includegraphics{03_methods_files/figure-pdf/fig-communities-1.pdf}
}
\caption{\label{fig-communities}Location of study regions in Utah.}
\end{figure}%
Table~\ref{tbl-acsdata} shows several key population statistics based on
2021 American Community Survey data for block groups in the three
communities of interest. Utah County is a largely suburban county with
high incomes and a low percentage of minority individuals. The Salt Lake
region is more dense with somewhat lower incomes and household sizes but
a high share of minority individuals. San Juan County is primarily
rural, with a few small communities and a large reservation for the
Navajo Tribe.
\begin{table}
\caption{\label{tbl-acsdata}Demographic Statistics of Study Regions}
\centering{
\centering
\begin{tabular}[t]{lrrr}
\toprule
& Utah & Salt Lake & San Juan\\
\midrule
Total population & 627,098 & 655,830 & 7,091\\
Total households & 171,538 & 216,731 & 2,090\\
Housing units per sq. km & 599 & 831 & 103\\
Median income & 79,453 & 64,868 & 58,586\\
Percent minority individuals & 18 & 36 & 26\\
\bottomrule
\end{tabular}
}
\end{table}%
Estimating the utility model described in Equation~\ref{eq-mnl} for
grocery stores requires three interrelated data elements:
\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\tightlist
\item
An inventory of grocery store attributes \(X_j\);
\item
A representative travel impedance matrix \(K\) composed of all
combinations of origin \(i\) and destination \(j\);
\item
A database of observed person flows between \(i\) and \(j\) by which
to estimate the \(\beta\) coefficients. We describe each of these
elements in turn in the following sections.
\end{enumerate}
\subsubsection{Store Attributes}\label{store-attributes}
The store attributes were collected using the Nutritional Environment
Measures Survey --- Stores (NEMS-S) tool (Glanz et al., 2007). This tool
was developed to reveal significant differences in the availability and
cost of healthy foods in an environment, and has been validated for this
purpose. Beyond superficial attributes such as the store category
(dollar store, convenience store, ethnic market, etc.) and the number of
registers, the NEMS-S collects detailed information about numerous store
offerings such as the availability of produce, dairy products, lean
meats, juices, and canned and dry goods of various specific types. Of
particular interest to the survey are availability and price
differentials of lower-fat alternatives: for example, the survey
instrument requests the shelf space allocated to milk products of
various fat levels and the price of each product.
Student research assistants collected the store attributes by visiting
grocery stores, dollar stores, ethnic markets, and other food markets in
the three communities of interest described above. Stores were
identified using internet-based maps combined with in-person validation
and observation. The student researchers completed the NEMS-S instrument
with the aid of a digital survey and a tablet computer. Each researcher
who collected data was trained to use the survey at a control store in
Provo, and the training data helped to eliminate the risk of surveyor
bias. The store attributes were collected in the spring of 2021 for Utah
County and spring of 2022 for Salt Lake and San Juan Counties. In Utah
and Salt Lake Counties, we included dollar stores and grocery stores but
did not include convenience stores. Given the rural nature of San Juan
County, we made two adjustments to capture the entirety of the nutrition
environment. First, we included convenience stores and trading posts if
they were the only food market in a community. We also included
full-service grocery stores in Cortez, Colorado, and Farmington, New
Mexico in the San Juan data collection, as community conversations made
it clear that many residents will drive these long distances for
periodic shopping with greater availability and lower prices.
Using the information in the NEMS-S survey, two measures of a store can
be calculated: an availability score based on whether stores stock
particular items as well as lower-calorie options; and a cost score
describing the spread between prices of these options. These score
values are described in (Lunsford et al., 2021), and we developed an R
package to compute the scores; this package is available at
\url{https://github.com/byu-transpolab/nemsr}. In the availability
score, each store is given a value for whether or not there are more
healthful options available in the store, such as low-calorie chips, or
low-fat milk. If the store does not have a more healthful option in a
category it receives a lower score, so stores with more availability of
healthful food options will receive a higher availability score. For the
cost score, the measure is the price spread between healthful and less
healthful options: if the price of whole wheat bread is cheaper than
white bread, the store receives positive points for the cost option, if
the price is the same then zero points are awarded, and if the wheat
bread is more expensive then the store receives negative points. Thus a
store with a higher availability and cost score will have both more
healthful options, and a more advantageous pricing scheme towards those
options.
One important store attribute that the NEMS-S instrument does not
collect or compute directly is a measure of the cost of common goods
that can be compared across stores. We therefore used the data collected
from the NEMS-S instrument to construct a market basket-based
affordability measure that could be compared across stores, following
the approach of (Hedrick et al., 2022). This market basket score is
based on the US Department of Agriculture (USDA) 2021 Thrifty Food Plan
(FNS, 2021), which calculates a reference market basket for a family of
four. Because this market basket contains more (and sometimes different)
items than what the NEMS-S instrument requests, we chose relevant items
from our NEMS-S data as replacements. For example, the USDA market
basket contains a certain amount of poultry, but the NEMS-S score
collects the per-pound cost of ground beef at various fat contents. For
any stores that were missing any of the elements in the market basket,
we first substituted with another ingredient that would fit the
nutritional requirements. If no substitute was available, we included
the average price of the missing good at other stores in that community
multiplied by 1.5 as a penalty for not containing the product. The final
market basket score is the total cost of all foods in the market basket.
These costs can then be compared from store to store to understand
general affordability comparisons between stores.
\begin{table}
\caption{\label{tbl-nems}Grocery Store Attributes}
\centering{
\centering
\resizebox{\linewidth}{!}{
\begin{tabular}[t]{llrrrrrr}
\toprule
\multicolumn{2}{c}{ } & \multicolumn{2}{c}{Utah (N=63)} & \multicolumn{2}{c}{Salt Lake (N=39)} & \multicolumn{2}{c}{San Juan (N=50)} \\
\cmidrule(l{3pt}r{3pt}){3-4} \cmidrule(l{3pt}r{3pt}){5-6} \cmidrule(l{3pt}r{3pt}){7-8}
& & Mean & Std. Dev. & Mean & Std. Dev. & Mean & Std. Dev.\\
\midrule
Registers (incl. self checkout) & & 12.5 & 11.7 & 9.9 & 8.9 & 6.1 & 8.8\\
NEMS-S availability score & & 18.7 & 8.4 & 16.2 & 8.1 & 13.2 & 7.6\\
NEMS-S cost score & & 1.9 & 2.3 & 2.3 & 2.2 & 1.9 & 1.9\\
Market basket cost & & 126.1 & 21.5 & 141.6 & 19.2 & 157.6 & 16.8\\
\midrule
& & N & Pct. & N & Pct. & N & Pct.\\
Type & Convenience Store & 2 & 3.2 & 0 & 0.0 & 10 & 20.0\\
& Dollar Store & 5 & 7.9 & 11 & 28.2 & 15 & 30.0\\
& Grocery Store & 50 & 79.4 & 27 & 69.2 & 19 & 38.0\\
& Other & 6 & 9.5 & 1 & 2.6 & 6 & 12.0\\
Pharmacy & FALSE & 42 & 66.7 & 32 & 82.1 & 43 & 86.0\\
& TRUE & 21 & 33.3 & 7 & 17.9 & 7 & 14.0\\
Ethnic market & FALSE & 55 & 87.3 & 30 & 76.9 & 47 & 94.0\\
& TRUE & 8 & 12.7 & 9 & 23.1 & 3 & 6.0\\
Other merchandise sold & FALSE & 52 & 82.5 & 35 & 89.7 & 47 & 94.0\\
& TRUE & 11 & 17.5 & 4 & 10.3 & 3 & 6.0\\
\bottomrule
\end{tabular}}
}
\end{table}%
Table~\ref{tbl-nems} presents the store attribute data collected for
each community. Utah County generally has the largest average store size
(as measured by the number of checkout registers) while having the
lowest market basket cost, the highest availability of healthful food
(measured by the NEMS-S availability score) and the lowest difference
between healthy and unhealthy food (the NEMS-S cost score). San Juan
County has the smallest average stores, highest costs, and the lowest
availability of healthy options, and Salt Lake falls in between.
\paragraph{Imputation of Missing Store
Data}\label{imputation-of-missing-store-data}
We collected detailed store attributes for a complete census of stores
in Utah County, San Juan County, and a portion of Salt Lake County using
the NEMS-S survey instrument. These attributes form the basis of the
choice models used to determine access and provide a complete picture of
access in those communities, assuming people do not leave the
communities for grocery trips. But understanding access in other parts
of Salt Lake County -- including how stores outside of the West Salt
Lake County area might shape access inside that community --- requires
us to impute the measured attributes onto the stores that we did not
directly measure.
To do this, we used web-based mapping databases (including OpenStreetMap
and Google Maps) to obtain a list of grocery stores, dollar stores, and
appropriate convenience stores throughout the state. From this search,
we were able to determine each store's location, brand name, and store
type, which we also collected in the manual data assembly efforts. Using
this information, we built a multiple imputation model using the
\texttt{mice} package for R (van Buuren \& Groothuis-Oudshoorn, 2011).
The predictor variables in the imputation included the store brand and
type, as well as the average income and housing density in the nine
closest block groups to the store location (based on population-weighted
block group centroids and Euclidean distances).
\begin{figure}
\centering{
\includegraphics[width=5in,height=\textheight]{03_methods_files/figure-pdf/fig-marketimp-1.pdf}
}
\caption{\label{fig-marketimp}Imputed market price values for 12 random
grocery stores.}
\end{figure}%
Thirty iterations of the multiple imputation algorithm were run for each
of ten independent imputations. Figure~\ref{fig-marketimp} shows the
density of the ten imputed market basket prices for a randomly selected
set of 12 stores. As the figure reveals, there is some general peaking
in the predicted market price for most stores, but the imputation model
still predicts a wide range of possible prices for most stores. When
using the imputed data for analysis, we take the mean of the ten
predictions for continuous values, and the mode for discrete values.
\subsubsection{Travel Impedances}\label{sec-mcls}
The second element of the utility equation in Equation~\ref{eq-utility}
is the travel impedance between \(i\) and \(j\). Many possibilities for
representing this impedance exist, from basic euclidean distance to
complex network paths. A primary purpose of the model we are developing
in this research is to study comparative tradeoffs between
infrastructure-focused and environment-focused improvements to the
nutrition access of households. It is therefore essential that we use a
travel impedance measure that can combine and compare the cost of
traveling by multiple modes so that highway improvements and transit /
active transport improvements can be compared in the same basic model.
Just as the log-sum of a destination choice model is a measure that sums
the utility of multiple destination attributes and costs in a rigorous
manner, the log-sum of a mode choice model combines the utilities of all
available travel modes. In this study we assert the following mode
choice utility equations: \begin{align*}
V_{\mathrm{auto}, ij} &= -0.028(t_{\mathrm{auto}, ij})\\
V_{\mathrm{bus}, ij} &= -4 -0.028(t_{\mathrm{bus}, ij}) -0.056(t_{\mathrm{wait}, ij}) -0.056(t_{\mathrm{access}, ij})\\
V_{\mathrm{walk, ij}} &= -5 -0.028(t_{\mathrm{walk}, ij}) -1.116(d_{ij<1.5}) -5.58(d_{ij>1.5})\\
\end{align*} where \(t\) is the in-vehicle travel time in minutes for
each mode between \(i\) and \(j\). The transit utility function
additionally includes the wait time for transit as well as the time
necessary to access the transit mode on both ends by walking. The walk
utility includes a per-mile distance disutility that increases for
distances greater than 1.5 miles. These equations and coefficients are
adapted from a statewide mode choice model for home-based non-work trips
in urban and rural regions developed for UDOT research (Barnes, 2021).
The log-sum, or total weighted impedance by all modes is therefore
\begin{equation}\phantomsection\label{eq-mcls}{
k_{ij} = \ln(e^{V_{\mathrm{auto}, ij}} + e^{V_{\mathrm{bus}, ij}} + e^{V_{\mathrm{walk},ij}})
}\end{equation}
In this implementation, \(i\) is the population-weighted centroid of a
2020 Census block group, and \(j\) is an individual grocery store. We
measure the travel times from each \(i\) to each \(j\) using the
\texttt{r5r} implementation of the R5 routing engine (Conway et al.,
2017, 2018; Conway \& Stewart, 2019; Pereira et al., 2021). This
algorithm uses common data elements --- OpenStreetMap roadway and active
transport networks alongside General Transit Feed Specification (GTFS)
transit service files --- to simulate multiple realistic route options
by all requested modes. We obtained OpenStreetMap networks and the Utah
Transit Authority GTFS file valid for May 2023 and requested the minimum
total travel time by each mode of auto, transit, and walking for a
departure between 8 AM and 9 AM on May 10, 2023. The total allowable
trip time by any mode was set to 120 minutes, and the walk distance was
capped at 10 kilometers; if a particular \(i,j\) pair exceeded these
parameters then the mode was presumed to not be available and
contributes no utility to the log-sum.
\subsubsection{Mobile Device Data}\label{mobile-device-data}
The final element of destination utility presented in
Equation~\ref{eq-utility} is the set of coefficients, which are often
estimated from household travel surveys in a travel demand context. It
is unlikely, however, that typical household diaries would include
enough trips to grocery stores and similar destinations to create a
representative sample.
Emerging mobile device data, however, could reveal the typical home
locations for people who are observed in the space of a particular
store. (\textbf{macfarlane2022a?}) present a method for estimating
destination choice models from such data, which we repeat in this study.
We provided a set of geometric polygons for the grocery stores of
interest to StreetLight Data, Inc., a commercial location-based services
aggregator and reseller. StreetLight Data in turn provided data on the
number of mobile devices observed in each polygon grouped by the
inferred residence block group of those devices during summer 2022. We
then created a simulated destination choice estimation dataset for each
community resource by sampling 10,000 block group - grocery store
``trips'' from the StreetLight dataset. This created a ``chosen''
alternative; we then sampled ten additional stores from the same
community at random (each simulated trip was paired with a different
sampled store) to serve as the non-chosen alternatives. Random sampling
of alternatives is a common practice that results in unbiased estimates,
though the standard errors of the estimates might be larger than could
be obtained through a more carefully designed sampling scheme (Train,
2009).
\bookmarksetup{startatroot}
\section{Results}\label{sec-results}
This section presents results on the nutrition environment in each of
the three communities of Utah County, West Salt Lake County, and San
Juan County, along with destination choice model estimates and their
application to creating accessibility maps of each community and the
entire state of Utah.
\subsection{Nutrition Environment}\label{sec-nems}
Though some basic descriptive statistics of the grocery store attributes
were presented in Table~\ref{tbl-nems}, some additional exploration of
these attributes is valuable to understand the nutrition environment in
these three communities.
\begin{figure}
\centering{
\includegraphics{04_estimation_files/figure-pdf/fig-nems-market-avail-1.pdf}
}
\caption{\label{fig-nems-market-avail}Relationship between NEMS
availability score and market basket score in each study community. Utah
county prices adjusted for 2021-2022 annual inflation.}
\end{figure}%
Figure~\ref{fig-nems-market-avail} presents the relationship between the
recorded NEMS availability score and the USDA market basket cost at the
stores by community and store type. In all three communities, the
relationship is strongly negative, with stores that stock more varieties
of goods also having overall lower prices for those goods. This is
emphasized by the bottom-right quadrants of these plots (high
availability, low-cost) being dominated by full-service grocery stores,
which have more availability and lower prices than convenience stores or
dollar stores, but require higher traffic and demand to make up for
their lower profit margins. Average prices in Utah County are lower than
prices in the other two communities across the availability spectrum;
this is true even after adjusting for 9.4\% annual inflation between
March 2021 and March 2022 in food products (Bureau of Labor Statistics,
2023).
\begin{figure}
\centering{
\includegraphics{04_estimation_files/figure-pdf/fig-nems-cost-avail-1.pdf}
}
\caption{\label{fig-nems-cost-avail}Relationship between NEMS
availability score and cost score in each study community.}
\end{figure}%
Figure~\ref{fig-nems-cost-avail} shows the relationship between the NEMS
availability and cost scores. In this case the relationship is generally
positive, with stores that stock more healthful options also placing
these options at competitive prices. Conversely, stores with fewer
options tend to place the options they do stock at a higher price point.
This relationship between availability and cost of healthful goods is
strongest in San Juan County, with convenience stores anchoring the
low-availability, high-premium quadrant for healthy food. It should be
noted that these convenience stores also exist in the Utah County
community, but we explicitly included them in the San Juan data
collection as they are the only food markets of any kind in multiple
towns, with dozens of miles separating towns from each other.
\subsection{Destination Choice}\label{sec-estimation}
Using the data collected and MNL destination choice model as described
in Section~\ref{sec-methods}, we estimate a series of model
specifications in each community with the \texttt{mlogit} package for R
(Croissant, 2020). To illustrate the role of different data elements on
destination choice, we develop and estimate four different utility
equations: \begin{align*}
\mathrm{Access} &= \beta_{MCLS}( k_{ij})\\
\mathrm{NEMS} &= \beta_{n-a} (\mathrm{NEMS-Availability}) + \beta_{n-c}\mathrm({NEMS-Cost})\\
\mathrm{Attributes} &= \beta_{mkt} (\mathrm{Market Basket}) + \beta_{reg} (\mathrm{Registers}) + \mathbf{\beta}_{type}(\mathrm{Type})\\
\mathrm{All} &= \mathrm{Access} + \mathrm{NEMS} + \mathrm{Attributes}\\
\end{align*} The Access model includes only the mode choice logsum
described in Equation~\ref{eq-mcls}. The NEMS model includes the NEMS
cost and availability scores describing the goods the store offers,
while the Attributes model contains information that might be more
conventionally available to shoppers including the size, type, and
average prices at the store. As the nutrition environment in each
community contains different types of stores, the specific type
coefficients differ by community. The All model contains all of the
other three sets of estimated coefficients.
\begin{table}
\caption{\label{tbl-utah-models}Estimated Models of Utah County}
\centering{
\centering
\begin{tabular}[t]{lcccc}
\toprule
& Access & NEMS & Attributes & All\\
\midrule
Mode Choice Log-sum & 8.063** & & & 8.686**\\
& (95.356) & & & (87.770)\\
NEMS Availability Score & & 0.032** & & -0.024**\\
& & (22.020) & & (-7.369)\\
NEMS Cost Score & & 0.035** & & 0.041**\\
& & (7.688) & & (5.946)\\
USDA Market Basket & & & -0.007** & -0.008**\\
& & & (-9.814) & (-8.855)\\
Registers & & & 0.056** & 0.065**\\
& & & (54.586) & (41.357)\\
Store Type: Dollar Store & & & 1.948** & 2.086**\\
& & & (56.866) & (41.000)\\
Store Type: Convenience Store & & & -1.990** & -2.422**\\
& & & (-7.400) & (-8.495)\\
Store Type: Other & & & -1.785** & -1.813**\\
& & & (-12.641) & (-11.991)\\
\midrule
AIC & 31,874.44 & 49,051.55 & 41,985.6 & 25,402.43\\
$\rho^2_0$ & 0.359 & 0.013 & 0.155 & 0.489\\
\bottomrule
\multicolumn{5}{l}{\rule{0pt}{1em}* p $<$ 0.05, ** p $<$ 0.01}\\
\multicolumn{5}{l}{\rule{0pt}{1em}t-statistics in parentheses}\\
\end{tabular}
}
\end{table}%
Table~\ref{tbl-utah-models} presents the estimated coefficients in the
Utah County community. In general, the utility coefficients are
statistically significant and in a direction that would be expected by
informed hypothesis. The Access model has a positive coefficient on its
mode choice log-sum term, which indicates that as the mode choice logsum
between a block group and a store increases --- indicating lower travel
costs between Census block groups and the store, because travel times in
Equation~\ref{eq-mcls} have a negative relationship with utility --- a
higher proportion of mobile devices residing in that block group are
observed to travel to that store. The NEMS model shows a positive
relationship between both environment variables and utility, indicating
that people are more likely to choose stores with higher availability of
healthy goods and more advantageous prices for those goods, all else
equal. The Attributes model suggests that people are less willing to
visit stores with higher prices, fewer registers, and convenience stores
or other non-standard grocery stores with the exception of dollar
stores, which they are \emph{more} attracted to. Combining all of these
variables in the All model retain the significance, direction, and basic
scale of all previous estimates with the exception of the NEMS
availability variable. In this case, it seems that the previous positive
relationship may have been a result of correlation between NEMS
availability and other variables such as cost or the number of
registers. And when controlling for all other variables, the role of
transportation access becomes somewhat more important than considering
only distance alone, implying that people are willing to travel somewhat
further for stores with attributes they value.
The overall fit of the four models in Table~\ref{tbl-utah-models} is
also revealing: the model with only NEMS variables against almost no
predictive power over randomly selecting any store in the community (as
revealed by the \(\rho_0^2\) statistic). Though all sets of variables
contribute to the overall fit, it is apparent that the bulk of model
explanatory power is due to transportation proximity.
\begin{table}
\caption{\label{tbl-sl-models}Estimated Models of West Salt Lake Valley}
\centering{
\centering
\begin{tabular}[t]{lcccc}
\toprule
& Access & NEMS & Attributes & All\\
\midrule
Mode Choice Log-sum & 9.870** & & & 12.044**\\
& (74.095) & & & (74.423)\\
NEMS Availability Score & & 0.129** & & 0.002\\
& & (71.898) & & (0.580)\\
NEMS Cost Score & & -0.036** & & 0.057**\\
& & (-7.857) & & (9.607)\\
USDA Market Basket & & & -0.010** & -0.006**\\
& & & (-13.065) & (-6.953)\\
Registers & & & 0.104** & 0.129**\\
& & & (70.019) & (52.110)\\
Store Type: Dollar Store & & & 0.325** & 0.494**\\
& & & (7.250) & (8.943)\\
Store Type: Other & & & 0.248* & 0.556**\\
& & & (2.211) & (4.614)\\
\midrule
AIC & 42,820.66 & 42,541.44 & 40,195.01 & 31,863.31\\
$\rho^2_0$ & 0.138 & 0.144 & 0.191 & 0.359\\
\bottomrule
\multicolumn{5}{l}{\rule{0pt}{1em}* p $<$ 0.05, ** p $<$ 0.01}\\
\multicolumn{5}{l}{\rule{0pt}{1em}t-statistics in parentheses}\\
\end{tabular}
}
\end{table}%
\begin{table}
\caption{\label{tbl-sj-models}Estimated Models of San Juan County}
\centering{
\centering
\begin{tabular}[t]{lcccc}
\toprule
& Access & NEMS & Attributes & All\\
\midrule
Mode Choice Log-sum & 0.709** & & & 1.205**\\
& (81.466) & & & (73.616)\\
NEMS Availability Score & & 0.139** & & 0.065**\\
& & (63.869) & & (12.857)\\
NEMS Cost Score & & 0.227** & & 0.055**\\
& & (35.386) & & (6.721)\\
USDA Market Basket & & & -0.011** & -0.033**\\
& & & (-15.376) & (-24.850)\\
Registers & & & 0.022** & 0.049**\\
& & & (20.517) & (25.969)\\
Store Type: Dollar Store & & & -2.222** & -1.377**\\
& & & (-46.835) & (-18.903)\\
Store Type: Convenience Store & & & -3.597** & -1.080**\\
& & & (-31.496) & (-7.747)\\
Store Type: Other & & & -1.451** & -1.331**\\
& & & (-28.680) & (-19.097)\\
\midrule
AIC & 40,811.4 & 35,119.64 & 37,230.81 & 23,556.77\\
$\rho^2_0$ & 0.179 & 0.293 & 0.251 & 0.526\\
\bottomrule
\multicolumn{5}{l}{\rule{0pt}{1em}* p $<$ 0.05, ** p $<$ 0.01}\\
\multicolumn{5}{l}{\rule{0pt}{1em}t-statistics in parentheses}\\
\end{tabular}
}
\end{table}%
Table~\ref{tbl-sl-models} presents the estimated coefficients in the
west Salt Lake County community, and Table~\ref{tbl-sj-models} presents
the estimated coefficients in San Juan County. The same general story
about coefficient direction and hypotheses applies in both of these
communities, except in regards to the NEMS variables. In Salt Lake, the
NEMS cost score appears negative when estimated alone but becomes
positive when other variables are included. In San Juan, these variables
are consistently positive. Additionally, the story of model fit is
reversed: in both Salt Lake and San Juan, the attributes of the store
explain more of the model fit than the transportation impedance term.
\begin{figure}
\centering{