0% found this document useful (0 votes)

68 views17 pages

Scenario 1:: Acknowlegement

This document acknowledges code adapted from two PySAL notebooks and provides contact information. It then imports libraries and connects to an Oracle Autonomous Database to run spatial queries on US traffic accident data stored in Oracle Spatial. The results are analyzed in Python using PySAL, a geospatial data science library, to calculate global and local spatial autocorrelation and identify statistically significant clusters ("hot spots") and outliers ("cold spots") of traffic accidents involving speeding.

Uploaded by

HOWARD NICOLAS SOLARTE MORA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views17 pages

Scenario 1:: Acknowlegement

Uploaded by

HOWARD NICOLAS SOLARTE MORA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Acknowlegement

Code for this PySAL analyses below was adapated from

http://pysal.org/notebooks/explore/esda/Spatial_Autocorrelation_for_Areal_Unit_Data.html (http://pysal.org/notebooks
/explore/esda/Spatial_Autocorrelation_for_Areal_Unit_Data.html)
http://pysal.org/notebooks/explore/giddy/Markov_Based_Methods.html (http://pysal.org/notebooks/explore/giddy
/Markov_Based_Methods.html) (Serge Rey [email protected], Wei Kang [email protected])

Contact

[email protected]

Import libraries needed for the 2 demo scenarios

In [33]: import cx_Oracle

import pandas as pd
import geopandas as gpd
import matplotlib.pyplot as plt
import pysal.lib as lp
from pysal.viz import mapclassify as mc
from pysal.explore import giddy
from pysal.explore import esda
from shapely.wkt import loads
import numpy as np
np.set_printoptions(precision=4)
%matplotlib inline

Scenario 1:
#### Oracle Spatial storing large multi year nationwide auto accident data
#### Spatial query calculates % of traﬃc accidents that involved speeding within geograpahic tiles covering the US
#### Result pulled into Python for spatial statistical analysis

Connect to Oracle Autonomous Database

In [2]: file = open('/opt/dbconfig_adw.txt', 'r')

user = file.readline().strip()
pwd = file.readline().strip()
host_port_service = file.readline().strip()
connection = cx_Oracle.connect(user, pwd, host_port_service)
cursor = connection.cursor()

Handle CLOBs

1 of 17 8/10/20, 10:33 AM
In [3]: def OutputTypeHandler(cursor, name, defaultType, size, precision, scale):
if defaultType == cx_Oracle.CLOB:
return cursor.var(cx_Oracle.LONG_STRING, arraysize = cursor.arraysize)
connection.outputtypehandler = OutputTypeHandler

Run query returning "WKT" geometry

In [4]: cursor.execute("""
SELECT sdo_util.to_wktgeometry(geometry) as geometry
FROM grid_us
""")
gdf = gpd.GeoDataFrame(cursor.fetchall(), columns = ['geometry'])
gdf['geometry'] = gpd.GeoSeries(gdf['geometry'].apply(lambda x: loads(x)))
gdf

Out[4]:
geometry

0 POLYGON ((-68.90625 46.40625, -67.50000 46.406...

1 POLYGON ((-70.31250 45.00000, -68.90625 45.000...

2 POLYGON ((-68.90625 45.00000, -67.50000 45.000...

3 POLYGON ((-101.25000 43.59375, -99.84375 43.59...

4 POLYGON ((-102.65625 43.59375, -101.25000 43.5...

... ...

473 POLYGON ((-88.59375 37.96875, -87.18750 37.968...

474 POLYGON ((-92.81250 42.18750, -91.40625 42.187...

475 POLYGON ((-94.21875 47.81250, -92.81250 47.812...

476 POLYGON ((-84.37500 46.40625, -82.96875 46.406...

477 POLYGON ((-70.31250 46.40625, -68.90625 46.406...

478 rows × 1 columns

View result

2 of 17 8/10/20, 10:33 AM
In [5]: fig, ax = plt.subplots(figsize=(10,5))
ax.set_clip_on(False)
ax.set_facecolor("lightblue")
result=gdf.plot(ax=ax,linewidth=1.5,facecolor="#cccccc",edgecolor="darkgrey",legen
d=False)
leg=ax.get_legend()
us48shp = gpd.read_file('data/us48.shp')
us48shp.plot(ax=ax, facecolor='none', edgecolor='white')

Out[5]: <matplotlib.axes._subplots.AxesSubplot at 0x7f63663c5b50>

Query for "% of traﬃc accidents that involved speeding" per tile

In [6]: cursor.execute("""
WITH x as (
SELECT b.grid_key as grid_key, 100*sum(a.speeding_involved)/count(speeding_invo
lved) as metric
FROM fars_pysal_mv a, grid_us b
WHERE sdo_anyinteract(a.geometry,b.geometry)='TRUE'
GROUP BY b.grid_key having count(b.grid_key)>10 )
SELECT x.grid_key, x.metric, sdo_util.to_wktgeometry(y.geometry) as geometry
FROM x, grid_us y
WHERE x.grid_key=y.grid_key
""")
gdf = gpd.GeoDataFrame(cursor.fetchall(), columns = ['grid_key','metric', 'geometr
y'])
gdf['geometry'] = gpd.GeoSeries(gdf['geometry'].apply(lambda x: loads(x)))
gdf.head()

Out[6]:
grid_key metric geometry

0 253 46.341463 POLYGON ((-68.90625 46.40625, -67.50000 46.406...

1 254 47.368421 POLYGON ((-70.31250 45.00000, -68.90625 45.000...

2 255 38.461538 POLYGON ((-68.90625 45.00000, -67.50000 45.000...

3 256 9.523810 POLYGON ((-101.25000 43.59375, -99.84375 43.59...

4 257 29.411765 POLYGON ((-102.65625 43.59375, -101.25000 43.5...

3 of 17 8/10/20, 10:33 AM
View the result

In [7]: fig, ax = plt.subplots(figsize=(11,5))

ax.set_clip_on(False)
ax.set_facecolor("lightblue")
result=gdf.plot(ax=ax, column='metric',cmap='OrRd',linewidth=0.3,edgecolor="lightg
rey",legend=True)
leg=ax.get_legend()
us48shp.plot(ax=ax, facecolor='none', edgecolor='white')

Out[7]: <matplotlib.axes._subplots.AxesSubplot at 0x7f6365b9ea10>

Analyze with PySAL (geospatial data science library) http://pysal.org/

(http://pysal.org/)

Spatial autocorrelation

Correlation of home prices and household income: 2 variables... makes sense

Correlation of home prices and location: 1 variable and itself across space... less intuitive
Spatial autocorrelation measures similarity of a variable to nearby values
Provides local and global measures
Incorporates "Spatial Lag" (weighted avg of neighbors)

Calculate global spatial sutocorrelation

4 of 17 8/10/20, 10:33 AM
In [8]: wq = lp.weights.Queen.from_dataframe(gdf)
wq.transform = 'r'
y = gdf['metric']
ylag = lp.weights.lag_spatial(wq, y)
np.random.seed(12345)
mi = esda.Moran(y, wq)
mi.I

Out[8]: 0.3521637718312477

View in comparison to spatial randomness

In [9]: import seaborn as sbn

sbn.kdeplot(mi.sim, shade=True)
plt.vlines(mi.I, 0, 1, color='r')
plt.vlines(mi.EI, 0,1)
plt.xlabel("Moran's I")

Out[9]: Text(0.5, 0, "Moran's I")

Calculate local spatial autocorrelation

In [10]: li = esda.Moran_Local(y, wq)

p_sim is probability that value is spartially random

In [11]: li.p_sim[0:20]

Out[11]: array([0.04 , 0.025, 0.021, 0.22 , 0.04 , 0.254, 0.3 , 0.482, 0.41 ,
0.07 , 0.369, 0.001, 0.01 , 0.05 , 0.002, 0.075, 0.159, 0.008,
0.001, 0.103])

q is quandrant location: 1 HH (hot spot), 2 LH, 3 LL (cold spot), 4 HL

5 of 17 8/10/20, 10:33 AM
In [12]: li.q[0:20]

Out[12]: array([1, 1, 1, 3, 3, 1, 3, 2, 4, 1, 3, 3, 1, 3, 3, 3, 1, 3, 3, 1])

Hot spots: quadrant 1 and probability of spatial randomness <5%

In [13]: sig = li.p_sim < 0.05

hotspot=sig*li.q==1
spots = ['n.sig.', 'hot spot']
labels = [spots[i] for i in hotspot*1]
from matplotlib import colors
hmap = colors.ListedColormap(['red', 'lightgrey'])
f, ax = plt.subplots(1, figsize=(12,5))
ax.set_clip_on(False)
ax.set_facecolor("lightblue")
gdf.assign(cl=labels).plot(column='cl', categorical=True, \
k=2, cmap=hmap, linewidth=0.1, ax=ax, \
edgecolor='black', legend=True)
us48shp.plot(ax=ax, facecolor='none', edgecolor='white')
plt.show()

Cold spots: quadrant 3 and probability of spatial randomness <5%

6 of 17 8/10/20, 10:33 AM
In [14]: coldspot=sig*li.q==3
spots = ['n.sig.', 'cold spot']
labels = [spots[i] for i in coldspot*1]
from matplotlib import colors
hmap = colors.ListedColormap(['darkblue', 'lightgrey'])
f, ax = plt.subplots(1, figsize=(12,5))
ax.set_clip_on(False)
ax.set_facecolor("lightblue")
gdf.assign(cl=labels).plot(column='cl', categorical=True, \
k=2, cmap=hmap, linewidth=0.1, ax=ax, \
edgecolor='black', legend=True)
us48shp.plot(ax=ax, facecolor='none', edgecolor='white')
plt.show()

Scenario 2:
#### For a region of interest, how is unemployment over time related to location?
#### Determine probabilities of changes based on 'regional' unemployment

Connect to Oracle Autonomous Database

In [15]: file = open('/opt/dbconfig_adw.txt', 'r')

user = file.readline().strip()
pwd = file.readline().strip()
host_port_service = file.readline().strip()
connection = cx_Oracle.connect(user, pwd, host_port_service)
cursor = connection.cursor()

Handle CLOBs

In [16]: def OutputTypeHandler(cursor, name, defaultType, size, precision, scale):

if defaultType == cx_Oracle.CLOB:
return cursor.var(cx_Oracle.LONG_STRING, arraysize = cursor.arraysize)
connection.outputtypehandler = OutputTypeHandler

7 of 17 8/10/20, 10:33 AM
Preview the unemployment data

In [17]: cursor.execute("""
SELECT statefips, countyfips, year, unemp_pct
FROM bls_unemployment
where rownum<10
""")
pd.DataFrame(cursor.fetchall(), columns = ['STATEFIPS','COUNTYFIPS','YEAR', 'UNEMP
_PCT'])

Out[17]:
STATEFIPS COUNTYFIPS YEAR UNEMP_PCT

0 13 013 2000 3.0

1 13 015 2000 3.7

2 13 017 2000 4.9

3 13 019 2000 4.0

4 13 021 2000 4.4

5 13 023 2000 4.3

6 13 025 2000 4.2

7 13 027 2000 4.7

8 13 029 2000 3.2

Create region of interest: 100 mi buﬀer around Kansas, Missouri, Oklahoma, Tennessee, Kentucky, Arkansas

In [18]: cursor.execute("""
select sdo_util.to_wktgeometry(
sdo_geom.sdo_buffer(
sdo_aggr_union(sdoaggrtype(c.geom, 0.05)), 100, 0.05, 'unit=MILE')) a
s geometry
FROM states c
WHERE state in ('Kansas','Missouri','Oklahoma','Tennessee','Kentucky','Arkansas')
"""
)
gdfAOI = gpd.GeoDataFrame(cursor.fetchall(), columns = ['geometry'])
gdfAOI['geometry'] = gpd.GeoSeries(gdfAOI['geometry'].apply(lambda x: loads(x)))
gdfAOI['geometry']

Out[18]: 0 POLYGON ((-95.58789 32.38036, -95.58620 32.377...

Name: geometry, dtype: geometry

View the region

8 of 17 8/10/20, 10:33 AM
In [19]: f, ax = plt.subplots(1, figsize=(12,5))
ax.set_facecolor("lightblue")
us48shp.plot(ax=ax, facecolor='white', edgecolor='#c0c0c0')
gdfAOI.plot(ax=ax, facecolor='none', edgecolor='darkblue', hatch='|||')

Out[19]: <matplotlib.axes._subplots.AxesSubplot at 0x7f635ec9c3d0>

Prepare unemployment data for analysis (pivot, add geometry, spatial filter)

In [20]: cursor.execute("""
WITH

-- pivot the unemployment data

unemp_data as (
SELECT * FROM
(select statefips, countyfips, unemp_pct, year from bls_unemployment)
PIVOT( avg(unemp_pct)
FOR year IN (1996,1997,1998,1999,2000,2001,2002,2003,
2004,2005,2006,2007,2008,2009,2010,2011,
2012,2013,2014,2015,2016,2017,2018) )
),

-- define region of interest

aoi as (
select sdo_geom.sdo_buffer(
sdo_aggr_union(sdoaggrtype(c.geom, 0.05)), 100, 0.05, 'unit=MILE') as
geom
FROM states c
WHERE state in ('Kansas','Missouri','Oklahoma','Tennessee','Kentucky','Arkansas
')
)

-- add geometry, county/state names, filter for counties in the region of interest
SELECT c.state, c.county, a.*, sdo_util.to_wktgeometry(b.geom) as geometry
FROM unemp_data a, cb_2018_us_county_500k b, fips_county c, aoi
WHERE a.statefips=b.statefp and a.countyfips=b.countyfp
AND a.statefips=c.state_fips and a.countyfips=c.county_fips
AND sdo_anyinteract(b.geom,aoi.geom)='TRUE'
""")

Out[20]: <cx_Oracle.Cursor on <cx_Oracle.Connection to demo_data@spatial2_medium>>

9 of 17 8/10/20, 10:33 AM
In [21]: gdf = gpd.GeoDataFrame(cursor.fetchall(), columns = ['STATE','COUNTY','STATEFIPS
','COUNTYFIPS','1996','1997',
'1998','1999','2000','2001','
2002','2003','2004','2005',
'2006','2010','2007','2008','
2009','2011','2012','2013',
'2014','2015','2016','2017','
2018','geometry'])
gdf['geometry'] = gpd.GeoSeries(gdf['geometry'].apply(lambda x: loads(x)))
gdf.head()

Out[21]:
STATE COUNTY STATEFIPS COUNTYFIPS 1996 1997 1998 1999 2000 2001 ... 2009 2011 2012

0 Mississippi Leake 28 079 5.0 5.2 4.8 6.1 6.3 5.8 ... 9.7 9.7 9.2

1 Mississippi Lee 28 081 4.6 4.7 4.1 3.8 4.4 4.9 ... 10.4 9.6 8.3

2 Mississippi Leflore 28 083 9.2 9.3 8.9 8.4 8.8 9.0 ... 15.3 14.9 14.0

3 Mississippi Lincoln 28 085 5.4 5.5 4.9 4.8 5.4 5.4 ... 10.5 9.8 8.6

4 Mississippi Lowndes 28 087 6.5 7.3 7.5 5.8 5.5 6.4 ... 11.8 10.5 9.1

5 rows × 28 columns

View regional unemployment over time

10 of 17 8/10/20, 10:33 AM
In [22]: index_year = range(2004,2018,2)
fig, axes = plt.subplots(nrows=2, ncols=3,figsize = (15,7))
for i in range(2):
for j in range(3):
ax = axes[i,j]
gdf.plot(ax=ax, column=str(index_year[i*3+j]), cmap='OrRd', scheme='quanti
les', legend=True)
us48shp.plot(ax=ax, facecolor='none', edgecolor='#c0c0c0')
gdfAOI.plot(ax=ax, facecolor='none', edgecolor='darkblue')
ax.set_title('unemployment pct %s Quintiles'%str(index_year[i*3+j]))
ax.axis('off')
leg = ax.get_legend()
leg.set_bbox_to_anchor((0.18, 0.0, 0.16, 0.2))
plt.tight_layout()

Temporal analysis with Discrete Markov Chains (DMC)

DMC is only suitable for certain scenarios (https://en.wikipedia.org/wiki/Markov_chain (https://en.wikipedia.org
/wiki/Markov_chain))
Input: Set of observation time series
Result: Probabilities and predictions of future observation transitions
PySAL supports Classic Markov and 2 spatial variants Spatial Markov and LISA Markov
Classic Hello World Example:

|-> 80% Sunny day

Sunny day |
|-> 20% Rainy day

|-> 70% Sunny day

Rainy day |
|-> 30% Rainy day

11 of 17 8/10/20, 10:33 AM
Prepare regional unemployment data

Data needs to be perpared as "array of time series" where the values are classified unemployment (we'll use quintile) per
year
[
[county 1 unemployment time series],
[county 2 unemployment time series],
...
[county n unemployment time series]
]

For example [2, 3, 3, 1, 4, 2, ....] is a time series showing 1st year unemployment in the 2nd quintile, 2nd
year unemployment in the 3rd quintile, and so on.

Generate arrays of binned values, where each array is a year with values for each county

In [23]: binnedData = np.array([mc.Quantiles(gdf[str(y)],k=5).yb for y in range(2004,201

8)])
print(binnedData.shape); print(); print(binnedData)

(14, 1271)

[[3 2 4 ... 4 2 3]
[4 3 4 ... 4 2 2]
[4 3 4 ... 3 2 3]
...
[3 2 4 ... 4 3 4]
[3 2 4 ... 4 3 4]
[4 2 4 ... 4 4 4]]

Transpose so that each array is a county with values for each year (time series)

In [24]: binnedDataT = binnedData.transpose()

print(binnedDataT.shape); print(); print(binnedDataT)

(1271, 14)

[[3 4 4 ... 3 3 4]
[2 3 3 ... 2 2 2]
[4 4 4 ... 4 4 4]
...
[4 4 3 ... 4 4 4]
[2 2 2 ... 3 3 4]
[3 2 3 ... 4 4 4]]

Create "aspatial" Markov instance

12 of 17 8/10/20, 10:33 AM
In [25]: m5 = giddy.markov.Markov(binnedDataT)

The Markov Chain is irreducible and is composed by:

1 Recurrent class (indices):
[0 1 2 3 4]
0 Transient classes.
The Markov Chain has 0 absorbing states.

View transition counts

In [26]: m5.transitions

Out[26]: array([[2946., 463., 44., 10., 5.],

[ 483., 2057., 673., 123., 24.],
[ 42., 718., 1663., 686., 117.],
[ 14., 98., 740., 1813., 584.],
[ 3., 15., 87., 638., 2477.]])

View transition probabilities

In [27]: print(m5.p)

[[0.8495 0.1335 0.0127 0.0029 0.0014]

[0.1438 0.6122 0.2003 0.0366 0.0071]
[0.013 0.2226 0.5155 0.2126 0.0363]
[0.0043 0.0302 0.2278 0.558 0.1797]
[0.0009 0.0047 0.027 0.1981 0.7693]]

View first mean passage time

In [28]: print(giddy.ergodic.fmpt(m5.p))

[[ 4.6041 7.9945 14.8811 22.1314 35.6067]

[22.8275 4.9295 9.0061 16.3859 29.9697]
[31.2434 10.4107 5.1934 10.635 24.3351]
[35.8927 15.3302 7.5034 5.0902 17.1953]
[39.2734 18.7488 11.0187 5.9992 5.2373]]

Before starting Spatial Markov, test for spatial randomness

Global spatial autocorrelation over time

Compare to random

13 of 17 8/10/20, 10:33 AM
In [29]: wq = lp.weights.Queen.from_dataframe(gdf)
years = np.arange(2004,2019)
mitest = [esda.Moran(gdf[str(x)], wq) for x in years]
res = np.array([(mi.I, mi.EI, mi.seI_norm, mi.sim[974]) for mi in mitest])
fig, ax = plt.subplots(nrows=1, ncols=1,figsize = (10,5) )
ax.plot(years, res[:,0], label='Moran\'s I')
ax.plot(years, res[:,1]+1.96*res[:,2], label='Upper bound',linestyle='dashed')
ax.plot(years, res[:,1]-1.96*res[:,2], label='Lower bound',linestyle='dashed')
ax.set_title("Global spatial autocorrelation",fontdict={'fontsize':15})
ax.set_xlim([2004, 2018])
ax.legend()

Out[29]: <matplotlib.legend.Legend at 0x7f635f46ba10>

Spatial Markov

Markov for regional context, using classification (bins) of Spatial Lag

Spatial Markov classifies the lag and observations for us
We will use quintiles as the classification
Result is Markov for each quintile of Spatial Lag

14 of 17 8/10/20, 10:33 AM
In [30]: npData = np.array([gdf[str(y)] for y in range(2004,2018)])
sm = giddy.markov.Spatial_Markov(npData.transpose(), wq, fixed = True, k = 5, m=5,
fill_empty_classes=True)
sm.summary()

--------------------------------------------------------------
Spatial Markov Test
--------------------------------------------------------------
Number of classes: 5
Number of transitions: 16523
Number of regimes: 5
Regime names: LAG0, LAG1, LAG2, LAG3, LAG4
--------------------------------------------------------------
Test LR Chi-2
Stat. 651.303 673.460
DOF 72 72
p-value 0.000 0.000
--------------------------------------------------------------
P(H0) C0 C1 C2 C3 C4
C0 0.726 0.127 0.077 0.054 0.016
C1 0.291 0.390 0.117 0.123 0.079
C2 0.042 0.329 0.359 0.114 0.155
C3 0.039 0.053 0.303 0.417 0.188
C4 0.023 0.078 0.078 0.244 0.576
--------------------------------------------------------------
P(LAG0) C0 C1 C2 C3 C4
C0 0.770 0.120 0.066 0.034 0.011
C1 0.447 0.280 0.096 0.120 0.057
C2 0.110 0.329 0.315 0.178 0.068
C3 0.077 0.154 0.462 0.154 0.154
C4 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------
P(LAG1) C0 C1 C2 C3 C4
C0 0.599 0.148 0.104 0.117 0.033
C1 0.270 0.388 0.127 0.131 0.084
C2 0.035 0.393 0.258 0.118 0.197
C3 0.051 0.082 0.378 0.306 0.184
C4 0.000 0.000 0.111 0.444 0.444
--------------------------------------------------------------
P(LAG2) C0 C1 C2 C3 C4
C0 0.615 0.156 0.138 0.083 0.009
C1 0.257 0.442 0.112 0.102 0.087
C2 0.040 0.349 0.363 0.099 0.148
C3 0.007 0.055 0.362 0.346 0.230
C4 0.016 0.065 0.048 0.387 0.484
--------------------------------------------------------------
P(LAG3) C0 C1 C2 C3 C4
C0 0.429 0.286 0.143 0.143 0.000
C1 0.170 0.491 0.104 0.179 0.057
C2 0.042 0.256 0.434 0.125 0.144
C3 0.032 0.040 0.313 0.425 0.191
C4 0.016 0.034 0.030 0.423 0.496
--------------------------------------------------------------
P(LAG4) C0 C1 C2 C3 C4
C0 0.000 0.000 0.000 0.000 0.000
C1 0.000 1.000 0.000 0.000 0.000
C2 0.105 0.158 0.395 0.211 0.132
C3 0.088 0.082 0.201 0.491 0.139
C4 0.025 0.089 0.090 0.197 0.598
--------------------------------------------------------------

15 of 17 8/10/20, 10:33 AM
Visualize the Spatial Markov result

In [31]: fig, axes = plt.subplots(2,3,figsize = (15,10))

for i in range(2):
for j in range(3):
ax = axes[i,j]
if i==0 and j==0:
p_temp = sm.p
im = ax.imshow(p_temp,cmap = "Reds",vmin=0, vmax=1)
ax.set_title("Pooled",fontsize=18)
else:
p_temp = sm.P[i*3+j-1]
im = ax.imshow(p_temp,cmap = "Reds",vmin=0, vmax=1)
ax.set_title("Spatial Lag %d"%(i*3+j),fontsize=18)
for x in range(len(p_temp)):
for y in range(len(p_temp)):
text = ax.text(y, x, round(p_temp[x, y], 2),
ha="center", va="center", color="black")

fig.subplots_adjust(right=0.92)
cbar_ax = fig.add_axes([0.95, 0.228, 0.01, 0.5])
fig.colorbar(im, cax=cbar_ax)

Out[31]: <matplotlib.colorbar.Colorbar at 0x7f635d9f1350>

From the summary above, for a county with unemployment in 5th quintile

the probability of remaining in 5th quintile is ~60% if its neighbors are in the 5th quintile
the probability of remaining in 5th quintile is ~50% if its neighbors are in the 4th quintile

16 of 17 8/10/20, 10:33 AM
Spatially conditional first mean passage times when neighbors are in 2nd quintile

In [32]: print(sm.F[1])

[[ 5.4579 6.369 6.1909 6.4034 10.3668]

[ 8.7361 4.9012 5.8694 6.0232 9.5586]
[11.7555 4.8698 4.9459 5.6106 8.1261]
[12.6411 6.7157 3.8864 4.4734 7.7496]
[14.264 8.1465 4.9091 2.9221 5.3472]]

From the summary above, for a county with neighbors in the 2nd quintile

it will take roughy 5 years to transition to the 5th quintile to 3nd quintile
it will take roughy 8 years to transition to the 5th quintile to 2nd quintile

17 of 17 8/10/20, 10:33 AM

Python Geospatial Data Guide
No ratings yet
Python Geospatial Data Guide
33 pages
Gds Scipy16 PDF
No ratings yet
Gds Scipy16 PDF
190 pages
A Beginners Guide To Geospatial Data Analysis
No ratings yet
A Beginners Guide To Geospatial Data Analysis
11 pages
21AD71 Module 4 Textbook
No ratings yet
21AD71 Module 4 Textbook
107 pages
An Introduction To GeoPandas. Everything You Need To Get You Started - by Mark Friese - Aug, 2022 - Medium
No ratings yet
An Introduction To GeoPandas. Everything You Need To Get You Started - by Mark Friese - Aug, 2022 - Medium
9 pages
Open Data Cube Guide for Analysts
No ratings yet
Open Data Cube Guide for Analysts
1 page
Local Spatial Autocorrelation - Geographic Data Science With Python
No ratings yet
Local Spatial Autocorrelation - Geographic Data Science With Python
24 pages
Unit 12 GIS Analysis Using R-I
No ratings yet
Unit 12 GIS Analysis Using R-I
24 pages
Module 4
No ratings yet
Module 4
55 pages
Exercise6 Solution
No ratings yet
Exercise6 Solution
8 pages
Advanced Visualization For Data Scientists With Matplotlib
No ratings yet
Advanced Visualization For Data Scientists With Matplotlib
38 pages
Geoplotlib Research Paper PDF
No ratings yet
Geoplotlib Research Paper PDF
21 pages
Kriging vs. Simulation, A 2D Map Example - GeostatsPy Well-Documented Demonstration Geostatistical Workflows
No ratings yet
Kriging vs. Simulation, A 2D Map Example - GeostatsPy Well-Documented Demonstration Geostatistical Workflows
16 pages
Where Can Buy Machine Learning On Geographical Data Using Python 1st Edition Joos Korstanje Ebook With Cheap Price
100% (6)
Where Can Buy Machine Learning On Geographical Data Using Python 1st Edition Joos Korstanje Ebook With Cheap Price
66 pages
Shaheed Zulfikar Ali Bhutto Institute of Science & Technology
No ratings yet
Shaheed Zulfikar Ali Bhutto Institute of Science & Technology
12 pages
Chapter 2
No ratings yet
Chapter 2
13 pages
CovidData - Ipynb - Colaboratory
No ratings yet
CovidData - Ipynb - Colaboratory
4 pages
GIS 4653/5653: Spatial Programming and GIS
No ratings yet
GIS 4653/5653: Spatial Programming and GIS
86 pages
Geopandas 50 Exercises
No ratings yet
Geopandas 50 Exercises
2 pages
Chapter 3 Plotting and Visualizing Your Data
No ratings yet
Chapter 3 Plotting and Visualizing Your Data
14 pages
West Rox
No ratings yet
West Rox
29 pages
Data Exploration and Visualization Laboratory - AD3301 - Lab Manual
No ratings yet
Data Exploration and Visualization Laboratory - AD3301 - Lab Manual
55 pages
Intro To Spatial Data Analysis in Python - FOSS4G NA 2015
100% (1)
Intro To Spatial Data Analysis in Python - FOSS4G NA 2015
27 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
MLRecord
No ratings yet
MLRecord
24 pages
Python Lecture 6 (2025)
No ratings yet
Python Lecture 6 (2025)
27 pages
AI & Data Science Lab Record
No ratings yet
AI & Data Science Lab Record
28 pages
Lab Record Dev
No ratings yet
Lab Record Dev
20 pages
Codigo 2
No ratings yet
Codigo 2
21 pages
Python
No ratings yet
Python
16 pages
PRO Level Data Visualization Cheat Sheet
No ratings yet
PRO Level Data Visualization Cheat Sheet
15 pages
Exercise3 Solution
No ratings yet
Exercise3 Solution
19 pages
Plot Per Columns Features Kde or Normal Distribution Seaborn in Details
No ratings yet
Plot Per Columns Features Kde or Normal Distribution Seaborn in Details
272 pages
Unit V Notes
No ratings yet
Unit V Notes
11 pages
Geopandas Documentation: Release 0.2.0.dev
No ratings yet
Geopandas Documentation: Release 0.2.0.dev
45 pages
Autoregressive Model with Statsmodels
No ratings yet
Autoregressive Model with Statsmodels
10 pages
Vector Data Exploration With R
No ratings yet
Vector Data Exploration With R
9 pages
Spatial Gap Index Analysis Guide
No ratings yet
Spatial Gap Index Analysis Guide
27 pages
Basemap Toolkit for Python Mapping
No ratings yet
Basemap Toolkit for Python Mapping
24 pages
Module 4-Geopandas
No ratings yet
Module 4-Geopandas
35 pages
Spatial Statistics in R
No ratings yet
Spatial Statistics in R
29 pages
DEV Manual
No ratings yet
DEV Manual
23 pages
01 Matplotlib PDF
No ratings yet
01 Matplotlib PDF
9 pages
Lecture 7.1 Spatial Analysis Raster Data
No ratings yet
Lecture 7.1 Spatial Analysis Raster Data
55 pages
Thematic Maps
No ratings yet
Thematic Maps
13 pages
Combinepdf
No ratings yet
Combinepdf
30 pages
Practical 5
No ratings yet
Practical 5
6 pages
@PowerBI - Ir - Data Visualization Cheat Sheet
No ratings yet
@PowerBI - Ir - Data Visualization Cheat Sheet
15 pages
Presentation Geoxp
No ratings yet
Presentation Geoxp
9 pages
Compiler Queue Syntax Hash Integer Stack
No ratings yet
Compiler Queue Syntax Hash Integer Stack
3 pages
Feature Engineering with Pandas Techniques
No ratings yet
Feature Engineering with Pandas Techniques
14 pages
Data Analysis With Python - Jupyter Notebook
No ratings yet
Data Analysis With Python - Jupyter Notebook
10 pages
Spatial Statistics in R
No ratings yet
Spatial Statistics in R
29 pages
Unit 4 DSF
No ratings yet
Unit 4 DSF
15 pages
Dev Lab Record
No ratings yet
Dev Lab Record
31 pages
Matploib
No ratings yet
Matploib
24 pages
Pandas and Numpy
No ratings yet
Pandas and Numpy
9 pages
2011 Csde Esda Exercise
No ratings yet
2011 Csde Esda Exercise
7 pages
Dialaw@Y Ip System Specifications
No ratings yet
Dialaw@Y Ip System Specifications
15 pages
Connector Connector: GH Connector GH Connector
No ratings yet
Connector Connector: GH Connector GH Connector
4 pages
Resume - Suchita Chavan
No ratings yet
Resume - Suchita Chavan
2 pages
Industrial Ethernet/IP Module Guide
No ratings yet
Industrial Ethernet/IP Module Guide
2 pages
CLASS 3 Computer PT1 (PARTII) REVISION
100% (1)
CLASS 3 Computer PT1 (PARTII) REVISION
4 pages
Online Teaching and Learning
No ratings yet
Online Teaching and Learning
4 pages
Web Security Challenge Guide
No ratings yet
Web Security Challenge Guide
13 pages
Project Scheduling Essentials
No ratings yet
Project Scheduling Essentials
8 pages
Legal and Ethical Compliance in Community Services
No ratings yet
Legal and Ethical Compliance in Community Services
3 pages
VMware Monitoring Setup
No ratings yet
VMware Monitoring Setup
5 pages
Coffee Shops in Academic Libraries
No ratings yet
Coffee Shops in Academic Libraries
30 pages
Image - Fusion 1
No ratings yet
Image - Fusion 1
4 pages
Variant Tables: 1. Overview
100% (2)
Variant Tables: 1. Overview
11 pages
1.0 System Analysis and Design 9th Edition - Shelly Cashman-56-57
No ratings yet
1.0 System Analysis and Design 9th Edition - Shelly Cashman-56-57
2 pages
Boxer Marlin - User Manual
No ratings yet
Boxer Marlin - User Manual
43 pages
Lateral Movement
No ratings yet
Lateral Movement
42 pages
(2020129) On Layer Normalization in The Transformer Architecture
No ratings yet
(2020129) On Layer Normalization in The Transformer Architecture
17 pages
Examination Guidelines - HWI 2025 Qualifiers
No ratings yet
Examination Guidelines - HWI 2025 Qualifiers
3 pages
Lesson 7.2 Online Platforms
No ratings yet
Lesson 7.2 Online Platforms
22 pages
Market Basket Analysis Using Apriori Algorithm Gro
No ratings yet
Market Basket Analysis Using Apriori Algorithm Gro
9 pages
xMxx0 HandheldPlatform GSG Es Oct2012
No ratings yet
xMxx0 HandheldPlatform GSG Es Oct2012
52 pages
eMMS: Mobile Governance for MGNREGS
No ratings yet
eMMS: Mobile Governance for MGNREGS
3 pages
Edgistify Company Profile
No ratings yet
Edgistify Company Profile
9 pages
Data Structures Lab Programs Guide
No ratings yet
Data Structures Lab Programs Guide
87 pages
Wired Remote Control Guide
100% (1)
Wired Remote Control Guide
28 pages
Gamification vs. Game-Based Learning
No ratings yet
Gamification vs. Game-Based Learning
6 pages
Image Registra, On With 3D Slicer: Sonia Pujol, PH.D
No ratings yet
Image Registra, On With 3D Slicer: Sonia Pujol, PH.D
36 pages
BMS Project Report for CSE Students
No ratings yet
BMS Project Report for CSE Students
19 pages
Quantum Computing for MIS Students
No ratings yet
Quantum Computing for MIS Students
10 pages
1.1 Elements and Principles of Art
No ratings yet
1.1 Elements and Principles of Art
27 pages