Data & Tools

Datasets

The CONNECTExplorer is a web-based application that offers EagleView’s high-resolution aerial imagery gives organizations the finest level of detail to rapidly make crucial everyday decisions. EagleView imagery (formerly Pictometry) is captured with patented proprietary camera systems that enable you to see the world from multiple perspectives.

  • Resolution: Detail as fine as .75 inches ground sample distance—16x higher resolution than satellite
  • Coverage: 98% coverage of the United States population
  • Frequency: Regularly updated imagery to provide current and historical views

CONNECTExplorer allows the users to view and analyze high-resolution aerial imagery, including high-quality oblique and orthogonal images. The application offers useful functions such as measuring height and area of buildings, spaces, or even trees. The annotation details can be exported as CSV or KML files. The images can be exported as graphic files including JPEG, GIF, PNG, TIFF, GeoTIFF (Orthos only) which are generated in Zip file,  KMZ (LatLonQuad or LatLonBo), or PDFs. The exported aerial imagery can be integrated into any GIS software.

Getting an account

Access to CONNECTExplorer is provided on a case-by-case basis because Yale University Library holds 25 accounts. Please submit your request to geospatialhelp@yale.edu and we will contact you to discuss your project and the length of time you will require the account. 

Training for CONNECTExplorer

PolicyMap offers a large selection of variables about communities. In addition to business data, there are are also variables that cover many aspects of housing, lending, income, health and quality of life. These variables have been collected by The Reinvestment Fund, “a nonprofit community development financial institution that works across the Mid-Atlantic”. Browse their wide selection of variables to get a sense of the time periods and locations covered, as some of the coverage can be for a relatively small area. 

Signing In

You can reach Yale’s PolicyMap subscription at yale.policymap.com/maps. If you do not want everyone at Yale to have access to your data, you will need to create an individual PolicyMap account. These accounts are free with the Yale site license.

Creating Regions

PolicyMap allows you to create custom regions that you can use for analysis. These areas are saved and are one of the geographic options when you create reports. There are three ways to create regions:

Draw Custom Region

As the instructions will tell you, to Draw a Custom Region you just have to click points on the map until you have the shape you want, and then click on the first point again to close the polygon.

Assemble Custom Region

You can assemble a custom region from existing boundaries. In addition to the standard census boundaries like tracts and block groups, you can also choose to create a region based on School Districts, Congressional Districts or Community Development Block Grant (CDBG) areas. All of the regions you choose will have to be of the same type.

Radius Region

The Radius Region option allows you to etiher type in an address or click a point on the map, choose a radius, and then create an area based on those two pieces of information.

Displaying Variables

The variables available in PolicyMap are organized into categories available as tabs at the top of the map window. Inside the main categories are subcategories, organized by whether the data will be displayed as Data Layers, where the variable will be represented as different colored areas on the map (a choropleth map), or Data Points, where the data are shown as distinct points on the map.

In the main map window, you can choose one variable that is displayed as a Data Layer and multiple variables are displayed as Data Points to view in your map at the same time. Each of the variables shown as Data Points will appear with a different symbol, which can’t be changed. You can reduce the number of points by clicking on the “Filter Points” button and choosing the attribute that you want to use to narrow down the points, followed by the value that you are interested in. An example of this is displaying the Public Schools location variable and then only showing schools that have a “Grades Offered” value of “12”. Now only schools that offer grade 12 will be shown on the map.

You can change the color of the Data Layer using the “Edit>Colors” menu and choosing one of the pre-defined color ramps. You can also use “Edit>Ranges” to manually change the number of ranges used for the data, as well as the break points. 

If you want to use Data Layers to filter your data, you can create a 3-Layer Map. Choose 3-Layer Maps in the bar at the very top of the screen. You will be able to add 3 Data Layers to the map, and the addition of each layer will filter the geographic areas that you see. This can be done in any of the geographic levels (State down to Census Tract) if those geographic levels are available for that variable. Once you have chose the variables you can limit the values using the minimum and maximum boxes in the Data Layers menu.

Data Loader

The Data Loader allows you to add data to a map in PolicyMap, either by putting the points on the map individually (Add Points Directly to the Map) or uploading a spreadsheet of points (Upload Spreadsheet of Points).

If you choose to add points to the map you will be asked to name your dataset, share it with all Yale users (unless you get a personal login) and to choose an icon. You can then create your dataset. If it tells you that there is a problem creating your dataset, it may be because the name for your dataset has already been used by someone else at Yale - try given using a different name. Once your dataset is created, a bar will appear in the top of the map window. You can Enter Address for Site and it will place the icon in the appropriate place, or you can click on the map to add points. When you are finished, you can Save your dataset or you can Download Points as a csv, with the Title, Description, Image and latitude and longitude. 

If you add a spreadsheet of addresses to the map, PolicyMap will place icons on the map at those address and add the latitude and longitude information to the table for you. 

Any data you create using the Data Loader will be available to anyone at Yale. Please see the box at the top of this guide for information on getting a private, personal login.

SimplyAnalytics is the place to go for marketing and business data. You can get point data for over 4.5 million US Businesses and Points-of-Interest including their contact and general information. You can also access market and media research industry data from Mediamark Research, Inc. Easy Analytic Software Inc. (EASI) has created a Life Stages data set that is also available through Simply Map.

The Yale Simply Map subscription provides acces to the Census data from 2000 and 2010, each in the 2000 and 2010 boundariesgeographies.

Signing In

Access to SimplyMap is provided to everyone at Yale at www.simplymap.com. If you are not on campus you will need to use the VPN client to get access. You do not have to create an account unless you want to save your work. To sign in without creating an account, use the Sign in as a guest link below the Sign In form.

Creating an account will allow you to create and save multiple maps and reports, filters, and favorite locations. SimplyMap will also remember recent variables, locations, and businesses and allow you to use them in new maps and reports. 

Setting the Location

Setting the location will zoom the map to the level appropriate to show that feature on your screen. You can choose a feature at many of the levels available in SimplyMap, from the entire USA down to the Census Block Level. You can choose to use a single location or a custom location. At the larger scales, you can create a custom location that includes everything within a specified radius around your chosen location. You can also create a custom location by choosing multiple locations from your recent or favorite locations list.

You can add the locations to your favorites list if you want to use them in future maps. You do this by clicking on the “Actions” button that shows up when you hover over a location. You can remove them from the list in the Actions menu as well.

If you only want to see your chose locations, click display and choose “Apply location mask”. 

Choosing the Variables

The variables in SimplyMap are organized in categories that generally relate to the sources of the data. Within the categories are folders, where you can choose between looking at the data in 2000 or 2010 values. This allows for analysis between these two years of the census, which would otherwise not be possible because of the changes to the boundaries between 2000 and 2010. From there you can choose the specific variable and  aggregation that you want to see. 

Clicking on the Action button will allow you to take a look at the metadata associated with the variable.

Choosing the Data Classification

Once you have chosen a variable, you can change the way that the data is displayed in your map using in the Legend. If you don’t see the legend on your map you can go to the Display Options menu and click Show Legend. 

In the legend you can choose to manually change the geography or just allow SimplyMap to Auto-select Geo Unit if you want it to change the geography based on your zoom level. You can also change the date of the data for the variable that is appearing in your map. 

In the Edit Legend Menu you can choose the data classification method. SimplyMap offers three basic classification options: quantiles, natural breaks or equal intervals. According to The Nature of Geographic Information, using quantiles will create breaks so that each category contains the same number of data points. Using equal intervals creates categories that equal divide the  range of data, which creates categories with different numbers of data points. 

Businesses

SimplyMap has provided access to a large database of business names and addresses that you can add to your map. You can search the businesses by keyword, name, NAICS code or SIC code, or a combination of those. 

Social Explorer is the choice if you are looking for historical census data. In Social Explorer you will have access to the U.S. Decennial Census back to 1790 as well as the data from the American Community Survey from 2005 to 2013. This is also the source for the Religious Congregations and Membership Study (RCMS) for 1980, 1990 and 2000, the Infogroup Religion 2009 and 2010 and the 2002 Carbon Dioxide Emissions data.

  • Air Quality System (AQS)
  • Toxics Release Inventory (TRI)
  • Environmental Justice Screening and Mapping (EJSCREEN)
  • NOAA Climate Normals
  • National Risk Index
Signing In

Social Explorer is a web-based mapping and data visualization tool. Access is provided by the Yale University Library to everyone at Yale. You can access Social Explorer at www.socialexplorer.com. If you are not on campus, you will need to use the VPN to get access.

You do not have to create an account to access the data and visualization tools in Social Explorer. You can create a free account if you wish to create and save projects. Do not use your NetID and password as your log in. You can choose to create a public profile if you wish.

Getting Started
Choosing the data

 You can change the data that is displayed on the map by clicking on the Change Data button in the top left corner of your screen. There are two options for browsing the data - by Category of by Survey. 

Browsing by Category requires that you choose a survey year first. Different data are available depending on the survey year that you choose. The available survey data go all the way back to 1790. 

Browsing by Survey allows you to choose your data source. You can choose from the available years of the American Community Survey (ACS), the decennial Census, the Religious Congregations & Membership Study (RCMS), the Religion studies by InfoGroup, and the 2002 Vulcan Project Carbon Emissions Data. You then choose the table and variable you are interested in.

Choosing the data geography

By default, the map will show the data geography that has been set as appropriate for the scale of the map (probably at the State level). You can see the map scale in the bottom left corner of the map. Use the -/+ buttons to zoom out and in, respectively. You can also zoom using the scroll wheel on your mouse. From Zoom 1-5 you will see state level data, Zoom 6-9 you will see County data, and above Zoom 10 you will see Census Tract level data (where those levels are available).

You can manually choose the data geography in the Show data by: pull-down menu. Turn the Automatic slider to Off and then click on the appropriate radio button. If the data disappear from the map you have chosen a geography that does not exist for that data set.

Visualization Types
Choosing a Visualization Type

Social Explorer offers you three different visualization types; shaded areas (choropleth maps), bubbles (proportional symbols) and dot density. According to Chapter 3 in The Nature of Geographic Information (a great resource if you are just starting out with GIS), if the variable that you are mapping is straight counts of indviduals you should use Bobbles or Dot Density. If you are mapping rates or densities, use shaded areas.

Shaded Area

Shaded area maps display data as a series of colors that fill in geographic areas. You can change When you are looking at a series of values from lowest to highest, it is best to use a color scale that goes from a light shade to a dark shade of one color. There are many options for color scales in Social Explorer or you can create your own. If the order of the color scale does not meet your needs you can Flip colors - this will change the color gradient so that what was previously the darkest colour is now the lightest.  

Bubbles (Proportional Symbols)

In the Bubble visualization type, the data will be represented by a series of circles that are centered in the mid-point of the geographic area they are representing. The scale of the bubbles is shown on the right side of the options menu bar. You can click on it to change the size or color of the bubbles. The +% Add perentages button allows you color-code the bubbles based on the percentage of the total value of the variable that they represent.

Dot Density

In the dot density visualizations, one dot is given a value and then a random scattering of dots is used to represent the variable value for that level of geography. For example, if a dot represented 10,000 households and the value for a county was 11,500 households that county would have one dot. Note that the value represented by one dot will decrease as you zoom in and the number of dots in a geography will increase. 

Map Layout

You can change the map layout using the button in the center bottom of the map. There are three layout options.

Single

Shows a single map window with one variable.

Side by side

Shows two completely independent map windows side by side. There is no connection between these maps - you can change the variables, the zoom, the area that you are looking at, and the visualization type as you please. 

Swipe

The swipe option allows you to look at two different variables in the same geographic context. The location stays the same, but you are allowed to adjust all of the other options on either side of a vertical slider. Move the slider from side to side to change what is represented on the map from one set of variables to another. You can see an example here.

Example Maps:

Bubble Single Map: Census 1850 - Total Population

Shaded Area Side by Side Map: 2002 CO2 Emissions and Census 2000 Median Household Income

Dot Density Swipe Map: Census 2010 - Total Population at the State and County Levels

Esri Business Analyst Web Apps allows you to perform market analysis with over 15,000 variables concerning demographics, consumer behavior, employment and many others.

  • You may conduct market analysis based on geographies, rings, drive-time and walk-time
  • It allows you to compare different sites
Signing In

Yale members can access Business Analyst through Yale ArcGIS Online. 

  1. Go to Yale ArcGIS Online http://yalemaps.maps.arcgis.com/
    • Please click sign in (top right corner)
    • Click  YALE UNIVERSITY 
    • If you previously had a Yale organization account, please contact us to walk you through the steps to connect your previously created content into this account 
  2. Click on the 9-square icon (top right corner) and click on Business Analyst
  3. Log in access should ask “using your Yale University account” which requires your NetID credentials
    • If not, go to next step
    • If yes, you will be granted access to Business Analyst
  4. Click on Enterprise Account
  5. Complete Yale’s ArcGIS Online URL, that is enter “yalemaps” (no quotes) on the box, then click continue
  6. You will be prompted to enter your Yale NetID credentials
  7. You should now have access to Esri Business Analyst

This report promotes public access to government data by publishing federal ArcGIS server addresses. State, county and city ArcGIS server addresses are included in this report with the hope that those teaching and learning GIS will use those local resources to help them learn about and explore geospatial data for their own backyard.

L2 Data is a database of U.S. voters used for political research and campaigns. It aggregates publicly available voter registration information and adds consumer and demographic data to create detailed profiles, including historical voting records and party affiliation.

CoreLogic Spatial Web Services (SWS) provides the tools and resources to access Spatial Solutions natural hazard risk, tax jurisdiction, parcel, and geocoding data. Using a RESTful API, you can now integrate and stream our data from a variety of platforms and programming languages including Java, .Net, and PHP.

Explore more on the Yale Library site.

PLEASE NOTE THAT MANY OF THESE GIS DATASETS ARE LICENSED FOR USE BY YALE PATRONS ONLY

Explore more on the Yale Library site.

  • 2011 India Census - GIS

    Includes state boundaries, district boundaries and subdistrict boundaries of India from 2011 census provisional data.

  • 2001 Indian Census - GIS

    IndiaMap includes socio-demographic and economic Census data for 2001 at the national, state, district, sub-district, tahsil, cities and towns. Also includes highways & railways.

  • 1991 Indian Census - GIS

    Includes socio-demographic and economic Census data for 1991 at the state, district, and subdistrict levels.

  • 1951-2001: Historical District Boundaries of India - GIS

    Historical boundaries dataset represents decadal change in district boundaries of India from 1951 to 2001 and corresponds to decennial census demographic and socio-economic data for the census years 1951, 1961, 1971, 1981, 1991 and 2001. Attribute data such as total population, population by sex, and rural/urban populations.

  • 2001-2011 Town map of India : digital map of town point of India - GIS

    All town points are shown linked to Primary Census Abstract 2001 and Primary Census Abstract 2011.

  • 9th-15th Lok Sabhas: Lok Sabha election data (1986-2009) - GIS

    “Pollmap of India”. Includes Parliamentary constituency boundaries for India, Assembly constituency boundaries for all states. Includes election data for the 9th, 10th, 11th, 12th, 13th, 14th, and 15th Lok Sabha. Optionally, an application for historical analysis and impact of regional and statewise swings can be added. 15th Lok Sabha Parliamentary boundaries of India linked to Census 2009. Includes current parliamentary constituency boundaries with demographic profile and 2009 elections results (15th Lok Sabha).

  • Census Digital Library (Government of India)

    OPEN ACCESS: A free beta site run by the Indian government with static maps, tables, photographs, PowerPoint presentations, and publicity audio clips from 1991-2011 censuses.

  • India Place Finder (GIS data)
OPEN ACCESS
  • DataMeet

    DataMeet was started on January 26th 2011 by a handful of data enthusiasts who started a google group where people can share tips for working with data. “The intent was always to work on data in India and to find others doing the same. Since there were many data groups in London I felt there should be one for India.”

  • SEDAC India-Village Socioeconomic Data

    The India Village-Level Geospatial Socio-Economic Data Set: 1991, 2001 is a compilation of the finest level of administrative boundaries in India (village/town-level) and over 200 socio-economic variables collected during the Indian Census in 1991 and 2001.