In geographic information systems (GIS), vector file formats are used to represent geographic features such as points, lines, and polygons, along with associated attribute data.[1] These formats are essential for storing, exchanging, and analyzing spatial data across both desktop and web-based GIS platforms. The structure and capabilities of each format — including support for coordinate reference systems, metadata, and topology — greatly affect interoperability and analytical accuracy.[2]
This article presents a comparison of popular GIS and CAD vector formats, detailing their design authorities, licensing models, technical properties, and common use cases. It is intended to help users choose the most appropriate format based on software compatibility, performance, and data exchange needs.[3]
The table below lists several widely used GIS/CAD vector formats, their managing authorities, and licensing terms.
Format | Design authority | License |
---|---|---|
AutoCAD DXF | Autodesk | Proprietary[4] |
Cartesian coordinate system (XYZ) | – | – |
GML | Open Geospatial Consortium (OGC) / ISO | Open standard[5] |
MapInfo TAB | Precisely (formerly MapInfoCorp) | Proprietary[6] |
Shapefile | ESRI | Hybrid (proprietary spec, documented)[7][8] |
TIGER | US Census Bureau | Public domain[9][10] |
The Drawing Interchange Format (DXF) is a CAD data file format developed by Autodesk in December 1982 alongside AutoCAD 1.0 to facilitate interoperability between CAD applications. It supports ASCII (since initial release) and binary encodings (since AutoCAD R10 in 1988), uses a group-code/tagged structure, and is partially publicly documented — though newer entity types may lack full documentation.[4]
GML is an XML-based grammar defined by the OGC and standardized as ISO 19136:2007 (and updated ISO 19136-1:2020). It models geographic features, geometries, coordinate reference system, coverages, and sensor data. GML profiles (e.g., Simple Features) tailor subsets for specific applications. Widely used in WFS, GML underpins vendor-neutral data exchange.[5]
MapInfo TAB is a proprietary GIS vector format created by MapInfo (now Precisely). A dataset includes .TAB (structure), .DAT (attributes), .MAP (geometry), and optional .ID/.IND index files. It supports points, lines, polygons, text annotations, spatial indexing, and multiple coordinate systems; common in MapInfo Pro and supported via GDAL/OGR.[6]
Introduced by ESRI in 1998, the Shapefile format comprises .shp (geometry), .shx (spatial index), .dbf (attribute table), and optional .prj/.cpg files. It supports simple features: point, polyline, polygon, and multipoint. While popular and widely supported, it has limitations — DBF attribute format, 2 GB file size limit, no topology, limited Unicode/field name length.[7][8]
The Topologically Integrated Geographic Encoding and Referencing (TIGER) format was developed by the US Census Bureau to represent geographic features (roads, boundaries, water) for census purposes. Distributed as public domain TIGER/Line shapefiles, TIGER data includes geographic codes (GEOIDs) that can be linked to demographic data.[9][10]