Commit a4bca58

docs: Add Bulk data product identifier table.
1 parent 69f7da2 commit a4bca58

1 file changed

Lines changed: 261 additions & 0 deletions

docs/source/api/clients/bulk_data.rst
@@ -5,3 +5,264 @@ Bulk Data Client
   :members:
   :undoc-members:
   :show-inheritance:

.. _bulk-data-product-identifiers:

Bulk Data Product Identifiers
------------------------------

The table below lists all product identifiers available in the USPTO Open Data Portal Bulk Dataset Directory.
Pass these identifiers to :meth:`~pyUSPTO.BulkDataClient.get_product_by_id` or use them as a filter
when calling :meth:`~pyUSPTO.BulkDataClient.search_products`.

.. note::

   This table reflects the Bulk Dataset Directory as of 2026-Mar-30 (47 products).
   Source: `2026 Bulk Data Product Descriptions <https://data.uspto.gov/documents/documents/2026BulkDataProductDescriptions.xlsx>`_.
   USPTO adds new products over time; use :meth:`~pyUSPTO.BulkDataClient.search_products`
   without filters to retrieve the current full list.

.. list-table::
   :header-rows: 1
   :widths: 10 25 15 8 42

   * - Identifier
     - Name
     - Dates Available
     - File Types
     - Description
   * - OACT
     - Office Actions Weekly Archives
     - 2023-Dec-18 – Present
     - JSON
     - Full-text of public Office Actions bundled as JSON in downloadable weekly ZIP files. Data covers 2020-Jan-06 to present.
   * - PTFWPRD
     - Patent File Wrapper (Bulk Datasets) – Daily
     - 2026-Mar-23 – Present
     - JSON
     - Bibliographic and assignment static patent data as daily delta increments.
   * - PTFWPRE
     - Patent File Wrapper (Bulk Datasets) – Weekly
     - 2001-Jan-01 – Present
     - JSON
     - Bibliographic and assignment static patent data as weekly datasets in 10-year increments.
   * - TRTDXFAP
     - Trademark Full Text XML Data (No Images) – Daily Applications
     - 2025-Jan-01 – Present
     - XML
     - Pending and registered trademark text data (no images) for the current calendar year per the U.S. Trademark Applications Version 2.3 DTD.
   * - TTABTDXF
     - Trademark Full Text XML Data (No Images) – Daily TTAB
     - 2025-Jan-01 – Present
     - XML
     - TTAB text data (no images) for the current calendar year per the TTAB Version 1.0 DTD.
   * - PTGRMP2
     - Patent Grant Multi-page PDF Images
     - 1790-Jul-31 – Present
     - PDF
     - Multi-page PDF images of each patent grant issued weekly (Tuesdays) from 1790 to present. Includes Certificates-of-Correction and rescanned older grants.
   * - APPXML
     - Patent Application Full-Text Data (No Images)
     - 2001-Mar-15 – Present
     - XML
     - Concatenated full-text XML of non-provisional utility and plant patent applications published weekly (Thursdays).
   * - APPMP2
     - Patent Application Multi-Page PDF Images
     - 2001-Mar-15 – Present
     - PDF
     - Multi-page PDF images of non-provisional utility and plant patent applications published weekly (Thursdays).
   * - APPBLXML
     - Patent Application Bibliographic (Front Page) Data
     - 2001-Mar-15 – Present
     - XML
     - Concatenated bibliographic (front page) text of patent applications published weekly (Thursdays); excludes images. Subset of APPXML.
   * - APPDT
     - Patent Application Full Text Data with Embedded TIFF Images
     - 2001-Mar-15 – Present
     - XML
     - Full text, images/drawings, and complex work units (tables, math, chemical structures, genetic sequences) of patent applications published weekly (Thursdays).
   * - PTMNFEE2
     - Patent Maintenance Fee Events
     - 2026-Jan-06 – Present
     - ASCII
     - Cumulative weekly file of recorded maintenance fee events for patents granted from 1981-Sep-01 to present.
   * - PTGRDT
     - Patent Grant Full Text Data with Embedded TIFF Images (Grant Red Book / WIPO ST.36)
     - 2002-Jan-01 – Present
     - XML
     - Full text, images/drawings, and complex work units of patent grants issued weekly (Tuesdays).
   * - GZLST
     - Patent Official Gazettes
     - 2002-Jul-02 – Present
     - HTML
     - Weekly bibliographic information, representative claim, and drawing for each patent grant, plus USPTO Notices.
   * - PTGRXML
     - Patent Grant Full-Text Data (No Images)
     - 2002-Jan-01 – Present
     - ASCII, XML
     - Concatenated full-text of patent grant documents issued weekly (Tuesdays); excludes images.
   * - PTBLXML
     - Patent Grant Bibliographic (Front Page) Text Data
     - 2002-Jan-01 – Present
     - ASCII, XML
     - Concatenated bibliographic (front page) text of patent grant documents issued weekly (Tuesdays); excludes images. Subset of PTGRXML.
   * - CPCMCPT
     - CPC Master Classification Files for U.S. Patent Grants
     - 2025-Jun-17 – Present
     - TXT, XML
     - CPC classification data for all U.S. patent grants from 1790-Jul-31 to present, updated monthly.
   * - CPCMCAPP
     - CPC Master Classification Files for U.S. Patent Applications
     - 2025-Jun-17 – Present
     - TXT, XML
     - CPC classification data for all U.S. patent applications published from 2001-Mar-15 to present, updated monthly.
   * - PVPGPUBTXT
     - PatentsView Pre-Grant Publication Long Text Data
     - 2001-Mar-15 – Present
     - TSV
     - Annual files of long-text fields (Brief Summary, Claims, Detail Description, Drawing Description) for pre-grant publications from 2001 to present.
   * - PVGPATTXT
     - PatentsView Granted Patent Long Text Data
     - 1976-Jan-01 – Present
     - TSV
     - Annual files of long-text fields (Brief Summary, Claims, Detail Description, Drawing Description) for granted patents from 1976 to present.
   * - PVPGPUBDIS
     - PatentsView Pre-Grant Publication Disambiguated Data
     - 2001-Mar-15 – Present
     - TSV
     - 25 files for pre-grant publications from 2001 to present, including disambiguated applicants, assignees, inventors, locations, technology categories, and government interest statements.
   * - PVGPATDIS
     - PatentsView Granted Patent Disambiguated Data
     - 1976-Jan-01 – Present
     - TSV
     - 35 files for granted patents from 1976 to present, including disambiguated assignees, inventors, locations, cited prior art, examiner name, and government interest statements.
   * - PVSORTED
     - PatentsView Sorted Data (Beta)
     - 1976-Jan-01 – Present
     - TSV
     - Reorganized bibliographic data correcting inventor/applicant/assignee ordering inconsistencies introduced by the Leahy-Smith America Invents Act.
   * - PVANNUAL
     - PatentsView Annualized Patent Data
     - 1976-Jan-01 – Present
     - CSV
     - Small annual CSV files derived from PatentsView Granted Patent Disambiguated Data, including inventor gender attribution.
   * - TRTYRAP
     - Trademark Full Text XML Data (No Images) – Annual Applications
     - 1884-Apr-07 – Present
     - XML
     - Backfile of pending and registered trademark text data (no images) from 1884-Apr through 2025-Dec per the U.S. Trademark Applications Version 2.3 DTD.
   * - TRTDXFAG
     - Trademark Full Text XML Data (No Images) – Daily Assignments
     - 2025-Jan-01 – Present
     - XML
     - Trademark assignment text data (no images) for the current calendar year per the Trademark Assignments Version 0.4 DTD.
   * - PASDL
     - Patent Assignment XML (Ownership) Text – Daily
     - 2025-Jan-01 – Present
     - XML
     - Daily patent assignment text (no images) for the current calendar year derived from USPTO assignment recordations.
   * - PASYR
     - Patent Assignment XML (Ownership) Text – Annual
     - 1980-Jan-01 – Present
     - XML
     - Annual backfile of patent assignment text (no images) from 1980-Aug through 2025-Dec.
   * - ECOPATAI
     - Artificial Intelligence Patent Dataset (AIPD)
     - 2021-Jul-30 – 2026-Feb-03
     - DTA, TSV
     - AI patent landscape data classifying 13.2M granted patents and PGPubs from 1976–2020 across eight AI component technologies using machine learning models.
   * - TRTYRAG
     - Trademark Full Text XML Data (No Images) – Annual Assignments
     - 1951-Oct-02 – Present
     - XML
     - Backfile of trademark assignment text data from 1955-Jan-03 through 2025-Dec per the Trademark Assignments Version 0.4 DTD.
   * - TTABYR
     - Trademark Full Text XML Data (No Images) – Annual TTAB
     - 1951-Oct-02 – Present
     - XML
     - Backfile of TTAB text data from 1951-Oct-02 through 2025-Dec per the TTAB Version 1.0 DTD.
   * - PEDSJSON
     - Patent Examination Data System (Bulk Datasets) – JSON
     - 1900-Jan-01 – 2000-Dec-31
     - JSON
     - Static snapshot (created 2025-Mar-17) of patent application data from 1900–2000, migrated from the retired PEDS system, in 20-year increment downloads.
   * - PEDSXML
     - Patent Examination Data System (Bulk Datasets) – XML
     - 1900-Jan-01 – 2000-Dec-31
     - XML
     - Static snapshot (created 2025-Mar-16) of patent application data from 1900–2000, migrated from the retired PEDS system, in 20-year increment downloads.
   * - ECORSEXC
     - Patent Assignment Data for Academia and Researchers
     - 2015-Aug-05 – 2024-Apr-19
     - DTA, TSV
     - ~10M patent assignments and transactions recorded at USPTO since 1970, covering ~17.8M patents and applications.
   * - TRASECO
     - Trademark Assignment Data for Academia and Researchers
     - 2014-Apr-18 – 2024-Apr-01
     - CSV, DTA
     - 1.29M trademark assignments and transactions recorded at USPTO between 1952 and 2023, covering 2.28M unique trademark properties.
   * - TRCFECO2
     - Trademark Case File Data for Academia and Researchers
     - 2013-Jan-02 – 2024-Mar-27
     - CSV, DTA
     - 12.1M trademark applications filed with or registrations issued by USPTO between 1870 and January 2023.
   * - PTLITIG
     - Patent Litigation Docket Report Data Files for Academia and Researchers
     - 2016-Dec-29 – 2024-Mar-27
     - CSV, DTA
     - U.S. District Court patent litigation data on 81,350 unique cases filed 1963–2020, sourced from PACER and RECAP, including parties, cause of action, court location, key dates, and 5M+ docket documents.
   * - ECOPAIR
     - Patent Examination Research Dataset (PatEx)
     - 2015-Dec-02 – 2023-Sep-26
     - CSV, DTA
     - 13M+ publicly viewable patent applications and 1M+ PCT applications through June 2023, including prosecution history, continuation history, foreign priority claims, and PTA history.
   * - PTAPOATH
     - Patent and Patent Application Oath Signature Dataset
     - 2022-Sep-30 – 2022-Sep-30
     - JPEG, JSON
     - 883,811 signature images extracted from patent inventor oath documents from 1998-Sep to 2022-Sep, broken into 8 ZIP files by series code (12–17, 29, 35). 40.5 GB total.
   * - PTOFFACT
     - Patent Application Office Actions Research Dataset
     - 2017-Nov-29 – 2017-Nov-29
     - CSV, DTA
     - 4.4M Office actions mailed 2008–June 2017 for 2.2M publicly viewable applications, including grounds for rejection, claims, and pertinent prior art.
   * - PTGRAPS
     - Patent Grant Full-Text Data (No Images) – APS
     - 1976-Jan-06 – Present
     - ASCII, XML
     - Concatenated full-text of patent grants issued weekly (Tuesdays) from 1976-Jan-01 to 2001-Dec-25; excludes images.
   * - PTBLAPS
     - Patent Grant Bibliographic (Front Page) Text Data – APS
     - 1976-Jan-01 – Present
     - ASCII, XML
     - Concatenated bibliographic (front page) text of patent grants issued weekly (Tuesdays) from 1976-Jan-01 to 2000-Dec-26; excludes images. Subset of PTGRAPS.
   * - PTAPPCLM
     - Patent and Patent Application Claims Research Dataset
     - 2016-Oct-07 – 2016-Oct-11
     - CSV, DTA
     - Claims data for U.S. patents granted 1976–2014 and applications published 2001–2014, including individual claim text, dependency relationships, claim-level and document-level statistics.
   * - MOONSHOT
     - Cancer Moonshot Patent Data Files
     - 2016-Aug-19 – 2016-Aug-19
     - CSV
     - 269,353 patent documents from 1976–2016 curated to identify R&D in diagnostics, therapeutics, data analytics, and model biological systems.
   * - HISTEXC
     - Historical Patent Data Files for Academia and Researchers
     - 2015-Jun-25 – 2015-Jul-02
     - CSV, DTA
     - Four NBER research datasets with time-series and micro-level data by technology sub-category spanning two centuries of patent applications, grants, and in-force patents.
   * - PTBLSGM
     - Patent Grant Bibliographic (Front Page) Text Data – SGML
     - 2001-Jan-02 – Present
     - ASCII, XML
     - Concatenated bibliographic (front page) text of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25; excludes images. Subset of PTGRDSGM.
   * - PTGRDSGM
     - Patent Grant Full Text Data with Embedded TIFF Images (Grant Red Book / WIPO ST.36) – SGML
     - 2001-Jan-02 – Present
     - XML
     - Full text, images/drawings, and complex work units of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25.
   * - PTGRSGM
     - Patent Grant Full-Text Data (No Images) – SGML
     - 2001-Jan-02 – Present
     - ASCII, XML
     - Concatenated full-text of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25; excludes images.
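
As a rough sketch of how the identifiers in the table might be used, the helper below cleans up a list of identifiers and looks each one up through a client object. It deliberately does not import pyUSPTO. ``SupportsProductLookup`` and ``fetch_products`` are hypothetical names introduced only for this illustration, and calling ``get_product_by_id`` with the identifier as its first positional argument is an assumption that should be checked against the client's API reference. Upper-casing the input is also an assumption, based only on the identifiers shown in the table.

```python
from typing import Any, Dict, Iterable, Protocol


class SupportsProductLookup(Protocol):
    """Hypothetical structural type: any object exposing get_product_by_id."""

    def get_product_by_id(self, product_id: str) -> Any:
        ...


def fetch_products(
    client: SupportsProductLookup, identifiers: Iterable[str]
) -> Dict[str, Any]:
    """Look up each identifier once, keeping first-seen order."""
    results: Dict[str, Any] = {}
    for raw in identifiers:
        # Assumption: identifiers are upper-case, as in the table.
        product_id = raw.strip().upper()
        if not product_id or product_id in results:
            continue  # skip blanks and duplicates
        # Assumption: the identifier is the first positional argument.
        results[product_id] = client.get_product_by_id(product_id)
    return results
```

If the signature assumption holds, a ``BulkDataClient`` instance could be passed as ``client``, for example ``fetch_products(client, ["OACT", "PTGRXML"])``. Because the Bulk Dataset Directory changes over time, the helper does not check identifiers against a fixed list.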
