You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/api/clients/bulk_data.rst
+261Lines changed: 261 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,3 +5,264 @@ Bulk Data Client
5
5
:members:
6
6
:undoc-members:
7
7
:show-inheritance:
8
+
9
+
.. _bulk-data-product-identifiers:
10
+
11
+
Bulk Data Product Identifiers
12
+
------------------------------
13
+
14
+
The table below lists all product identifiers available in the USPTO Open Data Portal Bulk Dataset Directory.
15
+
Pass these identifiers to :meth:`~pyUSPTO.BulkDataClient.get_product_by_id` or use them as a filter
16
+
when calling :meth:`~pyUSPTO.BulkDataClient.search_products`.
17
+
18
+
.. note::
19
+
20
+
This table reflects the Bulk Dataset Directory as of 2026-Mar-30 (47 products).
21
+
Source: `2026 Bulk Data Product Descriptions <https://data.uspto.gov/documents/documents/2026BulkDataProductDescriptions.xlsx>`_.
22
+
USPTO adds new products over time; use :meth:`~pyUSPTO.BulkDataClient.search_products`
23
+
without filters to retrieve the current full list.
24
+
25
+
.. list-table::
26
+
:header-rows: 1
27
+
:widths: 10 25 15 8 42
28
+
29
+
* - Identifier
30
+
- Name
31
+
- Dates Available
32
+
- File Types
33
+
- Description
34
+
* - OACT
35
+
- Office Actions Weekly Archives
36
+
- 2023-Dec-18 – Present
37
+
- JSON
38
+
- Full-text of public Office Actions bundled as JSON in downloadable weekly ZIP files. Data covers 2020-01-06 to present.
39
+
* - PTFWPRD
40
+
- Patent File Wrapper (Bulk Datasets) – Daily
41
+
- 2026-Mar-23 – Present
42
+
- JSON
43
+
- Bibliographic and assignment static patent data as daily delta increments.
44
+
* - PTFWPRE
45
+
- Patent File Wrapper (Bulk Datasets) – Weekly
46
+
- 2001-Jan-01 – Present
47
+
- JSON
48
+
- Bibliographic and assignment static patent data as weekly datasets in 10-year increments.
49
+
* - TRTDXFAP
50
+
- Trademark Full Text XML Data (No Images) – Daily Applications
51
+
- 2025-Jan-01 – Present
52
+
- XML
53
+
- Pending and registered trademark text data (no images) for the current calendar year per the U.S. Trademark Applications Version 2.3 DTD.
54
+
* - TTABTDXF
55
+
- Trademark Full Text XML Data (No Images) – Daily TTAB
56
+
- 2025-Jan-01 – Present
57
+
- XML
58
+
- TTAB text data (no images) for the current calendar year per the TTAB Version 1.0 DTD.
59
+
* - PTGRMP2
60
+
- Patent Grant Multi-page PDF Images
61
+
- 1790-Jul-31 – Present
62
+
- PDF
63
+
- Multi-page PDF images of each patent grant issued weekly (Tuesdays) from 1790 to present. Includes Certificates-of-Correction and rescanned older grants.
64
+
* - APPXML
65
+
- Patent Application Full-Text Data (No Images)
66
+
- 2001-Mar-15 – Present
67
+
- XML
68
+
- Concatenated full-text XML of non-provisional utility and plant patent applications published weekly (Thursdays).
69
+
* - APPMP2
70
+
- Patent Application Multi-Page PDF Images
71
+
- 2001-Mar-15 – Present
72
+
- PDF
73
+
- Multi-page PDF images of non-provisional utility and plant patent applications published weekly (Thursdays).
74
+
* - APPBLXML
75
+
- Patent Application Bibliographic (Front Page) Data
76
+
- 2001-Mar-15 – Present
77
+
- XML
78
+
- Concatenated bibliographic (front page) text of patent applications published weekly (Thursdays); excludes images. Subset of APPXML.
79
+
* - APPDT
80
+
- Patent Application Full Text Data with Embedded TIFF Images
81
+
- 2001-Mar-15 – Present
82
+
- XML
83
+
- Full text, images/drawings, and complex work units (tables, math, chemical structures, genetic sequences) of patent applications published weekly (Thursdays).
84
+
* - PTMNFEE2
85
+
- Patent Maintenance Fee Events
86
+
- 2026-Jan-06 – Present
87
+
- ASCII
88
+
- Cumulative weekly file of recorded maintenance fee events for patents granted from 1981-Sep-01 to present.
89
+
* - PTGRDT
90
+
- Patent Grant Full Text Data with Embedded TIFF Images (Grant Red Book / WIPO ST.36)
91
+
- 2002-Jan-01 – Present
92
+
- XML
93
+
- Full text, images/drawings, and complex work units of patent grants issued weekly (Tuesdays).
94
+
* - GZLST
95
+
- Patent Official Gazettes
96
+
- 2002-Jul-02 – Present
97
+
- HTML
98
+
- Weekly bibliographic information, representative claim, and drawing for each patent grant, plus USPTO Notices.
99
+
* - PTGRXML
100
+
- Patent Grant Full-Text Data (No Images)
101
+
- 2002-Jan-01 – Present
102
+
- ASCII, XML
103
+
- Concatenated full-text of patent grant documents issued weekly (Tuesdays); excludes images.
104
+
* - PTBLXML
105
+
- Patent Grant Bibliographic (Front Page) Text Data
106
+
- 2002-Jan-01 – Present
107
+
- ASCII, XML
108
+
- Concatenated bibliographic (front page) text of patent grant documents issued weekly (Tuesdays); excludes images. Subset of PTGRXML.
109
+
* - CPCMCPT
110
+
- CPC Master Classification Files for U.S. Patent Grants
111
+
- 2025-Jun-17 – Present
112
+
- TXT, XML
113
+
- CPC classification data for all U.S. patent grants from 1790-Jul-31 to present, updated monthly.
114
+
* - CPCMCAPP
115
+
- CPC Master Classification Files for U.S. Patent Applications
116
+
- 2025-Jun-17 – Present
117
+
- TXT, XML
118
+
- CPC classification data for all U.S. patent applications published from 2001-Mar-15 to present, updated monthly.
119
+
* - PVPGPUBTXT
120
+
- PatentsView Pre-Grant Publication Long Text Data
121
+
- 2001-Mar-15 – Present
122
+
- TSV
123
+
- Annual files of long-text fields (Brief Summary, Claims, Detail Description, Drawing Description) for pre-grant publications from 2001 to present.
124
+
* - PVGPATTXT
125
+
- PatentsView Granted Patent Long Text Data
126
+
- 1976-Jan-01 – Present
127
+
- TSV
128
+
- Annual files of long-text fields (Brief Summary, Claims, Detail Description, Drawing Description) for granted patents from 1976 to present.
129
+
* - PVPGPUBDIS
130
+
- PatentsView Pre-Grant Publication Disambiguated Data
131
+
- 2001-Mar-15 – Present
132
+
- TSV
133
+
- 25 files for pre-grant publications from 2001 to present, including disambiguated applicants, assignees, inventors, locations, technology categories, and government interest statements.
134
+
* - PVGPATDIS
135
+
- PatentsView Granted Patent Disambiguated Data
136
+
- 1976-Jan-01 – Present
137
+
- TSV
138
+
- 35 files for granted patents from 1976 to present, including disambiguated assignees, inventors, locations, cited prior art, examiner name, and government interest statements.
139
+
* - PVSORTED
140
+
- PatentsView Sorted Data (Beta)
141
+
- 1976-Jan-01 – Present
142
+
- TSV
143
+
- Reorganized bibliographic data correcting inventor/applicant/assignee ordering inconsistencies introduced by the Leahy-Smith America Invents Act.
144
+
* - PVANNUAL
145
+
- PatentsView Annualized Patent Data
146
+
- 1976-Jan-01 – Present
147
+
- CSV
148
+
- Small annual CSV files derived from PatentsView Granted Patent Disambiguated Data, including inventor gender attribution.
149
+
* - TRTYRAP
150
+
- Trademark Full Text XML Data (No Images) – Annual Applications
151
+
- 1884-Apr-07 – Present
152
+
- XML
153
+
- Backfile of pending and registered trademark text data (no images) from 1884-Apr through 2025-Dec per the U.S. Trademark Applications Version 2.3 DTD.
154
+
* - TRTDXFAG
155
+
- Trademark Full Text XML Data (No Images) – Daily Assignments
156
+
- 2025-Jan-01 – Present
157
+
- XML
158
+
- Trademark assignment text data (no images) for the current calendar year per the Trademark Assignments Version 0.4 DTD.
159
+
* - PASDL
160
+
- Patent Assignment XML (Ownership) Text – Daily
161
+
- 2025-Jan-01 – Present
162
+
- XML
163
+
- Daily patent assignment text (no images) for the current calendar year derived from USPTO assignment recordations.
164
+
* - PASYR
165
+
- Patent Assignment XML (Ownership) Text – Annual
166
+
- 1980-Jan-01 – Present
167
+
- XML
168
+
- Annual backfile of patent assignment text (no images) from 1980-Aug through 2025-Dec.
169
+
* - ECOPATAI
170
+
- Artificial Intelligence Patent Dataset (AIPD)
171
+
- 2021-Jul-30 – 2026-Feb-03
172
+
- DTA, TSV
173
+
- AI patent landscape data classifying 13.2M granted patents and PGPubs from 1976–2020 across eight AI component technologies using machine learning models.
174
+
* - TRTYRAG
175
+
- Trademark Full Text XML Data (No Images) – Annual Assignments
176
+
- 1951-Oct-02 – Present
177
+
- XML
178
+
- Backfile of trademark assignment text data from 1955-Jan-03 through 2025-Dec per the Trademark Assignments Version 0.4 DTD.
179
+
* - TTABYR
180
+
- Trademark Full Text XML Data (No Images) – Annual TTAB
181
+
- 1951-Oct-02 – Present
182
+
- XML
183
+
- Backfile of TTAB text data from 1951-Oct-02 through 2025-Dec per the TTAB Version 1.0 DTD.
184
+
* - PEDSJSON
185
+
- Patent Examination Data System (Bulk Datasets) – JSON
186
+
- 1900-Jan-01 – 2000-Dec-31
187
+
- JSON
188
+
- Static snapshot (created 2025-Mar-17) of patent application data from 1900–2000, migrated from the retired PEDS system, in 20-year increment downloads.
189
+
* - PEDSXML
190
+
- Patent Examination Data System (Bulk Datasets) – XML
191
+
- 1900-Jan-01 – 2000-Dec-31
192
+
- XML
193
+
- Static snapshot (created 2025-Mar-16) of patent application data from 1900–2000, migrated from the retired PEDS system, in 20-year increment downloads.
194
+
* - ECORSEXC
195
+
- Patent Assignment Data for Academia and Researchers
196
+
- 2015-Aug-05 – 2024-Apr-19
197
+
- DTA, TSV
198
+
- ~10M patent assignments and transactions recorded at USPTO since 1970, covering ~17.8M patents and applications.
199
+
* - TRASECO
200
+
- Trademark Assignment Data for Academia and Researchers
201
+
- 2014-Apr-18 – 2024-Apr-01
202
+
- CSV, DTA
203
+
- 1.29M trademark assignments and transactions recorded at USPTO between 1952 and 2023, covering 2.28M unique trademark properties.
204
+
* - TRCFECO2
205
+
- Trademark Case File Data for Academia and Researchers
206
+
- 2013-Jan-02 – 2024-Mar-27
207
+
- CSV, DTA
208
+
- 12.1M trademark applications filed with or registrations issued by USPTO between 1870 and January 2023.
209
+
* - PTLITIG
210
+
- Patent Litigation Docket Report Data Files for Academia and Researchers
211
+
- 2016-Dec-29 – 2024-Mar-27
212
+
- CSV, DTA
213
+
- U.S. District Court patent litigation data on 81,350 unique cases filed 1963–2020, sourced from PACER and RECAP, including parties, cause of action, court location, key dates, and 5M+ docket documents.
214
+
* - ECOPAIR
215
+
- Patent Examination Research Dataset (PatEx)
216
+
- 2015-Dec-02 – 2023-Sep-26
217
+
- CSV, DTA
218
+
- 13M+ publicly viewable patent applications and 1M+ PCT applications through June 2023, including prosecution history, continuation history, foreign priority claims, and PTA history.
219
+
* - PTAPOATH
220
+
- Patent and Patent Application Oath Signature Dataset
221
+
- 2022-Sep-30 – 2022-Sep-30
222
+
- JPEG, JSON
223
+
- 883,811 signature images extracted from patent inventor oath documents from 1998-Sep to 2022-Sep, broken into 8 ZIP files by series code (12–17, 29, 35). 40.5 GB total.
224
+
* - PTOFFACT
225
+
- Patent Application Office Actions Research Dataset
226
+
- 2017-Nov-29 – 2017-Nov-29
227
+
- CSV, DTA
228
+
- 4.4M Office actions mailed 2008–June 2017 for 2.2M publicly viewable applications, including grounds for rejection, claims, and pertinent prior art.
229
+
* - PTGRAPS
230
+
- Patent Grant Full-Text Data (No Images) – APS
231
+
- 1976-Jan-06 – Present
232
+
- ASCII, XML
233
+
- Concatenated full-text of patent grants issued weekly (Tuesdays) from 1976-Jan-01 to 2001-Dec-25; excludes images.
234
+
* - PTBLAPS
235
+
- Patent Grant Bibliographic (Front Page) Text Data – APS
236
+
- 1976-Jan-01 – Present
237
+
- ASCII, XML
238
+
- Concatenated bibliographic (front page) text of patent grants issued weekly (Tuesdays) from 1976-Jan-01 to 2000-Dec-26; excludes images. Subset of PTGRAPS.
239
+
* - PTAPPCLM
240
+
- Patent and Patent Application Claims Research Dataset
241
+
- 2016-Oct-07 – 2016-Oct-11
242
+
- CSV, DTA
243
+
- Claims data for U.S. patents granted 1976–2014 and applications published 2001–2014, including individual claim text, dependency relationships, claim-level and document-level statistics.
244
+
* - MOONSHOT
245
+
- Cancer Moonshot Patent Data Files
246
+
- 2016-Aug-19 – 2016-Aug-19
247
+
- CSV
248
+
- 269,353 patent documents from 1976–2016 curated to identify R&D in diagnostics, therapeutics, data analytics, and model biological systems.
249
+
* - HISTEXC
250
+
- Historical Patent Data Files for Academia and Researchers
251
+
- 2015-Jun-25 – 2015-Jul-02
252
+
- CSV, DTA
253
+
- Four NBER research datasets with time-series and micro-level data by technology sub-category spanning two centuries of patent applications, grants, and in-force patents.
254
+
* - PTBLSGM
255
+
- Patent Grant Bibliographic (Front Page) Text Data – SGML
256
+
- 2001-Jan-02 – Present
257
+
- ASCII, XML
258
+
- Concatenated bibliographic (front page) text of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25; excludes images. Subset of PTGRDSGM.
259
+
* - PTGRDSGM
260
+
- Patent Grant Full Text Data with Embedded TIFF Images (Grant Red Book / WIPO ST.36) – SGML
261
+
- 2001-Jan-02 – Present
262
+
- XML
263
+
- Full text, images/drawings, and complex work units of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25.
264
+
* - PTGRSGM
265
+
- Patent Grant Full-Text Data (No Images) – SGML
266
+
- 2001-Jan-02 – Present
267
+
- ASCII, XML
268
+
- Concatenated full-text of patent grants issued weekly (Tuesdays) from 2001-Jan-02 to 2001-Dec-25; excludes images.
0 commit comments