libpython: add to/from numpy functions by ninsbl · Pull Request #423 · OSGeo/grass

ninsbl · 2020-03-16T23:00:01Z

This PR would add two new functions:

one to parse e.g. stdout into a numpy array, and
one to write a numpy array to a table in the DB backend

I would very much appreciate thorough review of the two functions, esp. the one writing to DB.
Also hints on how to properly add examples for doctest (or other unit-tests) would be very welcome...

ninsbl · 2020-03-18T13:03:06Z

See also:
https://trac.osgeo.org/grass/ticket/3639

metzm · 2020-03-29T20:41:06Z

In this PR you are using the Python interfaces to SQLite3 and PG. Have you tested the alternative using db.execute? The reason I am asking is that the GRASS db drivers have a lot of error handling that might be missing from the Python interfaces to the respective DB drivers. Using the Python interfaces could cause cryptic errors if something goes wrong, errors that the GRASS db drivers might explain better.
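One way to reduce the risk of cryptic errors when staying with the Python DB interface is to catch the driver's exceptions and re-raise them with context. The following is a minimal sketch for SQLite only; `TableWriteError` and `execute_checked` are hypothetical names that are not part of this PR or of GRASS.

```python
import sqlite3


class TableWriteError(RuntimeError):
    """Hypothetical error type that carries a readable message."""


def execute_checked(connection, sql, parameters=()):
    """Run one statement and re-raise driver errors with context."""
    try:
        # The connection context manager commits on success
        # and rolls back if the statement raises.
        with connection:
            return connection.execute(sql, parameters)
    except sqlite3.Error as error:
        raise TableWriteError(f"Failed to execute {sql!r}: {error}") from error


connection = sqlite3.connect(":memory:")
execute_checked(connection, "CREATE TABLE points (cat INTEGER, value REAL)")

try:
    # The table name is wrong on purpose to trigger a driver error.
    execute_checked(connection, "INSERT INTO missing_table VALUES (?, ?)", (1, 2.5))
except TableWriteError as error:
    message = str(error)
    print(message)
```

The re-raised message includes both the failing statement and the driver's own explanation, which is the kind of context the GRASS db drivers would otherwise add.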

wenzeslaus · 2020-04-21T19:19:07Z

Also hints on how to properly add examples for doctest (or other unit-tests) would be very welcome...

Any general Python testing instructions should be applicable like this or this. You should probably focus just on SQLite, because PostgreSQL is more difficult to set up for the tests (although it is possible).

If working within the NC SPM location won't work for your tests, you can try to write a plain Python unittest test and use e.g. --exec to run GRASS (the testing framework is from the times before --exec, so it does not integrate with it well, but I already had to use this approach here).
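A plain Python unittest focused on SQLite could look roughly like the sketch below. It uses an in-memory database so nothing needs to be set up; `write_rows` is a hypothetical stand-in for the function under test, not the function from this PR.

```python
import sqlite3
import unittest


def write_rows(connection, table, columns, rows):
    """Stand-in for the function under test (hypothetical helper)."""
    column_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    with connection:
        connection.execute(f"CREATE TABLE {table} ({column_list})")
        connection.executemany(
            f"INSERT INTO {table} ({column_list}) VALUES ({placeholders})", rows
        )


class TestWriteRows(unittest.TestCase):
    def setUp(self):
        # A fresh in-memory database for every test.
        self.connection = sqlite3.connect(":memory:")

    def tearDown(self):
        self.connection.close()

    def test_round_trip(self):
        rows = [(1, 1.5), (2, 2.5)]
        write_rows(self.connection, "points", ["cat", "value"], rows)
        stored = self.connection.execute(
            "SELECT cat, value FROM points ORDER BY cat"
        ).fetchall()
        self.assertEqual(stored, rows)


# Run the test case explicitly so the result object can be inspected.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestWriteRows)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

In a real test file the last two lines would usually be replaced by `unittest.main()` under an `if __name__ == "__main__":` guard.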

ninsbl · 2020-05-11T08:38:33Z

In this PR you are using the python interface to SQLite3 and PG. Have you tested the alternative using db.execute?

I have not tested that. The Python interfaces offer some additional functionality and direct translation between Python objects and database data types. DB handling in pygrass (where these functions are supposed to end up) also uses Python DB adapters. But it is a good point to double-check that potential errors are caught properly!
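The direct translation mentioned here can be illustrated with the standard library's sqlite3 module alone. This is a small sketch with a made-up table, not code from the PR: Python values are bound as parameters and come back as the same Python types.

```python
import sqlite3

connection = sqlite3.connect(":memory:")
# Columns are declared without types, so SQLite stores values as given.
connection.execute("CREATE TABLE sample (a, b, c, d)")
connection.execute(
    "INSERT INTO sample VALUES (?, ?, ?, ?)", (1, 2.5, "text", None)
)

# typeof() reports the storage class SQLite chose for each value.
storage_classes = connection.execute(
    "SELECT typeof(a), typeof(b), typeof(c), typeof(d) FROM sample"
).fetchone()
values = connection.execute("SELECT a, b, c, d FROM sample").fetchone()

print(storage_classes)
print(values)
```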

wenzeslaus · 2020-06-05T03:33:09Z

lib/python/pygrass/utils.py

+    """
+    sql_to_dtype = {
+        "sqlite": {
+            "INTEGER": [0, 2, 3, 4, 5, 6, 7, 8, 9, 10],


Do these really need to be numbers and not things like numpy.int32?
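As an illustration of the alternative, a mapping can be driven by numpy's type hierarchy instead of numeric codes. The sketch below is only one possible shape; `sql_type_for` is a hypothetical helper and the SQL type names chosen (SQLite-style `INTEGER`, `REAL`, `TEXT`) are assumptions.

```python
import numpy as np


def sql_type_for(dtype):
    """Map a numpy dtype to an SQLite-style column type (hypothetical helper)."""
    dtype = np.dtype(dtype)
    if np.issubdtype(dtype, np.bool_) or np.issubdtype(dtype, np.integer):
        return "INTEGER"
    if np.issubdtype(dtype, np.floating):
        return "REAL"
    if np.issubdtype(dtype, np.str_) or np.issubdtype(dtype, np.bytes_):
        return "TEXT"
    raise TypeError(f"No SQL type known for numpy dtype {dtype}")


# Column definitions for a structured dtype, one per field.
structured_dtype = np.dtype([("cat", "i4"), ("value", "f8"), ("name", "U10")])
columns = [
    (name, sql_type_for(structured_dtype[name])) for name in structured_dtype.names
]
print(columns)
```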

wenzeslaus · 2020-06-05T03:39:11Z

lib/python/pygrass/utils.py

+                insert_sql = "INSERT INTO {}({}) VALUES %s;".format(
+                    table,
+                    ", ".join(structured_array.dtype.names),
+                    ",".join(["?"] * len(structured_array.dtype.names)),


Use named placeholders such as {table} and table= when there is more than one item, to avoid confusion. There seems to be some confusion here with %s and the third argument.
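For illustration, the statement could be built with named format fields as sketched below. The table and column names are made up, and the identifiers are assumed to be trusted because they are formatted into the SQL text; only the values go through `?` parameter binding.

```python
import sqlite3

table = "points"
columns = ("cat", "value")
rows = [(1, 1.5), (2, 2.5)]

# Each named field states what it is filled with.
insert_sql = "INSERT INTO {table} ({columns}) VALUES ({placeholders});".format(
    table=table,
    columns=", ".join(columns),
    placeholders=", ".join("?" for _ in columns),
)

connection = sqlite3.connect(":memory:")
with connection:
    connection.execute("CREATE TABLE points (cat INTEGER, value REAL)")
    connection.executemany(insert_sql, rows)

stored = connection.execute("SELECT cat, value FROM points ORDER BY cat").fetchall()
print(insert_sql)
```

With a structured numpy array, `structured_array.dtype.names` would supply the columns and `structured_array.tolist()` the row tuples.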

neteler · 2023-11-07T12:50:05Z

@ninsbl: would you mind rebasing this PR?

echoix · 2025-08-10T02:35:47Z

Fixed the bad merge by undoing that commit, doing a reset, then merging upstream/main again. Now there aren't 5000+ files changed with 2000+ commits; only a single file is changed, with about 3 commits.

wenzeslaus

@ninsbl Can you please review this older PR to see how/if your motivation for this changed over time?

The two functions are related through the original motivation, but they are independent as they stand now and differ in complexity, so splitting the PR into two would help to get at least parts of it merged faster.

wenzeslaus · 2025-12-15T21:31:31Z

python/grass/pygrass/utils.py

+    if type(tablestring).__name__ == "str":
+        tablestring = grasscore.encode(tablestring, encoding=encoding)
+    elif type(tablestring).__name__ != "bytes":


Why not test the types or instances (isinstance) instead of comparing type names as strings?
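A minimal sketch of the isinstance-based check follows. `to_bytes` is a hypothetical helper, and plain `str.encode` with a UTF-8 default stands in for `grasscore.encode`, whose behavior is not reproduced here.

```python
def to_bytes(tablestring, encoding="utf-8"):
    """Return the table text as bytes (hypothetical helper)."""
    if isinstance(tablestring, str):
        # Assumption: str.encode stands in for grasscore.encode.
        return tablestring.encode(encoding)
    if isinstance(tablestring, (bytes, bytearray)):
        return bytes(tablestring)
    raise TypeError(
        f"Expected str or bytes, got {type(tablestring).__name__}"
    )


print(to_bytes("cat,value\n1,1.5\n"))
```

Unlike a comparison of `type(x).__name__`, isinstance also accepts subclasses of `str` and `bytes`.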

wenzeslaus · 2025-12-15T21:32:46Z

python/grass/pygrass/utils.py

+    if structured:
+        kwargs["dtype"] = None
+
+    return np.genfromtxt(BytesIO(tablestring), **kwargs)


What is the new function adding on top of np.genfromtxt, the encoding?
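For comparison, this is roughly what a direct np.genfromtxt call gives for comma-separated text with a header line. The sample data is made up, and the encoding is passed explicitly here rather than relying on the default.

```python
from io import BytesIO

import numpy as np

# Made-up table text, as it might arrive from a tool's stdout.
table_bytes = b"cat,value,name\n1,1.5,a\n2,2.5,b\n"

array = np.genfromtxt(
    BytesIO(table_bytes),
    delimiter=",",
    names=True,  # take field names from the first line
    dtype=None,  # guess a type per column, giving a structured array
    encoding="utf-8",
)

print(array.dtype.names)
print(array["value"])
```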

wenzeslaus · 2025-12-15T21:35:25Z

python/grass/pygrass/utils.py

+def txt2numpy(
+    tablestring,
+    sep=",",
+    names=None,
+    null_value=None,
+    fill_value=None,
+    comments="#",
+    usecols=None,
+    encoding=None,
+    structured=True,
+):


Put this one into a separate PR. It seems much easier to get merged; perhaps it could be integrated into grass.tools if it is really useful. Or maybe improvements in format=json and format=csv, used together with np.genfromtxt, are a better route?

wenzeslaus · 2025-12-15T21:42:17Z

python/grass/pygrass/utils.py

+def numpy2table(
+    np_array,
+    table,
+    connection,
+    formats=None,
+    names=False,
+    column_prefix="column",
+    update_formats=True,
+    overwrite=True,
+):


If the motivation is the same as in the referenced https://trac.osgeo.org/grass/ticket/3639, wouldn't a tool which takes a CSV table and puts it into a database table be more fitting? Like db.in.ogr but specialized for CSV (db.in.table or db.in.csv).
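The core of such a CSV import can be sketched with the standard library alone. This is not an implementation of the suggested tool: the table name, the column types, and the sample data are assumptions, and the header is assumed to contain trusted column names.

```python
import csv
import sqlite3
from io import StringIO

# Made-up CSV content; a real tool would read this from a file.
csv_text = "cat,value,name\n1,1.5,a\n2,2.5,b\n"
reader = csv.reader(StringIO(csv_text))
header = next(reader)
rows = list(reader)

column_list = ", ".join(header)
placeholders = ", ".join("?" for _ in header)

connection = sqlite3.connect(":memory:")
with connection:
    # Declared column types let SQLite convert the CSV strings on insert.
    connection.execute(
        "CREATE TABLE imported (cat INTEGER, value REAL, name TEXT)"
    )
    connection.executemany(
        f"INSERT INTO imported ({column_list}) VALUES ({placeholders})", rows
    )

stored = connection.execute(
    "SELECT cat, value, name FROM imported ORDER BY cat"
).fetchall()
print(stored)
```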

add to/from numpy functions

6a5d024

ninsbl added the enhancement New feature or request label Mar 16, 2020

ninsbl requested review from wenzeslaus and zarch March 16, 2020 23:00

allow to return structured or unstructured arras

8fe4396

wenzeslaus reviewed Jun 5, 2020

neteler added the Python Related code is in Python label Dec 9, 2021

neteler added this to the 8.0.1 milestone Dec 9, 2021

ninsbl modified the milestones: 8.0.1, 8.2.0 Feb 20, 2022

wenzeslaus modified the milestones: 8.2.0, 8.4.0 Mar 16, 2022

wenzeslaus modified the milestones: 8.3.0, 8.4.0 Feb 10, 2023

neteler changed the title ~~add to/from numpy functions~~ libpython: add to/from numpy functions Nov 7, 2023

wenzeslaus modified the milestones: 8.4.0, 8.5.0 Apr 26, 2024

echoix added the conflicts/needs rebase Rebase to or merge with the latest base branch is needed label Jul 3, 2024

github-actions bot added the GUI (wxGUI related), docker (Docker related), CI (Continuous integration), raster (Related to raster data processing), temporal (Related to temporal data processing), and C (Related code is in C) labels Aug 30, 2024

echoix added the conflicts/needs rebase Rebase to or merge with the latest base branch is needed label Feb 26, 2025

Merge branch 'main' into pr/ninsbl/423

ccf8714

echoix force-pushed the numpy_pygrass branch from f29451a to ccf8714 Compare August 10, 2025 02:32

github-actions bot added the Python (Related code is in Python) and libraries labels Aug 10, 2025

echoix removed the conflicts/needs rebase Rebase to or merge with the latest base branch is needed label Aug 10, 2025

Apply ruff suggestions from code review

7610389

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

github-actions bot reviewed Aug 10, 2025

ninsbl and others added 2 commits August 10, 2025 22:48

Apply suggestions from code review

0e5ff19

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Merge branch 'main' into numpy_pygrass

98f9d21

wenzeslaus requested changes Dec 15, 2025

wenzeslaus modified the milestones: 8.5.0, 8.6.0 Dec 15, 2025


5 participants