[SaltXMLExporter]: SCorpusGraphImpl's naming of documents seems broken #128

This issue came to light when using a Pepper importer that relied on [`SCorpusGraphImpl`](https://github.com/korpling/salt/blob/develop/salt-api/src/main/java/org/corpus_tools/salt/common/impl/SCorpusGraphImpl.java)'s [default implementation](https://github.com/korpling/salt/blob/develop/salt-api/src/main/java/org/corpus_tools/salt/common/impl/SCorpusGraphImpl.java#L310-L315) of document naming:

```java
String namePart = null;
namePart = document.getName();
if (Strings.isNullOrEmpty(namePart)) {
    namePart = "doc_" + getCorpora().size();
}
GraphFactory.createIdentifier(document, URI.createURI(corpus.getId() + "/" + namePart).toString());
```

Relying on this implementation produced `ConcurrentModificationException`s at runtime. These seem to have gone away once document names were set explicitly, in this case by falling back on `PepperImporterImpl`'s [default implementation](https://github.com/korpling/pepper/blob/develop/pepper-framework/src/main/java/org/corpus_tools/pepper/impl/PepperImporterImpl.java#L278-L285) of document naming in `#importCorpusStructureRec(URI currURI, SCorpus parent)`:

```java
SDocument sDocument = null;
if (docFile.isDirectory()) {
    sDocument = getCorpusGraph().createDocument(parent, currURI.lastSegment());
} else {
    // if uri is a file, cut off file ending
    sDocument = getCorpusGraph().createDocument(parent,
            currURI.lastSegment().replace("." + currURI.fileExtension(), ""));
}
```

The line `namePart = "doc_" + getCorpora().size();` in particular doesn't quite sit right. At any time, a document named `doc_n` could be created while `getCorpora().size() < n`; once `getCorpora().size() == n`, the next unnamed document would also be named `doc_n`, so we'd have two documents of the same name. The name is translated into the document ID, which in turn is used by Pepper to calculate execution paths. If the two documents sit in the same corpus, this is likely to trigger a `ConcurrentModificationException` on either document's lists of nodes, etc., as encountered above.
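To make the collision concrete, here is a minimal, self-contained sketch. It does not use the Salt API: `DocNameSketch` and all of its methods are hypothetical stand-ins that model document IDs as plain strings. It only illustrates how a default name derived from the corpus count can clash with an existing name; it does not reproduce the `ConcurrentModificationException` itself. `addDocumentUnique` shows one possible alternative (counting upwards until the ID is free), not a proposed patch.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-in for a corpus graph; NOT part of the Salt API. */
class DocNameSketch {
    private final List<String> corpora = new ArrayList<>();
    private final Set<String> documentIds = new LinkedHashSet<>();

    void addCorpus(String corpusId) {
        corpora.add(corpusId);
    }

    /** Mirrors the quoted fallback: the default name depends on the corpus count. */
    String defaultName(String name) {
        return (name == null || name.isEmpty()) ? "doc_" + corpora.size() : name;
    }

    /** Registers a document ID; returns false if that ID is already taken. */
    boolean addDocument(String corpusId, String name) {
        return documentIds.add(corpusId + "/" + defaultName(name));
    }

    /** Assumed alternative: append a counter until the ID is unused. */
    String addDocumentUnique(String corpusId, String name) {
        boolean unnamed = name == null || name.isEmpty();
        int counter = 0;
        String candidate = unnamed ? "doc_" + counter : name;
        while (!documentIds.add(corpusId + "/" + candidate)) {
            counter++;
            candidate = (unnamed ? "doc" : name) + "_" + counter;
        }
        return candidate;
    }

    public static void main(String[] args) {
        DocNameSketch graph = new DocNameSketch();
        graph.addCorpus("corpus1");
        // An explicitly named document "doc_2" while only one corpus exists.
        System.out.println(graph.addDocument("corpus1", "doc_2")); // true
        graph.addCorpus("corpus2");
        // The corpus count is now 2, so the unnamed document also becomes "doc_2".
        System.out.println(graph.addDocument("corpus1", null));    // false: ID already taken
        // The counter-based variant skips names that are in use.
        System.out.println(graph.addDocumentUnique("corpus1", null)); // doc_0
    }
}
```

In this model, two unnamed documents added to the same corpus without a corpus being added in between would also receive the same default name, since the corpus count does not change.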
