Improve and rename persistentID to trackID #784

lfcnassif · 2021-10-16T13:52:32Z

This is an ID that doesn't change between different runs used when resuming processing to skip already processed items. It is built hashing different concatenated IDs: path, idInDataSource (eg. sleuthID, ad1ID, ufdrID), subitemId, parentContainerPersistentID.

Current name isn't intuitive. Although it is not unique across different cases (it is not an UUID), changing it to globalID seems more user friendly. Any other suggestion?

hauck-jvsh · 2021-11-09T19:36:52Z

Just let me if you change this name, as it is used in the SARD project.

lfcnassif · 2021-12-02T13:54:28Z

Just saw we are not using the datasource UUID in the computation. If we include it, this ID would really work like an UUID and will be unique across cases, that would be great. Renaming to globalID then will totally make sense.

lfcnassif · 2021-12-02T14:23:01Z

parentContainerPersistentID

Just an explanation why this is used. We can have 2 different files with this same path (one allocated and other deleted):
/root/a/b.zip/c/d.txt
/root/a/b.zip/c/d.txt

The two d.txt files have no idInDatasource (they are subitems from zip, they aren't allocated) and their subitemId could be equal if they were extracted from 2 different b.zip files (one allocated and other deleted). So b.zip globalID (that uses b.zip IdInDatasource) must be used in the computation of d.txt globalID

lfcnassif · 2022-01-20T16:46:49Z

Just saw we are not using the datasource UUID in the computation. If we include it, this ID would really work like an UUID and will be unique across cases, that would be great.

Thinking better about this, do we need an "UUID" for items in iped? Would it be useful in multicases? This change would make #918 very difficult, I doubt users will know/remember they need to specify the evidence UUID when re-processing cases to import old bookmarks later. Maybe this change to include the evidence UUID in globalID computation can be made just into ElasticSearchTask, what do you think @hauck-jvsh?

hauck-jvsh · 2022-01-20T16:52:09Z

I think it could be used only in the elastic ID, I think you could also maintain the persistentID in elastic just not as _id field which must be unique.

lfcnassif · 2022-01-20T18:56:52Z

I think you could also maintain the persistentID in elastic just not as _id field which must be unique.

This was done to make the --continue option work when resuming a processing to ElasticSearch instead of having to delete a remote index and start the processing from beginning again. I think including the evidence UUID into persistentID/globalID computation should be enough to avoid _id conflicts between elastic cases.

lfcnassif · 2022-01-20T19:10:14Z

@hauck-jvsh what field are you using to store bookmarks into in elastic, _id?

edit: I mean, to correlate bookmarks to items?

hauck-jvsh · 2022-01-20T19:25:44Z

Currently I'm using the _id just to find the item and then set a new metadata with the bookmark in the item.

lfcnassif · 2022-01-20T21:25:05Z

@hauck-jvsh, I changed the attribute names persistentId->globalId, parentPersistentId->parentGlobalId, parentContainerPersistentId->containerGlobalId, and also ElasticSearchTask contentPersistentId -> contentGlobalId to follow the new naming convention.

hauck-jvsh · 2022-01-21T13:35:20Z

After that commit an error is occurring when processing cases, see the log file attached.
IPED-2022-01-21-10-25-00.log

lfcnassif · 2022-01-21T14:12:54Z

Thanks @hauck-jvsh, I'll take a look.

Actually I'm still not convinced about the new globalID attribute name, since it could repeat across cases without including the evidenceUUID in the computation. As you said, we can create a real UUID for items for possible future use in a new attribute (maybe using the globalID name), I like this idea.

But about persistentID renaming, I thought about more options: fixedID, constantID, constID. What do you think? @tc-wleite do have any suggestion?

wladimirleite · 2022-01-21T16:32:16Z

After that commit an error is occurring when processing cases, see the log file attached. IPED-2022-01-21-10-25-00.log

Processing an E01 image worked, but when I tried to process a folder, got a similar exception here.

wladimirleite · 2022-01-21T16:34:00Z

But about persistentID renaming, I thought about more options: fixedID, constantID, constID. What do you think? @tc-wleite do have any suggestion?

I was following the discussions around this issue, but I am not sure what would be the best option.

hauck-jvsh · 2022-01-25T18:32:52Z

There is also the multivalued parentIds property (different from parentId) with all item parents, used to allow fast filtering on file tree

I also use it to allow filtering using the file tree in the web interface.

lfcnassif · 2022-01-25T18:43:28Z

Have you tested an implementation with just parentId, right? Did it have a noticeable performance impact?

hauck-jvsh · 2022-01-25T18:51:22Z

Have you tested an implementation with just parentId, right? Did it have a noticeable performance impact?

I couldn't make the searches, because I have to filter items that has in their parentdIds the ids of the selected items.

lfcnassif · 2022-01-25T19:00:32Z

I couldn't make the searches, because I have to filter items that has in their parentdIds the ids of the selected items.

I see, this would need some recursive search, possibly Elastic doesn't have a support for that, but we could try to implement this inside iped...

- this property is needed when resuming processing to get a previous parent id referenced by subitems which parents were not commited, then when reprocessing parents, their id can be updated to the previous value, so parent-child relationships will be preserved.

- fix embedded disks subitems references to parentGlobalID

lfcnassif added the enhancement label Oct 16, 2021

lfcnassif changed the title ~~Rename persistentID to globalID~~ Improve and rename persistentID to globalID Dec 2, 2021

lfcnassif self-assigned this Dec 2, 2021

lfcnassif mentioned this issue Jan 12, 2022

Import bookmarked items from old case to a new case with the same evidence #918

Open

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: rename persistentId variables and strings to globalId

6acadb3

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: combine globalID + evidenceUUID to make elastic _id unique

be191dc

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: rename persistentId variables and strings to globalId

3e1a996

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: combine globalID + evidenceUUID to make elastic _id unique

1f4e424

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: throw exception if needed param to compute globalId is not net

e798cb8

lfcnassif added a commit that referenced this issue Jan 20, 2022

#784: combine globalID + evidenceUUID to make elastic _id unique

5bfcf2c

lfcnassif mentioned this issue Jan 20, 2022

#784 rename persistentId to globalId #934

Merged

lfcnassif closed this as completed in #934 Jan 20, 2022

hauck-jvsh reopened this Jan 21, 2022

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: additional minor checking for null invalid item paths

8d40edf

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: rename globalID (previous persistentID) to trackID

6537d2f

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: compute and store a real globalID unique across cases

700bc70

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: compute and store a real globalID unique across cases

b876fcb

lfcnassif mentioned this issue Jan 25, 2022

Subitems or items from UFDR may point to inexistent parent when resuming processing #941

Closed

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: fix processing of recursive disks after commit ea81e2f

2be79a5

lfcnassif added a commit that referenced this issue Jan 25, 2022

#784: fix processing of recursive disks after commit ea81e2f

0aa4519

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784/#792 improve globalID formula, throw exception if something is bad

9abaeab

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: store parentGlobalID for all items produced by FolderTreeReader

daf13ab

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: store parentGlobalID for all items from SleuthkitReader and also:

53d443b

- fix embedded disks subitems references to parentGlobalID

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: remove parentIdInDataSource setters & getters, not needed anymore

d599e31

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: additional minor checking for null invalid item paths

6b352c9

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: rename globalID (previous persistentID) to trackID

b0eb4f3

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: compute and store a real globalID unique across cases

33584b0

lfcnassif added a commit that referenced this issue Jan 26, 2022

#784: fix processing of recursive disks after commit 53d443b

7eee8e1

lfcnassif added a commit that referenced this issue Jan 28, 2022

#784: fix trackID computation for file fragments

379c10a

lfcnassif closed this as completed in #937 Jan 28, 2022

lfcnassif changed the title ~~Improve and rename persistentID to globalID~~ Improve and rename persistentID to trackID Jan 29, 2022

lfcnassif added a commit that referenced this issue Feb 10, 2022

#784: always set parentIdInDataSource in all dataSourceReaders

a6cee81

lfcnassif added a commit that referenced this issue Feb 10, 2022

#784: copy trackID/persistentId formula from master

a1d43b5

lfcnassif added a commit that referenced this issue Feb 11, 2022

#784: fix carving in folder datasource after commit a6cee81

1cedc57

lfcnassif added a commit that referenced this issue Feb 11, 2022

#784: improve a6cee81: better values for idInDataSource for local files

28aa966

lfcnassif added a commit that referenced this issue Feb 11, 2022

#784: improve a7e26c5: better values for idInDataSource for local files

47f2015

lfcnassif added a commit that referenced this issue Feb 11, 2022

#784: improve a6cee81: better values for idInDataSource for local files

f32b478

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve and rename persistentID to trackID #784

Improve and rename persistentID to trackID #784

lfcnassif commented Oct 16, 2021 •

edited

Loading

hauck-jvsh commented Nov 9, 2021

lfcnassif commented Dec 2, 2021

lfcnassif commented Dec 2, 2021 •

edited

Loading

lfcnassif commented Jan 20, 2022 •

edited

Loading

hauck-jvsh commented Jan 20, 2022

lfcnassif commented Jan 20, 2022 •

edited

Loading

lfcnassif commented Jan 20, 2022 •

edited

Loading

hauck-jvsh commented Jan 20, 2022 •

edited

Loading

lfcnassif commented Jan 20, 2022

hauck-jvsh commented Jan 21, 2022

lfcnassif commented Jan 21, 2022

wladimirleite commented Jan 21, 2022

wladimirleite commented Jan 21, 2022

hauck-jvsh commented Jan 25, 2022 •

edited

Loading

lfcnassif commented Jan 25, 2022

hauck-jvsh commented Jan 25, 2022

lfcnassif commented Jan 25, 2022 •

edited

Loading

Improve and rename persistentID to trackID #784

Improve and rename persistentID to trackID #784

Comments

lfcnassif commented Oct 16, 2021 • edited Loading

hauck-jvsh commented Nov 9, 2021

lfcnassif commented Dec 2, 2021

lfcnassif commented Dec 2, 2021 • edited Loading

lfcnassif commented Jan 20, 2022 • edited Loading

hauck-jvsh commented Jan 20, 2022

lfcnassif commented Jan 20, 2022 • edited Loading

lfcnassif commented Jan 20, 2022 • edited Loading

hauck-jvsh commented Jan 20, 2022 • edited Loading

lfcnassif commented Jan 20, 2022

hauck-jvsh commented Jan 21, 2022

lfcnassif commented Jan 21, 2022

wladimirleite commented Jan 21, 2022

wladimirleite commented Jan 21, 2022

hauck-jvsh commented Jan 25, 2022 • edited Loading

lfcnassif commented Jan 25, 2022

hauck-jvsh commented Jan 25, 2022

lfcnassif commented Jan 25, 2022 • edited Loading

lfcnassif commented Oct 16, 2021 •

edited

Loading

lfcnassif commented Dec 2, 2021 •

edited

Loading

lfcnassif commented Jan 20, 2022 •

edited

Loading

lfcnassif commented Jan 20, 2022 •

edited

Loading

lfcnassif commented Jan 20, 2022 •

edited

Loading

hauck-jvsh commented Jan 20, 2022 •

edited

Loading

hauck-jvsh commented Jan 25, 2022 •

edited

Loading

lfcnassif commented Jan 25, 2022 •

edited

Loading