Generate graphs using cell sets as unifying concept #24

ubyndr · 2023-07-14T13:07:42Z

Resolves #26

TODO:

Currently, we use enriched_df to add cell type terms to the graph. However, a cell type that has no subClassOf relation to any other cell type is missing from the graph. It would be better to use the co_annotation report to add the cell type terms. I use the obs attribute of the anndata object to generate a dictionary of cell type IDs and labels, and then use that dictionary to add the cell type terms to the graph. We can then use enriched_df only to add the subClassOf relations between those terms. The cell terms missing from the neo4j UI are caused by the way I currently add the cell terms; I will fix this by updating the enrich_rdf_graph method.
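As a rough illustration of the dictionary step described above, here is a minimal sketch. It assumes the obs table exposes the CELLxGENE schema columns `cell_type_ontology_term_id` and `cell_type`; the helper name `build_cell_type_dict` and the use of plain dicts in place of obs rows are hypothetical, chosen only to keep the example dependency-free. It is not necessarily how the PR builds the dictionary.

```python
def build_cell_type_dict(obs_rows):
    """Map cell type CURIEs to labels, keeping the first label seen per ID.

    `obs_rows` is any iterable of mappings standing in for rows of
    `adata.obs` (hypothetical stand-in, not the real AnnData API).
    """
    cell_types = {}
    for row in obs_rows:
        curie = row["cell_type_ontology_term_id"]
        label = row["cell_type"]
        # setdefault leaves an existing entry untouched, so duplicates are ignored.
        cell_types.setdefault(curie, label)
    return cell_types


# Toy rows standing in for per-cell annotations in obs.
obs_rows = [
    {"cell_type_ontology_term_id": "CL:0000084", "cell_type": "T cell"},
    {"cell_type_ontology_term_id": "CL:0000236", "cell_type": "B cell"},
    {"cell_type_ontology_term_id": "CL:0000084", "cell_type": "T cell"},
]
cell_type_dict = build_cell_type_dict(obs_rows)
```

Because the dictionary is built from every annotated cell rather than from enrichment results, a term with no subClassOf relation to another term still gets an entry.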

ubyndr · 2023-07-14T13:13:56Z

pandasaurus_cxg/graph_generator/graph_generator.py

+    def visualize_rdf_graph(self):
+        nx_graph = rdflib_to_networkx_multidigraph(self.graph)
+        # Plot Networkx instance of RDF Graph
+        pos = nx.spring_layout(nx_graph, scale=2, k=2)
+        edge_labels = nx.get_edge_attributes(nx_graph, "r")
+        nx.draw_networkx_edge_labels(nx_graph, pos, edge_labels=edge_labels)
+        nx.draw(nx_graph, with_labels=True)
+        plt.show()


This is just a placeholder; I have used OBASK to visualize graphs and examine them for validation purposes.

This is a place where oaklib could really help. Worth talking to @anitacaron about how she uses it for visualising validation.

I had a conversation with Anita yesterday in the office. From what I gathered, it seems that visualising the relations and neighbours of a set of terms is necessary. However, I am unsure if this is something we actually need.

I didn't know the context. I can have a closer look into oaklib.

There's the OboGraph Interface

ubyndr · 2023-07-14T13:22:40Z

pandasaurus_cxg/graph_generator/graph_generator.py

+        cl_namespace = Namespace("http://purl.obolibrary.org/obo/CL_")
+        for curie, label in self.cell_type_dict.items():
+            resource = cl_namespace[curie.split(":")[-1]]
+            self.graph.add((resource, RDFS.label, Literal(label)))
+            for s, _, _ in self.graph.triples((None, self.ns["cell_type"], Literal(label))):
+                self.graph.add((s, self.ns["consists_of"], resource))
+        # add subClassOf between terms in CL enrichment
+        for _, row in self.enriched_df.iterrows():
+            for s, _, _ in self.graph.triples((None, RDFS.label, Literal(row["s_label"]))):
+                for o, _, _ in self.graph.triples((None, RDFS.label, Literal(row["o_label"]))):
+                    self.graph.add((s, RDFS.subClassOf, o))


I'm not sure whether any method other than simple_enrichment contributes additional information to the graph, because those methods may bring in CL terms from a subset or context that are not used in the annotations.
We should talk about this.

By definition all enrichment methods link to terms related to those used in annotation.

Looking at this again, I think it is clear that I have underspecified. I think the challenge is how we deal with flattening. In the pipelines Anita has worked on, we use ROBOT or Souffle to strip redundancy from the flattened graph. If we're sticking with pure Python, we will need something similar to the Souffle redundancy-stripping algorithm here, which will require some thought.
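To make the redundancy-stripping idea concrete, here is a minimal pure-Python sketch of a transitive reduction over subClassOf edges. It is not the Souffle or ROBOT algorithm, and the function name and the `(child, parent)` edge representation are assumptions for illustration. It also assumes the edges form a DAG (no subClassOf cycles).

```python
from collections import defaultdict


def transitive_reduction(edges):
    """Return the (child, parent) edges that are not implied by a longer path.

    Assumes `edges` describes a DAG; behaviour on cyclic input is not defined.
    """
    parents = defaultdict(set)
    for child, parent in edges:
        parents[child].add(parent)

    def reachable(start):
        # Collect `start` and every ancestor reachable from it.
        seen, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            stack.extend(parents.get(node, ()))
        return seen

    reduced = set()
    for child, parent in edges:
        # An edge is redundant if another direct parent of `child`
        # already leads to `parent`.
        others = parents[child] - {parent}
        if not any(parent in reachable(other) for other in others):
            reduced.add((child, parent))
    return reduced


# Toy flattened graph: the direct edge to "leukocyte" is implied by the other two.
flattened = {
    ("T cell", "lymphocyte"),
    ("lymphocyte", "leukocyte"),
    ("T cell", "leukocyte"),
}
reduced = transitive_reduction(flattened)
```

This version recomputes reachability for every edge, which is fine for small enrichment graphs; a larger graph would probably want cached ancestor sets or a library routine.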

For this PR I'd suggest an MVP of building a graph based on co-annotation first. We can then move folding in the enrichment graph to a second ticket/PR.

* Merged from main
* Updated anndata_analyzer.py
* Removed state and state.l2 from free-text annotations
* Refactored visualize_rdf_graph method
* Refactored save_rdf_graph, visualize_rdf_graph method and added transitive_reduction method
* Format changes in co_annotation_report
* Added state and state.l2 to free-text annotations

ubyndr added 7 commits July 7, 2023 12:40

Initial commit

f115893

Refactor

8bbb94c

Refactored anndata_enricher.py

8c3ef0e

Added docstring to _assign_predicate and refactored _remove_duplicates

735c33f

Implemented missing methods in graph_generator.py

14f1b9f

Added InvalidGraphFormat and MissingEnrichmentProcess exceptions

7410482

Updated .gitignore

607c9b2

ubyndr requested review from dosumis and hkir-dev July 14, 2023 13:07

ubyndr commented Jul 14, 2023

ubyndr added 5 commits July 18, 2023 17:00

Added CellType to nodes that represent CL terms

aca0f73

Added pygraphviz dependency

9438edc

Refactored visualize_rdf_graph method

0e7248f

Added consists_of relations as OWL.Restriction

ee69211

Format changes

95c926a

ubyndr linked an issue Jul 25, 2023 that may be closed by this pull request

Extend cell set graphs with (cell) ontology classification. from enrichment #27

Closed

ubyndr added 2 commits July 25, 2023 12:37

Refactored visualize_rdf_graph method

dda2fa2

Updated walkthrough.ipynb

bb04f1e

ubyndr force-pushed the 13-generate-graphs-using-cell-sets-as-unifying-concept branch from a221a81 to bb04f1e Compare July 25, 2023 11:44

Ismail Ugur Bayindir and others added 10 commits July 25, 2023 12:44

Merge branch 'main' into 13-generate-graphs-using-cell-sets-as-unifyi…

4117cd4

…ng-concept

Updated .gitignore

7698ed5

Merged from main

1c3fb91

Updated anndata_analyzer.py

4684594

Removed state and state.l2 from free-text annotations

45312ca

Refactored visualize_rdf_graph method

54da01d

Resolved conflicts

6f8d4fa

Refactored cell_type_dict initialization

625b237

Revert "Refactored cell_type_dict initialization"

0ad1b4e

This reverts commit 625b237.

ubyndr added 5 commits August 2, 2023 10:56

Refactored cell_type_dict initialization

a38e12d

Added oaklib

227f137

Added logging to transitive_reduction and refactored visualize_rdf_gr…

ecb8b88

…aph, transitive_reduction methods

Refactored edge_data generation

7930f81

Fixed issues in nx_graph generation

dc71801

dosumis approved these changes Aug 8, 2023

ubyndr and others added 3 commits August 8, 2023 16:05

Refactored logging configuration, and added add_label_to_terms method

b5427b7

Added add_node method

026071e

Merge branch 'main' into 13-generate-graphs-using-cell-sets-as-unifyi…

f7108a9

…ng-concept

ubyndr merged commit c356303 into main Aug 8, 2023

ubyndr pushed a commit that referenced this pull request Aug 8, 2023

Revert "Generate graphs using cell sets as unifying concept (#24)"

8bf7c43

This reverts commit c356303.

ubyndr mentioned this pull request Aug 10, 2023

Add label settting function for co-annotation graph generation #30

Closed

ubyndr linked an issue Aug 10, 2023 that may be closed by this pull request

Add label settting function for co-annotation graph generation #30

Closed

ubyndr deleted the 13-generate-graphs-using-cell-sets-as-unifying-concept branch September 29, 2023 14:22
