***********************
Computational Integrity
***********************

The integrity module of Opaque ensures that the untrusted job driver hosted on the cloud service schedules tasks in the manner computed by Spark's Catalyst query optimizer.
Opaque runs on Spark, which partitions data to speed up computation.
Specifically, Catalyst computes a physical query plan for a given dataframe query and delegates Spark SQL operations on data partitions to Spark workers, which run inside enclaves.
Each of these individual units is trusted, but the intermediary steps in which the units communicate are controlled by the job driver, which runs as untrusted code in the cloud.
The integrity module detects foul play by the job driver, including deviating from the query plan computed by Catalyst,
shuffling data across partitions in an unexpected manner, spoofing extra data between ecalls, or dropping output between ecalls.

Overview
--------
The main idea behind integrity support is to tag each step of computation with a MAC over the enclave worker's encrypted output, attached by the worker when it completes its computation.
Each worker also logs the MACs it received from previous workers. At the end of the job, during post verification, these MACs, each of which represents an ecall on a data partition, are reconstructed into a graph.
This graph is compared to the DAG of the query plan computed by Catalyst.
If the graphs are isomorphic, then no tampering has occurred;
otherwise, the result of the query returned by the cloud is rejected.
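
As a rough illustration, here is a minimal Scala sketch of that final comparison. The names are hypothetical, and for simplicity the DAG is modeled as a tree rooted at the last ecall:

.. code-block:: scala

   // Each node is one ecall on one data partition; its children are its
   // upstream ecalls, i.e. the nodes whose output it consumed.
   case class JobNode(ecall: String, children: Seq[JobNode] = Seq.empty)

   // Two DAGs match if the roots run the same ecall and their child
   // subtrees match pairwise in some order. (Exponential in fan-in;
   // fine for a sketch, not for production.)
   def sameDag(a: JobNode, b: JobNode): Boolean =
     a.ecall == b.ecall &&
       a.children.length == b.children.length &&
       a.children.permutations.exists(perm =>
         perm.zip(b.children).forall { case (x, y) => sameDag(x, y) })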

Implementation
--------------
Two main extensions were made to support integrity: one in the enclave code, and one in the Scala client application.

Enclave Code
^^^^^^^^^^^^
In the enclave code (C++), modifications were made to ``FlatbuffersWriters.cpp`` and ``FlatbuffersReaders.cpp``.
The "write" change attaches a MAC over the ``EncryptedBlocks`` object to the output.
The "read" change checks whether all blocks that were output from the previous ecall were received by the subsequent ecall.
No further modifications need to be made to the application logic since this functionality hooks into how Opaque workers output their data.
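
As an illustration of the protocol these two changes implement, here is a minimal Scala sketch (the real logic is C++ inside the enclave, and all names below are hypothetical): the writer tags its serialized output with an HMAC, and the reader accepts only if the set of MACs it received equals the set the previous ecall produced.

.. code-block:: scala

   import javax.crypto.Mac
   import javax.crypto.spec.SecretKeySpec

   // Write side: tag the serialized EncryptedBlocks bytes with a MAC.
   def macOf(key: Array[Byte], encryptedBlocks: Array[Byte]): Seq[Byte] = {
     val hmac = Mac.getInstance("HmacSHA256")
     hmac.init(new SecretKeySpec(key, "HmacSHA256"))
     hmac.doFinal(encryptedBlocks).toSeq
   }

   // Read side: accept only if every block output by the previous ecall
   // arrived at this ecall -- nothing dropped, nothing spoofed.
   def allBlocksReceived(sent: Set[Seq[Byte]], received: Set[Seq[Byte]]): Boolean =
     sent == received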

Scala/Application Code
^^^^^^^^^^^^^^^^^^^^^^
The main extension supporting integrity is the ``JobVerificationEngine``, a piece of Scala code that broadly carries out three tasks:

1. Reconstruct the flow of information between enclave workers.

2. Compute the corresponding DAG of ecalls for a given query.

3. Compare the two DAGs and output "accept" or "reject."

These happen in the ``verify`` function of the ``JobVerificationEngine`` class, as outlined in the sketch below.
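
A skeleton of that flow, not the actual implementation: the helper names are hypothetical, ``MacRecord`` and the DAG reconstruction are sketched in the next paragraphs, and ``sameDag`` is the comparison sketched in the overview.

.. code-block:: scala

   // verify ties the three tasks together; each helper stands in for
   // logic described in the surrounding sections.
   def verify(logEntryChain: Seq[MacRecord], plan: SparkPlan): Boolean = {
     val executed = reconstructExecutedDag(logEntryChain) // task 1
     val expected = expectedEcallDag(plan)                // task 2
     sameDag(executed, expected)                          // task 3: accept or reject
   }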

Reconstructing the executed DAG of ecalls involves iterating through the MACs attached by enclave workers, which are provided in the ``LogEntryChain`` object in the Job Verification Engine.
Opaque fills this object when Spark's ``collect`` method is called to execute a query.

Output MACs of parents correspond to the input MACs of their children; using this correspondence, the DAG is created, as in the sketch below.
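
A minimal sketch of that reconstruction, assuming a hypothetical ``MacRecord`` that pairs each ecall invocation with the MACs it read and wrote:

.. code-block:: scala

   case class MacRecord(ecall: String,
                        inputMacs: Set[Seq[Byte]],
                        outputMacs: Set[Seq[Byte]])

   // Draw an edge from A to B whenever one of A's output MACs appears
   // among B's input MACs.
   def edges(log: Seq[MacRecord]): Seq[(MacRecord, MacRecord)] =
     for {
       parent <- log
       child  <- log
       if parent.outputMacs.intersect(child.inputMacs).nonEmpty
     } yield (parent, child)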

The "expected" DAG is created from Spark's ``dataframe.queryExecution.executedPlan`` object, which is a recursive tree of Spark operator nodes.
The Job Verification Engine contains the logic to transform this tree of operators into a tree of ecalls.
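
The shape of that transformation might look like the following sketch; the operator and ecall names here are illustrative, not the engine's actual mapping:

.. code-block:: scala

   sealed trait Operator
   case object Project extends Operator
   case object Sort extends Operator

   // One Spark operator can expand into several ecalls; a sort, for
   // example, needs range partitioning before the per-partition sort.
   def ecallsFor(op: Operator): Seq[String] = op match {
     case Project => Seq("project")
     case Sort    => Seq("sample", "findRangeBounds", "partitionForSort", "externalSort")
   }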

Adding Integrity Support for New Operators
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
To support a new operator, one should make changes to both the enclave code and the Job Verification Engine.

In the enclave, make sure that the enclave context's ``finish_ecall`` method is called before returning in ``Enclave.cpp``.

In the Job Verification Engine, add the logic that transforms the operator into the list of ecalls it uses in ``generateJobNodes``.
This amounts to adding a case to the match statement in this function.

Furthermore, add the logic to connect the ecalls together in ``linkEcalls``.
As above, this amounts to adding a case to the match statement in this function, but it requires knowing how each ecall hands its data partitions off to its successor ecall
(broadcast, all to one, one to all, etc.); see the sketch below.
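
A hedged sketch of the two additions for a hypothetical new operator; every name below is made up for illustration, and the real functions contain many more cases:

.. code-block:: scala

   sealed trait Operator
   case object MyNewOperator extends Operator

   // generateJobNodes-style case: map the new operator to its ecalls.
   def jobNodesFor(op: Operator): Seq[String] = op match {
     case MyNewOperator => Seq("myNewEcall")
   }

   // linkEcalls-style case: describe how the ecall's partitions feed its
   // successor. Here we assume an all-to-one shuffle: every partition
   // sends its output to partition 0 of the next ecall.
   def linksFor(ecall: String, numPartitions: Int): Seq[(Int, Int)] = ecall match {
     case "myNewEcall" => (0 until numPartitions).map(p => (p, 0))
   }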

Usage
^^^^^
To use the Job Verification Engine as a black box, make sure that its state is flushed by calling its ``resetForNextJob`` function.
Then call ``Utils.verifyJob`` on the query dataframe; it returns a boolean indicating whether the job passed post verification:
``true`` if the job passed, ``false`` otherwise.
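
A minimal usage sketch, assuming ``df`` is a dataframe over encrypted data and the engine is exposed as a singleton object:

.. code-block:: scala

   // Flush any state left over from a previous job, then verify this one.
   JobVerificationEngine.resetForNextJob()
   val passed: Boolean = Utils.verifyJob(df)
   println(if (passed) "post verification: accept" else "post verification: reject")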