Skip to content

Latest commit

 

History

History
41 lines (36 loc) · 1.12 KB

TODO.md

File metadata and controls

41 lines (36 loc) · 1.12 KB

Data wrangling

  • Summary ** data wrangling how data are transformed, from document to other format type ** data source how to send data into a a database, just the communication with a database ** data connector just the basic connector type ssh, api, database

  • define the minimal code which willl be pymedext core without to much annotators

  • general recommendations

  • use logger instead of print
  • comments
  • UML schema of the code
  • Data wrangling [0%]
  • [-] add doccano
  • add Brat
  • BioC
  • COnnL
  • FhiR
  • universal dependencies
  • BioNLP
  • data source [20%]
  • Source abstract class factorize
  • OmopSource with optimize load function
  • FhirSource APIconnection to FhiR
  • doccanoSource APIConnection to doccano api
  • bratSource SSHconnection send data to server
  • data connector [28%]
  • factorize the code better connection between each class
  • Connector general and abstract class
  • ApiConnector factorize doccano code
  • DatabaseConnector factorize
  • sshConnector within paramiko
  • POstgreConnector
  • CxOracleConnector (done but not provided)