-
Hi, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 22 replies
-
what's a pentaho? do you export the connections with the metadata when you register your workflows/pipelines on the server? |
Beta Was this translation helpful? Give feedback.
-
I'll try to write a full and cohesive answer.
IntroHop ServerWhat it is What it isn't By circling back to what it is, we can also discuss why it is poorly documented. We do not want it to be used in a stand-alone way; it wasn't made for this. We did add the endpoints to our documentation because people were asking for them, but honestly, it was never designed to be used without using the GUI or Hop Run. There are better ways, eg. using short lived containers which provide more flexibility and in combination with airflow (tutorial) you can also use webhooks to start things. Shameless plug: We (know.bi) are working on something better which we hope to showcase soon. This covers our disclaimer, let's get back to the subject. Running a single pipelineThe process to start something on Hop Server is split up in 3 different categories:
let's discuss starting a single pipeline, there are 3 steps that need to be taken to start a pipeline on Hop Server. registerPipelineThe first step is to send the pipeline and all needed environment information to the server. As stated before the server is stateless so it knows nothing it needs all information to create a successful execution.
3 blocks of information need to be included in this request: Each variable looks like metastore_json: It boils down to a json containing all objects you have defined in the metadata perspective/metadata folder. example if you only have a PostgreSQL connection, but it also needs to contain your run targets and all other objects that are available in your metadata folder. Another note: each database type can have different fields (just like in the UI) most of them are shared, but eg MSSQL Server has more fields.
After building and sending this request to the server (POST) you will get a response:
prepareExecAfter you get back the Id you have to hit the prepareExec with a GET request response:
This will prepare the pipeline for execution and it will enter a "waiting state" startExecThe final step is a GET to startExec to start the actual execution
response
You can follow up how everything is going with the pipelineStatus endpoint. Closing noteThese steps should help you use the REST API directly to start a pipeline, running a single workflow is a similar process. Happy coding, |
Beta Was this translation helpful? Give feedback.
I'll try to write a full and cohesive answer.
Let's start by rephrasing the original question:
Intro
Hop Server
What it is
Hop server is a stateless server, its main purpose is to be used as an extension to Hop GUI to run a pipeline or workflow in a remote environment
What it isn't
Hop Server isn't the typical server that you would use for scheduling/monitoring it does not retain state or history, it does not store this information (except in memory for a short period). After a restart all previous information is lost.
By circling back to what it is, we can al…