Rough spike at adding Java driver #15

catsby · 2015-09-01T22:08:31Z

⚠️ 🚧
This is a spike for adding a java driver, copied-paste and butchered from the standard exec driver.

It's pretty simple and naive. I submit now for feedback on the approach, or to catch any major flaws in my simple thinking

Does:

accepts a jar file from a remote location (http(s)://)
downloads into the AllocDir
executes with java -jar <file> [args]

Assumes:

publicly accessible jar file for download
file is http(s) (not ftp, ssh)

The jar file referenced should be public to anyone (at time of writing). It's a simple app the outputs "Hi" every few seconds. The source can be found here:

https://gist.github.com/catsby/953e237773f920abdace

TODOs

Abstract out the http fetching into a Fetcher interface of sorts
Test with a batch-like Java app, that does not run indefinitely, to confirm exits etc
Probably several other things

At this point, I need feedback on what I'm doing right/wrong 😐

cbednarski · 2015-09-01T22:51:15Z

client/driver/java.go

+
+func (d *JavaDriver) Fingerprint(cfg *config.Config, node *structs.Node) (bool, error) {
+	// We can always do a fork/exec
+	node.Attributes["driver.java"] = "1"


You can test for this with java -version. I think it's also useful to capture the version name. This might be

java version "1.8.0_51" openjdk version "1.8.0_45-internal"

And whether or not it's oracle java, which I think we can detect from Java(TM) or Hotspot(TM) in the version string.

Derp, totally glossed over this 😱

I updated it in a337df4

cbednarski · 2015-09-02T19:03:47Z

client/driver/java.go

+	select {
+	case <-h.doneCh:
+		return nil
+	case <-time.After(5 * time.Second):


I'm not sure we can fix this in this PR since we don't really have job configs yet (or do we?), but I think this should be configurable to give the app a chance to do cleanup or checkpointing. Curious: How would we pass configuration into this block?

I see your point, but I also think we should be firm here.
I think burden of responding to interrupt and cleaning up should fall on the developer, and I think 5 seconds should be ok, but maybe 10 seconds is more reasonable (more Heroku-isms coming out of me 😄 )

If not, I don't see where else this leads to except for us accepting a "shut down" command string that we exec, and then we're exec'ing arbitrary strings 😱

a chance to do cleanup or checkpointing

Checkpointing for sure. Cleanup though seems like it's a thing happening implicitly with the Driver. For Java, we're using the java command and all that's in a VM. Same with Docker, right?

Unless you're talking about cleanup as in "get things in order before we go away". In which case, I think 10 seconds is still good..

Unless you're talking about cleanup as in "get things in order before we go away". In which case, I think 10 seconds is still good..

Yes exactly. Sorry about the confusing wording. EC2 gives you a 120 second warning. I know that there are use-cases to extend this to a much longer duration (1 hour, for example). Also there are probably different thresholds appropriate for different scenarios:

Out of resources, so pre-empting something (terminate immediately)

Upgrading (wait for currently running app to finish doing its work)

For example, at Riot we might have a game server that needs to be taken down for upgrade or hotfix, but we want to let the current games finish (~1 hour) without booting people out. In the mean time the game server will receive the sigint and will simply not accept new games.

Agreed, I think there are a bunch of future enhancements possible. I don't think allowing an arbitrary drain time will work, since it blocks the ability of the scheduler to make progress. It also can cause priority inversion, if a low priority job refuses to drain and a high priority job cannot be scheduled. Probably 5 second max is fine for now, we can address in 0.2

we can address in 0.2

👍

armon · 2015-09-03T01:43:57Z

@catsby Looking good! I agree with the TODOs, we can probably abstract out some of the helper things like downloading, waiting for PID, etc.

Where the rubber hits the road on a lot of these will be the isolation mechanisms. Those I think we want to put in helper libs and share as much as possible.

Chroot to restrict disk, various controls for CPU etc, and those will improve over time.

cbednarski · 2015-09-03T18:43:17Z

This looks ready to me! We can iterate details but let's mainline this so we can do x-plat testing and such. 👍

Rough spike at adding Java driver

Improve logging on pot create fail and pot destroy failure

github-actions · 2023-05-11T02:10:22Z

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

catsby added 2 commits September 1, 2015 16:56

move test logger to shared file

e7c88f1

Spike at a Java driver, runs Java jars

a835602

cbednarski reviewed Sep 1, 2015
View reviewed changes

driver/java: update Fingerprint for Java driver

a337df4

cbednarski reviewed Sep 2, 2015
View reviewed changes

clean up orphaned java processes

9dbe6c0

catsby mentioned this pull request Sep 3, 2015

Adjustable kill time #17

Closed

catsby added 2 commits September 3, 2015 10:02

driver/java: Don't error if Java isn't found; log if error parsing

02ebe5c

driver/java: use filepath.Join instead of simple +

852edf2

catsby added this to the v0.1 milestone Sep 3, 2015

catsby added the Driver label Sep 3, 2015

catsby added a commit that referenced this pull request Sep 3, 2015

Merge pull request #15 from hashicorp/b-java-driver

ff0383b

Rough spike at adding Java driver

catsby merged commit ff0383b into master Sep 3, 2015

catsby deleted the b-java-driver branch September 3, 2015 19:31

grembo pushed a commit to grembo/nomad that referenced this pull request Jan 30, 2022

Merge pull request hashicorp#15 from trivago/Improve-logging

3a1be66

Improve logging on pot create fail and pot destroy failure

github-actions bot locked as resolved and limited conversation to collaborators May 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rough spike at adding Java driver #15

Rough spike at adding Java driver #15

catsby commented Sep 1, 2015

cbednarski Sep 1, 2015

catsby Sep 2, 2015

cbednarski Sep 2, 2015

catsby Sep 2, 2015

catsby Sep 2, 2015

cbednarski Sep 2, 2015

armon Sep 3, 2015

catsby Sep 3, 2015

armon commented Sep 3, 2015

cbednarski commented Sep 3, 2015

github-actions bot commented May 11, 2023

Rough spike at adding Java driver #15

Rough spike at adding Java driver #15

Conversation

catsby commented Sep 1, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

armon commented Sep 3, 2015

cbednarski commented Sep 3, 2015

github-actions bot commented May 11, 2023