
[BEAM-5] Add Flink Runner #12

Closed · wants to merge 149 commits
4e56e53
Initial commit
StephanEwen Jan 28, 2015
38c0c97
Update .gitignore
StephanEwen Jan 28, 2015
3dccdb6
Initial stubs
StephanEwen Feb 11, 2015
2309ab3
change to latest dataflow sdk version
mxm Feb 12, 2015
a286e1a
Parallel Do Function
mxm Feb 15, 2015
92dd104
GroupByKeyOnly
mxm Feb 15, 2015
e8e7099
change flink version to 0.8.0
mxm Feb 16, 2015
8000257
working WordCount and TFIDF
mxm Feb 16, 2015
ce0137b
Add WordCountITCase
aljoscha Feb 18, 2015
9a03b08
Rename Flink Runner to FlinkPipelineRunner, make it obey options
aljoscha Feb 19, 2015
a066ea8
Move type related stuff to types package. Add Javadoc
aljoscha Feb 19, 2015
5ba0c63
Rename utils to wrappers, add Javadoc
aljoscha Feb 19, 2015
0b721ea
Rename package example to examples
aljoscha Feb 19, 2015
4ea0276
Move Flink Functions into package functions, normalize naming scheme
aljoscha Feb 19, 2015
3ff4c88
Add comments for missing Flink Translators, some cleanup in translators
aljoscha Feb 19, 2015
9bae046
Add warnings for features not yet implemented
aljoscha Feb 19, 2015
e5ad983
Normalize DataSet names in FlinkTransformTranslators
aljoscha Feb 19, 2015
a89afc7
Fix bug in TextIO.Read translation, parallelism not correctly set
aljoscha Feb 19, 2015
c81c1fd
Add sideOutput support and ConsoleIO.Write transform
aljoscha Feb 19, 2015
6b2c725
Add translation for Combine.perKey
aljoscha Feb 19, 2015
b10377c
Add TfIdfITCase
aljoscha Feb 19, 2015
b79d28f
Add RemoveDuplicates ITCases
aljoscha Feb 19, 2015
e0a90e3
Change Create translation to serialize to byte arrays
aljoscha Feb 19, 2015
f195639
Add JoinExamplesITCase
aljoscha Feb 19, 2015
f7ab4c4
Add non-working (for-now) WikipediaTopSessionsITCase
aljoscha Feb 19, 2015
8a00a1f
Properly serialize/deserialize PipelineOptions from/to JSON
aljoscha Feb 20, 2015
fd8b78c
Add InputFormat wrapper for Source, along with ITCase
aljoscha Feb 20, 2015
769c9ef
Add Avro support with ITCase
aljoscha Feb 20, 2015
f510147
removed unused textPath variable
mxm Feb 19, 2015
32adfb5
run example with FlinkLocalRunner
mxm Feb 18, 2015
950925c
parse tree: print transform name
mxm Feb 19, 2015
bd9c1dd
add WordCountJoin test (coGroup)
mxm Feb 19, 2015
f60aaa9
add WordCountJoin3Test (three dimensional reduce)
mxm Feb 19, 2015
548386e
optimize CoGroupByKey by using Flink's coGroup
mxm Feb 20, 2015
5192fa2
use only one switch for translating composite transforms
mxm Feb 22, 2015
2f23d22
implement CoderComperator for Flink's coGroup
mxm Feb 23, 2015
ab9a4a6
coGroup optimization: disable until pull request is resolved
mxm Feb 23, 2015
b1c701b
Change version to 1-SNAPSHOT
aljoscha Feb 23, 2015
5e20012
Update Javadoc on CoderComparator and KvCoderComparator
aljoscha Feb 23, 2015
02e1b61
Change Serializers and Comparators to reuse byte arrays
aljoscha Feb 23, 2015
6632ee3
Add warning about KeyedState in DoFn, which is not supported
aljoscha Feb 24, 2015
7426272
Rename WordCount Example, Use Mavne Shade to bundle
aljoscha Feb 23, 2015
2e8d57d
Fix CoderComparators to correctly compare key lengths
aljoscha Feb 26, 2015
12fb543
Add dependency-reduced-pom.xml to .gitignore
aljoscha Feb 26, 2015
b7e4907
Correctly forward EOFExceptions from CoderTypeSerializer
aljoscha Feb 26, 2015
d5ac624
Fix mismatch between DataInput and InputStream in DataInputViewWrapper
aljoscha Feb 26, 2015
83177ad
Add Fake Grouping Key for KV GroupByKey translation
aljoscha Feb 26, 2015
61238d3
Change Flink Grouping Reducer to not materialize values
aljoscha Feb 26, 2015
7b50236
Add Partial Reduce Translation using MapPartition
aljoscha Feb 26, 2015
3a9115e
Move to Flink 0.9-SNAPSHOT add Combine based on MapPartition
aljoscha Feb 26, 2015
2e53371
Rename CoGroupKeyedListAggregator to match Naming Scheme
aljoscha Feb 27, 2015
cf50869
enable CoGroupByKey translator which was disabled due to pending pull…
mxm Mar 4, 2015
ccbf409
integrate now available Google Dataflow classes
mxm Mar 9, 2015
a7e6626
integrate partialGroupReduce operator to properly optimize Combine.pe…
mxm Mar 9, 2015
fba7b27
fix bug that would incorrectly match elements in the coGroup
mxm Mar 9, 2015
2a6b19b
implement a rudimentary normalized key comparator
mxm Mar 10, 2015
8820c09
Optimize CoderSerializer and CoderComparator
aljoscha Mar 2, 2015
555f06f
fix buffer overflow
mxm Mar 11, 2015
e0481e0
add README file
mxm Mar 11, 2015
254cf2a
add license headers
mxm Mar 11, 2015
caa2baa
change name of Flink's new GroupCombine operator
mxm Mar 13, 2015
6af613d
properly integrate Flink's new GroupCombine operator
mxm Mar 23, 2015
d6a728d
README: remove install Flink part due to availability of maven depend…
mxm Mar 23, 2015
40f0222
fix null value issue in Create translator
mxm Mar 23, 2015
d470f96
KvCoderComperator: set the offset correctly
mxm Mar 23, 2015
11e08c0
KvCoderComperator: set the correct context for the encoding
mxm Mar 23, 2015
402d0ee
Maven: remove shade plugin
mxm Mar 23, 2015
eea0dde
improve execution of WordCount example
mxm Mar 24, 2015
dee9923
README: add additional note about cluster setup
mxm Mar 24, 2015
8a9eb97
Flink master changes: rename FlatCombineFunction to GroupCombineFunction
mxm Mar 25, 2015
6d25f9c
adapt Flink Dataflow to run with the latest DataflowJavaSDK
mxm Apr 7, 2015
ed9b10e
instead of using LATEST, pin to a version of the DataflowJavaSDK
mxm Apr 7, 2015
0d12419
Pinned version for SDK too
mbalassi Apr 9, 2015
a932970
Updated Aggregator wrappers to match new Flink Accumulator behaviour
mbalassi Apr 20, 2015
f0097cc
adapt to the latest Flink master
mxm Jun 4, 2015
1f4d2f5
[streaming] add a hint to the streaming branch in the readme
mxm Jun 4, 2015
19ddb2d
add a Travis config file to enable continuous integration
mxm Jun 4, 2015
0a9a776
travis: modify config file to adjust jdk version and test command
mxm Jun 4, 2015
79bf260
travis: remove openjdk8 since it is not supported
mxm Jun 4, 2015
b4092a7
bump Flink version to 0.9.1
mxm Oct 1, 2015
fbbba86
change version number to 0.1
mxm Oct 9, 2015
fc2a25a
remove unused serializer
mxm Oct 11, 2015
b8dcf47
update to Dataflow SDK 1.0.0
mxm Oct 5, 2015
a263080
fix FlinkRunnerResult for new PipelineResult
mxm Oct 5, 2015
72cb4f1
fix UnionCoder
mxm Oct 5, 2015
defc7c9
adapt to 1.0.0 changes (handle AppliedPTransform)
mxm Oct 8, 2015
893ffde
fix renaming of ReadSource to Read.Bounded
mxm Oct 8, 2015
d467af4
adapt to the API changes: get types from context
mxm Oct 8, 2015
abcc06c
adapt to new windowing semantics
mxm Oct 8, 2015
df3a347
update ConsoleIO
mxm Oct 8, 2015
68465f5
remove useless job
mxm Oct 14, 2015
3b4ef97
fix FlinkPipelineOptions
mxm Oct 15, 2015
b2c85e6
fix aggregators
mxm Oct 15, 2015
ad30313
fix accumulators
mxm Oct 15, 2015
4a7de62
unify transform input and output getters/setters
mxm Oct 15, 2015
cdae3d0
re-enable and fix Read.Bounded
mxm Oct 15, 2015
758da40
fix AvroITCase
mxm Oct 15, 2015
add5f3e
re-enable TfIdfITCase
mxm Oct 15, 2015
b43dc70
update UnionCoder
mxm Oct 16, 2015
9ff4bab
properly translate nested composite transforms
mxm Oct 16, 2015
4aa4081
translate composite transform GroupByKey
mxm Oct 16, 2015
d1d7407
fix WordCount IT cases
mxm Oct 16, 2015
601d1b6
fix WordCount example program
mxm Oct 16, 2015
5874990
add missing interface methods for DoFn wrapper
mxm Oct 16, 2015
2a3612d
fix TfIdf example
mxm Oct 16, 2015
8d9ca54
add SideInputITCase
mxm Oct 16, 2015
edaa34b
add MayBeEmptyITCase
mxm Oct 16, 2015
dc9df38
fix WordCount comment
mxm Oct 16, 2015
1d8462a
use correct output type
mxm Oct 16, 2015
2dbfd89
disable Wikipedia test (for now)
mxm Oct 16, 2015
c9404ea
move these examples to test and convert them to test cases eventually
mxm Oct 16, 2015
aa1a933
move JoinExamples to util
mxm Nov 4, 2015
d5881a5
created it case Flattenize
mxm Nov 4, 2015
189ae44
optimize imports
mxm Nov 4, 2015
c36ca14
create ParDoMultiOutput IT case
mxm Nov 4, 2015
2e563f0
port to Flink 0.10.0
mxm Nov 16, 2015
35f104f
Enforce Java >= 1.7, Optimize Imports, Java 7 code compatibility
smarthi Dec 23, 2015
4f13955
make Avro type extraction more explicit
mxm Dec 24, 2015
03dfb1d
[maven] correct license and formatting
mxm Jan 15, 2016
3947750
[tests] refactor ReadSourceITCase
mxm Jan 18, 2016
49d295b
[runner] add translator for Write.Bound for custom sinks
mxm Jan 18, 2016
0d68bb2
[sink] generate unique id for writer initialization
mxm Jan 19, 2016
9d9ccf5
[readme] add a section on how to submit cluster programs
mxm Jan 19, 2016
aac96d7
[readme] add hint on how to submit jar to cluster
mxm Jan 19, 2016
d4a651a
[runner] add streaming support with checkpointing
kl0u Dec 9, 2015
9bfdea2
[readme] add hint that streaming support is available
mxm Jan 20, 2016
8057fc2
[readme] update to reflect the current state
mxm Feb 11, 2016
9e98022
[cleanup] various small improvements
smarthi Feb 11, 2016
b168010
[cleanup] remove obsolete code
mxm Feb 17, 2016
0628bf6
[tests] add streaming mode to TestPipeline
mxm Feb 22, 2016
620e13b
[runner] add Create transform
mxm Feb 22, 2016
7283d7a
[tests] integrate Wikipedia session test
mxm Feb 23, 2016
46c5158
Rearranging the code and renaming certain classes.
kl0u Feb 29, 2016
0cb6deb
Adds javadocs.
kl0u Feb 29, 2016
13e0276
Fixes Void handling
kl0u Feb 29, 2016
3c3a661
Track dataflow 1.0 -> 1.5 changes
Feb 24, 2016
cb3d030
[runner] some small fixes for 1.5
mxm Mar 2, 2016
699562d
Fixes the GroupAlsoByWindowTest.
kl0u Mar 2, 2016
e0b6782
[travis] install snapshot version of SDK before running CI
mxm Mar 3, 2016
a587617
[tests] suppress unnecessary log output
mxm Mar 3, 2016
07ab111
[maven] add project for Runners
mxm Mar 1, 2016
a1bd52b
[flink] adjust root pom.xml to Beam
mxm Mar 4, 2016
0bb73db
[flink] convert tabs to 2 spaces
mxm Mar 2, 2016
6c29910
[flink] change package namespace
mxm Mar 2, 2016
25f44ff
[flink] adjust directories according to package name
mxm Mar 2, 2016
071347e
[flink] update license headers
mxm Mar 2, 2016
4de42e4
[flink] update README
mxm Mar 2, 2016
6747817
[flink] [cleanup] delete .gitignore
mxm Mar 4, 2016
460b58f
[travis] go through all Maven phases (except deploy)
mxm Mar 4, 2016
2 changes: 1 addition & 1 deletion .travis.yml
@@ -31,5 +31,5 @@ install:

 script:
   - travis_retry mvn versions:set -DnewVersion=manual_build
-  - travis_retry mvn $MAVEN_OVERRIDE install -U
+  - travis_retry mvn $MAVEN_OVERRIDE verify -U
   - travis_retry travis/test_wordcount.sh
1 change: 1 addition & 0 deletions pom.xml
@@ -91,6 +91,7 @@
   <packaging>pom</packaging>
   <modules>
     <module>sdk</module>
+    <module>runners</module>
     <module>examples</module>
     <module>maven-archetypes/starter</module>
     <module>maven-archetypes/examples</module>
202 changes: 202 additions & 0 deletions runners/flink/README.md
@@ -0,0 +1,202 @@
Flink Beam Runner (Flink-Runner)
--------------------------------

Flink-Runner is a Runner for Apache Beam which enables you to
run Beam dataflows with Flink. It integrates seamlessly with the Beam
API, allowing you to execute Apache Beam programs in streaming or batch mode.

## Streaming

### Full Beam Windowing and Triggering Semantics

The Flink Beam Runner supports *Event Time*, allowing you to analyze data with respect to its
associated timestamp. It handles out-of-order and late-arriving elements. You may leverage the full
power of the Beam windowing semantics, such as *time-based*, *sliding*, *tumbling*, or *count*
windows. You may also build *session* windows, which allow you to keep track of events associated
with each other.
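To illustrate, the windowing semantics above are expressed through the SDK's `Window` transform. The following is a minimal sketch, assuming a `PCollection<String> words` built elsewhere in your pipeline and the Dataflow SDK windowing classes; the variable names are illustrative:

```java
import org.joda.time.Duration;

import com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows;
import com.google.cloud.dataflow.sdk.transforms.windowing.Sessions;
import com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows;
import com.google.cloud.dataflow.sdk.transforms.windowing.Window;
import com.google.cloud.dataflow.sdk.values.PCollection;

// Tumbling windows of one minute, based on each element's event-time timestamp:
PCollection<String> windowed =
    words.apply(Window.<String>into(FixedWindows.of(Duration.standardMinutes(1))));

// Sliding windows: one-minute windows, starting every ten seconds:
words.apply(Window.<String>into(
    SlidingWindows.of(Duration.standardMinutes(1)).every(Duration.standardSeconds(10))));

// Session windows: group elements separated by gaps of at least ten minutes:
words.apply(Window.<String>into(Sessions.withGapDuration(Duration.standardMinutes(10))));
```

The same pipeline code runs unchanged whether the Flink runner executes it in batch or streaming mode.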

### Fault-Tolerance

The program's state is persisted by Apache Flink. You may re-run and resume your program after a
failure or if you decide to continue the computation at a later time.

### Sources and Sinks

Build your own data ingestion or output using the source/sink interface. Re-use Flink's sources
and sinks, or use the provided support for Apache Kafka.

### Seamless integration

To execute a Beam program in streaming mode, just enable streaming in the `PipelineOptions`:

options.setStreaming(true);

That's it. If you prefer batch execution, simply disable streaming mode.

## Batch

### Batch optimization

Flink gives you out-of-core algorithms which operate on its managed memory to perform sorting,
caching, and hash table operations. We have optimized operations like CoGroup to use Flink's
optimized out-of-core implementation.

### Fault-Tolerance

We guarantee job-level fault-tolerance which gracefully restarts failed batch jobs.

### Sources and Sinks

Build your own data ingestion or output using the source/sink interface, or re-use Flink's sources
and sinks.

## Features

The Flink Beam Runner maintains as much compatibility with the Beam API as possible. We
support transformations on data such as:

- Grouping
- Windowing
- ParDo
- CoGroup
- Flatten
- Combine
- Side inputs/outputs
- Encoding
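
As a sketch of what these transformations look like in practice, the following hypothetical fragment applies a `ParDo` using the Dataflow SDK's `DoFn` API; the class name, regular expression, and the assumed `PCollection<String> lines` input are illustrative, not part of the runner itself:

```java
import com.google.cloud.dataflow.sdk.transforms.DoFn;
import com.google.cloud.dataflow.sdk.transforms.ParDo;
import com.google.cloud.dataflow.sdk.values.PCollection;

// Tokenize lines into words; the Flink runner executes this
// as a FlatMap-style operation on a Flink DataSet or DataStream.
static class ExtractWordsFn extends DoFn<String, String> {
  @Override
  public void processElement(ProcessContext c) {
    for (String word : c.element().split("[^a-zA-Z']+")) {
      if (!word.isEmpty()) {
        c.output(word);
      }
    }
  }
}

// Applied to a PCollection<String> of input lines:
PCollection<String> words = lines.apply(ParDo.of(new ExtractWordsFn()));
```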

# Getting Started

To get started using the Flink Runner, we first need to install the latest version.

## Install Flink-Runner ##

To retrieve the latest version of Flink-Runner, run the following command:

git clone https://github.com/apache/incubator-beam

Then switch to the newly created directory and run Maven to build the Beam runner:

cd incubator-beam
mvn clean install -DskipTests

Flink-Runner is now installed in your local Maven repository.

## Executing an example

Next, let's run the classic WordCount example. It is semantically identical to
the example provided with Apache Beam, only this time we choose the
`FlinkPipelineRunner` to execute the WordCount on top of Flink.

Here's an excerpt from the WordCount class file:

```java
Options options = PipelineOptionsFactory.fromArgs(args).as(Options.class);
// yes, we want to run WordCount with Flink
options.setRunner(FlinkPipelineRunner.class);

Pipeline p = Pipeline.create(options);

p.apply(TextIO.Read.named("ReadLines").from(options.getInput()))
.apply(new CountWords())
.apply(TextIO.Write.named("WriteCounts")
.to(options.getOutput())
.withNumShards(options.getNumShards()));

p.run();
```

To execute the example, let's first get some sample data:

curl http://www.gutenberg.org/cache/epub/1128/pg1128.txt > kinglear.txt

Then let's run the included WordCount locally on your machine:

mvn exec:exec -Dinput=kinglear.txt -Doutput=wordcounts.txt

Congratulations, you have run your first Apache Beam program on top of Apache Flink!


# Running Beam programs on a Flink cluster

You can run your Beam program on an Apache Flink cluster. Please start off by creating a new
Maven project.

mvn archetype:generate -DgroupId=com.mycompany.beam -DartifactId=beam-test \
-DarchetypeArtifactId=maven-archetype-quickstart -DinteractiveMode=false

The contents of the root `pom.xml` should be slightly changed afterwards (explanation below):

```xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>

<groupId>com.mycompany.beam</groupId>
<artifactId>beam-test</artifactId>
<version>1.0</version>

<dependencies>
<dependency>
<groupId>org.apache.beam</groupId>
<artifactId>flink-runner</artifactId>
<version>0.2</version>
</dependency>
</dependencies>

<build>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-shade-plugin</artifactId>
<version>2.4.1</version>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>shade</goal>
</goals>
<configuration>
<transformers>
<transformer implementation="org.apache.maven.plugins.shade.resource.ManifestResourceTransformer">
<mainClass>WordCount</mainClass>
</transformer>
</transformers>
<artifactSet>
<excludes>
<exclude>org.apache.flink:*</exclude>
</excludes>
</artifactSet>
</configuration>
</execution>
</executions>
</plugin>

</plugins>

</build>

</project>
```

The following changes have been made:

1. The Flink Beam Runner was added as a dependency.

2. The Maven Shade plugin was added to build a fat jar.

A fat jar is necessary if you want to submit your Beam code to a Flink cluster. The fat jar
includes not only your program code but also the Beam code that is required at runtime. Note that
this step is necessary because the Beam Runner is not part of Flink.

You can then build the jar using `mvn clean package`. Then submit the fat jar in the `target`
folder to the Flink cluster using Flink's command-line utility like so:

./bin/flink run /path/to/fat.jar


# More

For more information, please visit the [Apache Flink website](http://flink.apache.org) or contact
the [mailing lists](http://flink.apache.org/community.html#mailing-lists).