This repository has been archived by the owner on Jan 20, 2022. It is now read-only.
What would we write to? We don't have a good open source batch KV store. We could try HBase or MySQL (the latter more for illustrative purposes).
We can run the Scalding job without having the MySQL or HBase portion
available. For a more complete environment it would be good to have it, but
our VersionedBatchedStore only needs an HDFS file system. It should be
relatively straightforward if we have a source of input data.
A full example with a client, HBase (or MySQL), and everything else would
be interesting. The difficulty with that is mostly around getting a good
data source.
On Mon, Mar 10, 2014 at 12:40 PM, Chris Aniszczyk wrote:
Would an AWS machine image with a full setup perhaps make sense? Access to a real-time and an offline data source would of course be needed. Twitter's filtered streaming API seems like a good fit: https://dev.twitter.com/docs/api/1.1/post/statuses/filter
Is anyone working on this issue yet?
snoble pushed a commit to snoble/summingbird that referenced this issue on Sep 8, 2017
Show how to launch the whole job on a Hadoop cluster.
Take a look at:
https://github.com/twitter/summingbird/blob/develop/summingbird-scalding/src/main/scala/com/twitter/summingbird/scalding/Executor.scala
Which will need one of these:
https://github.com/twitter/summingbird/blob/develop/summingbird-batch-hadoop/src/main/scala/com/twitter/summingbird/batch/state/HDFSState.scala
Here is an example of a configured job for Storm:
https://github.com/twitter/summingbird/blob/develop/summingbird-example/src/main/scala/com/twitter/summingbird/example/StormRunner.scala