Hi all,
The scenario which I have is I need to setup a way to stream data to HDInsight hosted on WindowsAzure specifically using Flume.
I have a third party tool which collects website metrics. I want to send these metrics to a flume service on HDInsight. I then want to use Hive to regularly import this data into Hadoop. The third party tool only has the capability to stream to a flume service.
Has anyone had any experience with this kind of setup? I can install Flume on HDInsight, that's not a problem. The biggest problem I have right now is being able to send the data to the flume installation. Specifically I need to know the URI to point to for the Flume Service on HDInsight and I need to be able to setup the port for this.
Can anyone give me any help with this?
Many thanks,