Hi Max Your use case of collecting AWR for 100 Oracle systems is nice use case. This will be AWR Datawarehouse which will be used for Analytics to compare AWR, get health score for each system, predict when system can High CPU those sort of stuff. Please note Hadoop is to scale batch Analytics. If you want Interactive Analytics please check Apache Spark. In Datawarehouse any history data which are meant for archive can be put into Hadoop. This data when needed submit job and Hadoop will take care. HTH Karth Sent from my IPhone > On Oct 3, 2014, at 10:35 AM, max scalf <oracle.blog3@xxxxxxxxx> wrote: > > Hello list, > > > our DBA team(including myself) are exploring Hadoop. Our company(consumer > product goods) is a pretty decent size company (revenues about 18-20 billion) > and we are a big SAP shop here with all big bells and whiles (SAP Hana > running for few modules) but we also want to explore Hadoop as a new option. > > Where we are struggling is, some of the business folks do want to start > putting some data in Hadoop echo system and see what value they get out of it > but DO NOT WANT TO PAY FOR POC. So our department came up with a idea of > doing a small POC(maybe a 3-4 node cluster) with the data we have in hand. > What i mean when i say "data we have in hand" is, awr performance data, > syslogs from unix* server, windows event log on windows server, whatever a > DBA can get there hands on(except the business data)... > > So here i am reaching out to community and see if anyone else have a similar > problem and what they have done about it? Maybe a good use case mining AWR > performance data(from 100's of oracle system) on hadoop or something like > that. > > I understand hadoop is not the answer to replace your traditional RDBMS but > we are just trying to step into that big data world and do some POC, so the > business can also jump in. > > Any thoughts or suggestion or good use cases ??