From obrien.andrew at gmail.com Tue Jun 5 09:43:13 2007 From: obrien.andrew at gmail.com (Andrew O'Brien) Date: Tue, 5 Jun 2007 09:43:13 -0400 Subject: [Activewarehouse-discuss] Best Datawarehouse book? Message-ID: I was wondering: what's the best book for learning the datawarehouse concepts? I know the normal transactional stuff about as well as any other RoR/former PHP developer, so that's where I'm starting from. I noticed The Data Warehouse Toolkit by Kimball & Ross is on the project page. Is this still the best recommendation? Thanks, Andrew From anthonyeden at gmail.com Tue Jun 5 09:48:10 2007 From: anthonyeden at gmail.com (Anthony Eden) Date: Tue, 5 Jun 2007 09:48:10 -0400 Subject: [Activewarehouse-discuss] Best Datawarehouse book? In-Reply-To: References: Message-ID: I think it is the best book for understanding data warehouse concepts (which is why I put it on the project page). It is structured as a series of case studies and puts each concept into the context of a hypothetical project, making it fairly easy to grok. My copy is well-worn. :-) V/r Anthony On 6/5/07, Andrew O'Brien wrote: > I was wondering: what's the best book for learning the datawarehouse > concepts? I know the normal transactional stuff about as well as any > other RoR/former PHP developer, so that's where I'm starting from. > > I noticed The Data Warehouse Toolkit by Kimball & Ross is on the > project page. Is this still the best recommendation? > > Thanks, > Andrew > _______________________________________________ > Activewarehouse-discuss mailing list > Activewarehouse-discuss at rubyforge.org > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > -- Cell: 808 782-5046 Current Location: Melbourne, FL From obrien.andrew at gmail.com Tue Jun 5 10:01:39 2007 From: obrien.andrew at gmail.com (Andrew O'Brien) Date: Tue, 5 Jun 2007 10:01:39 -0400 Subject: [Activewarehouse-discuss] Best Datawarehouse book? In-Reply-To: References: Message-ID: Excellent -- just wanted to make sure that was the best of the best before I made the plunge. Looking forward to warehousing the heck out of my data. -Andrew On 6/5/07, Anthony Eden wrote: > I think it is the best book for understanding data warehouse concepts > (which is why I put it on the project page). It is structured as a > series of case studies and puts each concept into the context of a > hypothetical project, making it fairly easy to grok. My copy is > well-worn. :-) > > V/r > Anthony > > On 6/5/07, Andrew O'Brien wrote: > > I was wondering: what's the best book for learning the datawarehouse > > concepts? I know the normal transactional stuff about as well as any > > other RoR/former PHP developer, so that's where I'm starting from. > > > > I noticed The Data Warehouse Toolkit by Kimball & Ross is on the > > project page. Is this still the best recommendation? > > > > Thanks, > > Andrew > > _______________________________________________ > > Activewarehouse-discuss mailing list > > Activewarehouse-discuss at rubyforge.org > > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > > > > > -- > Cell: 808 782-5046 > Current Location: Melbourne, FL > From kbedell at courseadvisor.com Tue Jun 5 10:05:11 2007 From: kbedell at courseadvisor.com (Kevin Bedell) Date: Tue, 05 Jun 2007 10:05:11 -0400 Subject: [Activewarehouse-discuss] Best Datawarehouse book? In-Reply-To: Message-ID: I'm on my second copy of the Kimball's DW Toolkit book. I also recommend Kimball's DW ETL Toolkit book - it's a similar book but focused on ETL issues. On 6/5/07 10:01 AM, "Andrew O'Brien" wrote: > Excellent -- just wanted to make sure that was the best of the best > before I made the plunge. Looking forward to warehousing the heck out > of my data. > > -Andrew > > On 6/5/07, Anthony Eden wrote: >> I think it is the best book for understanding data warehouse concepts >> (which is why I put it on the project page). It is structured as a >> series of case studies and puts each concept into the context of a >> hypothetical project, making it fairly easy to grok. My copy is >> well-worn. :-) >> >> V/r >> Anthony >> >> On 6/5/07, Andrew O'Brien wrote: >>> I was wondering: what's the best book for learning the datawarehouse >>> concepts? I know the normal transactional stuff about as well as any >>> other RoR/former PHP developer, so that's where I'm starting from. >>> >>> I noticed The Data Warehouse Toolkit by Kimball & Ross is on the >>> project page. Is this still the best recommendation? >>> >>> Thanks, >>> Andrew >>> _______________________________________________ >>> Activewarehouse-discuss mailing list >>> Activewarehouse-discuss at rubyforge.org >>> http://rubyforge.org/mailman/listinfo/activewarehouse-discuss >>> >> >> >> -- >> Cell: 808 782-5046 >> Current Location: Melbourne, FL >> > _______________________________________________ > Activewarehouse-discuss mailing list > Activewarehouse-discuss at rubyforge.org > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > > > > From anthonyeden at gmail.com Tue Jun 5 18:48:33 2007 From: anthonyeden at gmail.com (Anthony Eden) Date: Tue, 5 Jun 2007 18:48:33 -0400 Subject: [Activewarehouse-discuss] Managing / partitioning facts tables In-Reply-To: References: Message-ID: Harry, For partitioning I would recommend using your database's native partitioning mechanism if possible. The best performance is achieved this way. ActiveWarehouse does not define any sort of database-independent partitioning mechanisms. As for aggregation, you may choose to do some aggregation during the ETL process or you may choose aggregation until runtime. Currently ActiveWarehouse has a pluggable aggregate system but really doesn't have a functioning aggregation mechanism (outside of just using the NoAggregate to go straight to the facts and dimensions). We are working on other aggregation mechanisms like an improved ROLAP aggregate and the Dwarf aggregate, but I can't say when those will be ready for use. The approach I would recommend right now is using AW to build your facts and dimensions, perhaps build some fact tables that preaggregate some of the page view data, and then see how it performs. If the performance is not up to snuff then you can build additional aggregates and query those to build your reports. If you build a good dimensional model though you will be able to benefit from future aggregation schemes as they become part of AW. V/r Anthony Eden On 6/5/07, Harry Fuecks wrote: > Hi Anthony, > > Currently getting a feel for what activewarehouse does; have been > through all docs / information you've put online but no further yet. > > What I'm trying to understand / haven't seen so far, is how fact > tables are / should be managed - stuff like partitioning etc. when > you're dealing with "lots of data" - let's say 30million plus rows in > a single facts table, for example. > > Probably best explain this with my use case - interested in using it > for web analytics - as well as the usual suspects (page views etc.) > need to be able to run ad hoc queries, from time to time. What > approach should I be taking with activewarehouse to storing hits in a > fact table, given that I might need over a years worth of data ("How > are be doing compared to last year") available? > > Also, when does aggregation actually take place? For example is it > trigger the first time a user requests a specific result or generated > in advance by some kind of batch process? > > Many thanks, > > Harry > -- Cell: 808 782-5046 Current Location: Melbourne, FL From pablo.marrero at gmail.com Wed Jun 6 10:46:44 2007 From: pablo.marrero at gmail.com (Pablo Marrero) Date: Wed, 6 Jun 2007 11:46:44 -0300 Subject: [Activewarehouse-discuss] More information... Message-ID: Hello, my name is Pablo Marrero from Montevideo-Uruguay I'm in the Data Warehousing field and in this moment I'm writing a study of different possibilities for implementing Data Warehouses, and I found the ActiveWarehouse and ActiveWarehouse-etl plugins. I'm very interested in these projects, but I can't find much information. I need information about: 1.- A complete example of use ActiveWarehouse-etl (i read the manual http://activewarehouse.rubyforge.org/docs/activewarehouse-etl.html) 2.- Documentation about the use of different kinds of reports in ActiveWarehouse Can anyone help me? Thank you in advance Pablo -- Pablo Marrero Tel: 099 927 566 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforge.org/pipermail/activewarehouse-discuss/attachments/20070606/5721d47b/attachment.html From blake at millionsofus.com Wed Jun 6 12:15:40 2007 From: blake at millionsofus.com (Blake Mizerany) Date: Wed, 6 Jun 2007 09:15:40 -0700 Subject: [Activewarehouse-discuss] (no subject) Message-ID: <8A74DD47-7D33-485D-8F99-D2C7C6E4F151@millionsofus.com> When running ctl file the etl it tells me I need a jobs table. I noticed the gem contains a migration for this but the rails plugin does not. I'm sure I have the latest version of everything since I just install it all last night. Is there something I missing that doesn't require a manual run of the jobs migrations through the console? Blake Mizerany http://millionsofus.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforge.org/pipermail/activewarehouse-discuss/attachments/20070606/66abedda/attachment.html From anthonyeden at gmail.com Wed Jun 6 12:52:53 2007 From: anthonyeden at gmail.com (Anthony Eden) Date: Wed, 6 Jun 2007 12:52:53 -0400 Subject: [Activewarehouse-discuss] More information... In-Reply-To: References: Message-ID: Currently the best working example is in the rails_warehouse directory in the subversion repository. This is the code that I used in my original tutorial (although the code has been updated since). It only demonstrates a very small amount of functionality though. As for the reports in ActiveWarehouse, right now there is only support for tabular reports, and only tabular reports with one dimension for columns and one for rows. This is an area which would benefit from some attention, so if you are interested feel free to start digging through the code and building new report helpers. V/r Anthony On 6/6/07, Pablo Marrero wrote: > Hello, my name is Pablo Marrero from Montevideo-Uruguay > I'm in the Data Warehousing field and in this moment I'm writing a study of > different possibilities for implementing > Data Warehouses, and I found the ActiveWarehouse and ActiveWarehouse-etl > plugins. > I'm very interested in these projects, but I can't find much information. > I need information about: > 1.- A complete example of use ActiveWarehouse-etl (i read the manual > http://activewarehouse.rubyforge.org/docs/activewarehouse-etl.html) > 2.- Documentation about the use of different kinds of reports in > ActiveWarehouse > > Can anyone help me? > > Thank you in advance > Pablo > > > -- > Pablo Marrero > Tel: 099 927 566 > _______________________________________________ > Activewarehouse-discuss mailing list > Activewarehouse-discuss at rubyforge.org > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > > -- Cell: 808 782-5046 Current Location: Melbourne, FL From anthonyeden at gmail.com Wed Jun 6 13:25:38 2007 From: anthonyeden at gmail.com (Anthony Eden) Date: Wed, 6 Jun 2007 13:25:38 -0400 Subject: [Activewarehouse-discuss] (no subject) In-Reply-To: <8A74DD47-7D33-485D-8F99-D2C7C6E4F151@millionsofus.com> References: <8A74DD47-7D33-485D-8F99-D2C7C6E4F151@millionsofus.com> Message-ID: You need to have a database.yml file in the directory with your ETL control files that defines where the etl execution database is, and that database must exist. I had left this out of the online docs, but I just added it in: http://activewarehouse.rubyforge.org/docs/activewarehouse-etl.html#setting_up The migrations will automatically be run the first time you run the etl application. HTH. If you still have issues please let me know. V/r Anthony Eden On 6/6/07, Blake Mizerany wrote: > > When running ctl file the etl it tells me I need a jobs table. I noticed > the gem contains a migration for this but the rails plugin does not. I'm > sure I have the latest version of everything since I just install it all > last night. Is there something I missing that doesn't require a manual run > of the jobs migrations through the console? > > > Blake Mizerany > http://millionsofus.com > > _______________________________________________ > Activewarehouse-discuss mailing list > Activewarehouse-discuss at rubyforge.org > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > > -- Cell: 808 782-5046 Current Location: Melbourne, FL From obrien.andrew at gmail.com Mon Jun 11 15:37:01 2007 From: obrien.andrew at gmail.com (Andrew O'Brien) Date: Mon, 11 Jun 2007 15:37:01 -0400 Subject: [Activewarehouse-discuss] Error with demo app Message-ID: Hi, I downloaded the example application from the repository and I get this error pretty frequently (e.g. rake migrate, ruby db/setup.rb, actions that generate reports, etc). I commented out the line and the same thing happened, but with the "precision" attribute. After commenting both lines out, things seem to work, but I'm guessing sooner or later AW's going to find a nil where it didn't want one and get mad at me. Using AdapterExtensions ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/aggregate_field.rb:17:in `initialize': undefined method `scale' for # (NoMethodError) from ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/fact.rb:123:in `aggregate' from db/../config/../app/models/pos_retail_sales_transaction_fact.rb:2 from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:140:in `load' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:56:in `require_or_load' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:30:in `depend_on' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:85:in `require_dependency' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:98:in `const_missing' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:131:in `const_missing' from ./db/setup/pos_retail_sales_transaction.rb:16 from /usr/local/lib/ruby/site_ruby/1.8/rubygems/custom_require.rb:27:in `require' from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:147:in `require' from db/setup.rb:19 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforge.org/pipermail/activewarehouse-discuss/attachments/20070611/3e6f0e8d/attachment-0001.html From matthew.d.williams at gmail.com Wed Jun 13 07:44:56 2007 From: matthew.d.williams at gmail.com (Matthew Williams) Date: Wed, 13 Jun 2007 07:44:56 -0400 Subject: [Activewarehouse-discuss] Quick introduction Message-ID: <4c8cdcb30706130444n553219fdrb3c9379ae0897625@mail.gmail.com> Good morning, Just wanted to give a brief introduction since I joined the mailing list. Last night my brother (Chris) met up with Anthony, I was tied up in a previous engagement and unable to make it however I believe we're going to try to meet up in July at some point. I've been out of school for a year now and working for a defense contractor in Melbourne, I'm also in the process of getting my Masters in Software Engineering from FIT. Right now I'm working mostly with Perl, Java and Javascripti (integrating a lot of the great frameworks into some web applications I support). We work in a pretty limited development environment so implementing Ruby into an application is pretty much out of the question, however, with the recent 1.0 release of JRuby I hope to take advantage of in one way or another with some of the Java development I do. Resume: http://matthewwilliams.emurse.com/ I look forward to keeping an eye out on the mailing list and perhaps making some contributions in the future. Take care, -Matt Williams ps. I'm currently taking a Software Testing course as part of my Masters program, I think as the class progresses that we'll be asked to join an open source project to get involved, test and provide feedback; I think I may have just found a project! From anthonyeden at gmail.com Fri Jun 15 08:49:15 2007 From: anthonyeden at gmail.com (Anthony Eden) Date: Fri, 15 Jun 2007 08:49:15 -0400 Subject: [Activewarehouse-discuss] Error with demo app In-Reply-To: References: Message-ID: What version of ActiveRecord are you using? What version of MySQL and which MySQL driver? Thanks. V/r Anthony Eden On 6/11/07, Andrew O'Brien wrote: > > Hi, > > I downloaded the example application from the repository and I get this > error pretty frequently (e.g. rake migrate, ruby db/setup.rb, actions that > generate reports, etc). I commented out the line and the same thing > happened, but with the "precision" attribute. > > After commenting both lines out, things seem to work, but I'm guessing > sooner or later AW's going to find a nil where it didn't want one and get > mad at me. > > Using AdapterExtensions > ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/aggregate_field.rb:17:in > `initialize': undefined method `scale' for > # (NoMethodError) > from > ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/fact.rb:123:in > `aggregate' > from > db/../config/../app/models/pos_retail_sales_transaction_fact.rb:2 > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:140:in > `load' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:56:in > `require_or_load' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:30:in > `depend_on' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:85:in > `require_dependency' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:98:in > `const_missing' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport-1.3.1/lib/active_support/dependencies.rb:131:in > `const_missing' > from ./db/setup/pos_retail_sales_transaction.rb:16 > from > /usr/local/lib/ruby/site_ruby/1.8/rubygems/custom_require.rb:27:in `require' > from /usr/local/lib/ruby/gems/1.8/gems/activesupport- 1.3.1/lib/active_support/dependencies.rb:147:in > `require' > from db/setup.rb:19 > > > > _______________________________________________ > Activewarehouse-discuss mailing list > Activewarehouse-discuss at rubyforge.org > http://rubyforge.org/mailman/listinfo/activewarehouse-discuss > > -- Cell: 808 782-5046 Current Location: Melbourne, FL -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforge.org/pipermail/activewarehouse-discuss/attachments/20070615/cdccb8a3/attachment.html From alexkwolfe at gmail.com Fri Jun 22 11:01:15 2007 From: alexkwolfe at gmail.com (Alex Wolfe) Date: Fri, 22 Jun 2007 10:01:15 -0500 Subject: [Activewarehouse-discuss] Error with demo app Message-ID: I'm gettng the same error as Andrew O'Brien: Using AdapterExtensions ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/ aggregate_field.rb:17:in `initialize': undefined method `scale' for # (NoMethodError) from ./db/../config/../vendor/plugins/activewarehouse/lib/active_warehouse/ fact.rb:123:in I'm on Rails 1.2.3 (ActiveRecord 1.15.3), Mac OS X with MySQL 5.0.21. In ActiveRecord 1.15.3 scale (and precision) are defined on ActiveRecord::ConnectionAdapters::Column, which is extended by MysqlColumn. So I don't understand the NoMethodError. Anyone have any bright ideas? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforge.org/pipermail/activewarehouse-discuss/attachments/20070622/d585533a/attachment.html