I’ve been focused on business customer solutions in business intelligence, Big Data and data warehouse designs for the past 20 years. In this blog, we’ll focus on Big Data Analytics. My hope is that everyone will benefit from these discussions and from our community. Enjoy! Best, Mark Kromer http://www.linkedin.com/in/markkromer
Hi Mark,
Recently came across some of your blogs on converting SSAS AdventureWorks model to Mondrian. Great set of details.
I recently started dabbling in Mondrian/Pentaho and would like to ask whether you could share any converted Mondrian schema (XML) for the AdventureWorks SSAS model.
I have been able to convert the database from SQL Express to MySQL (my choice of database); however, I don’t have immediate access to a SQL Server Standard edition, which I need to convert the schema using your command-line utility.
Even an incomplete/starting model would be great. I’m trying to set up a POC on how to work with Mondrian/Pentaho, replacing existing SSAS capability.
Looking forward to hearing from you.
Regards,
– Sujit
Hi Sujit – If you have the AdventureWorks DW database converted to MySQL, then the easiest way to get started using Mondrian for OLAP analysis is to auto-generate a new Mondrian model using Pentaho Analytics.
Download the Pentaho 5.2 BA Suite and use the Data Source Wizard to auto-generate a new model from your MySQL database. You can look up the instructions for doing this at http://help.pentaho.com or try one of my tutorials here or here.
Hi, when will all the datasets become available for mapping data flows?
Hi Mark, I just viewed your Data Quality with “Azure Data Factory, Better Analytics at Scale” presentation on the YouTube PragmaticWorks channel. I’ve viewed many presentations on ADF and ADLS – this is excellent and so comprehensive, thank you for adding the resources. I am fast-tracking to ADF/ADLS/ADLSA from a traditional SSIS, Kimball data warehouse background. I have a question – where is the best place to perform transformations? I see opportunity in ADLS using ADLSA and U-SQL, and in what you demonstrate using ADF and some available tasks. Of all the presentations I’ve viewed, nobody mentions what’s “best practice” – is this modern approach too new for a best practice? Is it a combination of both ADF and ADLS, or (as a professor used to say) does it “depend”? I have complex applications with call data with many aggregations, CHARINDEX, ISNULL and CONVERT/CAST. Anyway, perhaps a loaded question, but I’m a literal and want to abide by best practice.
Hi Mark, I just viewed your Data Quality with “Azure Data Factory, Better Analytics at Scale” presentation on the PragmaticWorks YouTube channel – I’ve been covering many presentations on ADF and ADLS – your presentation is excellent and very comprehensive – thank you for adding the resources. I am crossing over to the ADF/ADLS/ADLSA way of doing things from a traditional SSIS, Kimball data warehouse background. I have a question, as there is one thing I don’t get – where is the best place to perform transformations? I see opportunity in ADLS using ADLSA and U-SQL, and in what you demonstrate using ADF with some available tasks. Of all the presentations I’ve viewed, nobody mentions what and where “best practice” is here – is this modern approach too new for a best practice? Is it a combination of both ADF and ADLS, or does it “depend”? I have a complex DW use case with call data, with many aggregations and functions such as CHARINDEX, ISNULL and CONVERT/CAST. Anyway, perhaps a loaded question, but I’m a literal and want to abide by best practice here. Thank you for all of the content and insight you provide.
My presentation on Data Quality with ADF is focused on using Data Flows in ADF. But it is intended for data that is born in the cloud and requires scale-out computing resources. ADF Data Flows use Spark via Azure Databricks for transformation. So if you have smaller data requirements or wish to transform data on-prem, then use SSIS.
Hi, when will XML and Dynamics datasets be available in mapping data flows? Also, do you have any update on this:
https://feedback.azure.com/forums/270578-data-factory/suggestions/17508058-xml-file-type-in-copy-activity-along-with-xml-sc#comments
Please give us some good news during this time! Thank you!
In the works, coming soon
Hi, have you blogged about parsing XML in mapping data flows since the release of the XML connector? I can’t find much about this…
Hi Mark, I watched the Azure Synapse Analytics episode at https://ve.on24.com/vshow/AADM/exhibits/Episode_2 but I could not find any way to get your data sources… do you have a step-by-step guide on how to get your sources?
I’m working on that
https://github.com/kromerm/adfdataflowdocs/blob/master/sampledata/synapse-dataflows-tutorial-001.docx