About Kromer Big Data

I’ve been focused on business customer solutions in business intelligence, Big Data and data warehouse designs for the past 20 years. In this blog, we’ll focus on Big Data Analtyics. My hope is that everyone will benefit from these discussions and through our community. Enjoy! Best, Mark Kromer http://www.linkedin.com/in/markkromer

8 comments

  1. Hi Mark,

    Recently came across some of your blogs on converting SSAS AdventureWorks model to Mondrian. Great set of details.

    I recently started dabbling on Mondrian/Pentaho, and would like to explore – if you could share any converted Mondrian Schema (XML), for the Adventureworks SSAS model.

    I have been able to convert the database using SQLExpress to MySQL (my choice of database(, however I don’t have an immediate access to a SQL Server standard version, which I need to convert the schema using your command line utility.

    Even an incomplete/starting model would be great. I’m trying to setup a POC on how to work with Mondrian/Pentaho replacing existing SSAS capability.

    Will look forward to hear from you.

    Regards,

    – Sujit

  2. Hi Mark, I just viewed your Data Quality with “Azure Data Factory, Better Analytics at Scale” presentation on the YouTube PragmaticWorks channel. I’ve viewed many presentations on ADF and ADLS – this is excellent and so comprehensive, thank you for adding the resources. I am fast tracking to ADF/ADLS/ADLSA from a traditional SSIS , Kimball data warehouse background. I have a question – where is the best place to perform transformations? I see opportunity in ADLS using ADLSA and U-SQL and what you demonstrate using ADF and some available tasks. Of all the presentations Ive viewed nobody mentions whats “best practice” – is this modern approach too new for a best practice? Is it a combination of both ADF and ADLS, or (as a professor use to say) does it “depend”? I have complex applications with call data with many aggregations, CHARINDEX, ISNULL and CONVERT/CAST. Anyway, perhaps a loaded question but Im a literal and want to abide by best practice

  3. Hi Mark, I just viewed your Data Quality with “Azure Data Factory, Better Analytics at Scale” presentation on the PragmaticWorks YouTube channel – I’ve been covering many presentations on ADF and ADLS – your presentation is excellent and very comprehensive – thank you for adding the resources. I am crossing over tothe ADF/ADLS/ADLSA way of doing things from a traditional SSIS , Kimball data warehouse background. I have a question as there is one thing I dont get – where is the best place to perform transformations? I see opportunity in ADLS using ADLSA and U-SQL and what you demonstrate using ADF with some available tasks. Of all the presentations Ive viewed nobody mentions what and where “best practice” is here – is this modern approach too new for a best practice? Is it a combination of both ADF and ADLS, or does it “depend”? I have complex DW use case with call data with many aggregations such as CHARINDEX, ISNULL and CONVERT/CAST. Anyway, perhaps a loaded question but Im a literal and want to abide by best practice here. Thank you for all of the content and insight you provide.

    • My presentation on Data Quality with ADF is focused on using Data Flows in ADF. But it is intended for data that is borne in the cloud and requires scale-out computing resources. ADF Data Flows use Spark via Azure Databricks for transformation. So if you have smaller data requirements or wish to transform data on-prem, then use SSIS.

Leave a Reply to kromerbigdata Cancel reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s