<img height="1" width="1" style="display:none;" alt="" src="https://dc.ads.linkedin.com/collect/?pid=24166&amp;fmt=gif">

AtScale Blog

Announcing AtScale 6.5

Posted by Joshua Klahr on Mar 6, 2018

Data Lake Intelligence with AtScale

In my recent Data Lake 2.0 article I described how the worlds of big data and cloud are coming together to reshape the concept of the data lake. The data lake is an important element of any modern data architecture, and the data lake footprint will continue to expand. However, the data lake investment is only one part of delivering a modern data architecture. At Yahoo!, in addition to building a Hadoop-based data lake, we also needed to solve the problem of connecting traditional business intelligence workloads to this Hadoop data. Although the term “Data Lake” didn’t exist back then, we were solving the problem of: “How can you deliver an interactive BI experience on top of a scale-out Data Lake” - it turns out we were pioneers in delivering Data Lake Intelligence.


See AtScale Intelligence Platform in action, Sign up for this webinar!


Our experiences and learnings from those initial efforts led to the architecture that sits at the core of the AtScale Intelligence Platform. Because AtScale has been built from the ground up to deliver business-friendly insights from the vast amounts of information in data lakes, AtScale has experienced tremendous success and adoption in enterprises ranging from financial services, to retail to digital media. With the release of AtScale 6.5, we’ve continued to build on and expand AtScale’s ability to uniquely deliver on the promise of Data Lake Intelligence. If this sounds like something you might be interested in knowing more about… keep reading!

Read More

Topics: Business Intelligence, bi-on-hadoop, Big Data, Cloud, BI, Analytics, BI on Big Data, Data Strategy, data driven

What You Might Have Missed in February 2018

Posted by Ashley Huang on Mar 1, 2018

Poor February. The short month is dismissed for its brevity (let’s not talk about the weather) but a lot transpired the past 28 days, especially in big data and analytics. ICYMI, here’s a recap of the top stories:

 

Read More

Topics: Business Intelligence, bi-on-hadoop, Big Data, Cloud, BI, Analytics, BI on Big Data, Data Strategy, data driven

Supercharge Your Percentile Calculations for Big Data (Part III)

Posted by Daren Drummond on Feb 27, 2018

Additional contribution by: Santanu ChatterjeeTrystan LeftwichBryan Naden.

In the previous post we demonstrated how to model percentile estimates and use them in Tableau without moving large amounts of data.  You may ask, "how accurate are the results and how much load is placed on the cluster?".  In this post we discuss the accuracy and scaling properties of the AtScale percentile estimation algorithm.


To learn how to be a data driven orgazation, watch this webinar now!Join Jen Underwood to Learn Best Practices on Deploying Data Analytics Strategy


Read More

Topics: Hadoop, bi-on-hadoop, Analytics, BI on Big Data, percentiles

Supercharge Your Percentile Calculations for Big Data (Part II)

Posted by Daren Drummond on Feb 26, 2018

Additional contribution by: Santanu ChatterjeeTrystan LeftwichBryan Naden.

In the previous post, we discussed typical use cases for percentiles and the advantages of percentile estimates.  In this post, we illustrate how to model percentile estimates with AtScale and use them from Tableau.


To learn how to be a data driven orgazation, check out this webinar!Join Jen Underwood to Learn Best Practices on Deploying Data Analytics Strategy


Read More

Topics: Hadoop, bi-on-hadoop, Analytics, BI on Big Data, percentiles

Supercharge Your Percentile Calculations for Big Data (Part I)

Posted by Daren Drummond on Feb 23, 2018

Additional contribution by: Santanu Chatterjee, Trystan Leftwich, Bryan Naden.

 

A new and powerful method of computing percentile estimates on Big Data is now available to you! By combining the well known t-Digest algorithm with AtScale’s semantic layer and smart aggregation features AtScale addresses gaps in both the Business Intelligence and Big Data landscapes. Most BI tools have features to compute and display various percentiles (i.e. medians, interquartile ranges, etc), but they move data for processing which dramatically limits the size of the analysis.  The Hadoop-based SQL engines (Hive, Impala, Spark) can compute approximate percentiles on large datasets, however these expensive calculations are not aggregated and reused to answer similar queries.  AtScale offers robust percentile estimates that work with AtScale’s semantic layer and aggregate tables to provide fast, accurate, and reusable percentile estimates.  

In this three-part blog series we discuss the benefits of percentile estimates and how to compute them in a Big Data environment.  Subscribe today to learn the best practices of percentile estimation on Big Data and more.  Let's dive right in!


To learn how to be a data driven orgazation, check out this webinar!Join Jen Underwood to Learn Best Practices on Deploying Data Analytics Strategy


Read More

Topics: Hadoop, bi-on-hadoop, Analytics, BI on Big Data, percentiles

Analytics Performance of Olympic Proportions

Posted by Lucio Daza on Feb 22, 2018

Did you know that 9 out of 10 companies are only able to analyze less than half of the data they collect? In fact, 45% of these companies analyze less than one quarter of this data . Why is that? The answer is simple. We are generating more data than ever, 44 zettabytes by the year 2020 to be precise. What does this mean for you? A great data lake filled with extremely valuable potential to make better decisions for your company. Remember what Sherlock Holmes once said: “I can’t build bricks without clay!” We need data to answer complex questions.

Read More

Topics: Hadoop, Tableau, Performance, bi-on-hadoop, Analytics, Webinar, BI on Big Data

Happy Birthday George!

Posted by Lucio Daza on Feb 19, 2018

President’s day is the perfect opportunity to explore, honor and remember the legacy of Washington, Lincoln and other presidents. It is also a great day for all those who are looking for a good sale at their favorite retail or online store. What does this mean? Hundreds of millions of sales transactions will generate enormous amount of financial and inventory data will be generated on this one day.  


To learn from successful from Cloudera customers on how they succeed in BI on Big Data, check out this best practices webinarHear Success Stories From Enterprises like yours, Check Out This Webinar!


Read More

Topics: Hadoop, Tableau, Performance, bi-on-hadoop, Analytics, Webinar, BI on Big Data

Your Strata San Jose 2018 Insider’s Guide

Posted by Ashley Huang on Feb 16, 2018

 

 

It seems like only yesterday that we all gathered for the Strata New York Conference. And yet here we are, March is around the corner, and Strata San Jose is just a month away. Historically, Strata San Jose has roughly 5000 attendees while Strata New York averages closer to 7000 attendees. As one of the largest Hadoop conference in the US, Strata sessions focus on using data for competitive advantage. Strata Conference is also an opportunity to hear real life stories from enterprises who have been there, have the scars, and wrote the book. Strata is the ideal place to understand trends in the big data world. If you missed it, here are the [trends from 2017].


 

To learn from successful from Cloudera customers on how they succeed in BI on Big Data, check out this best practices webinar
Register for the Webinar

Read More

Topics: Big Data, GartnerBI, Cloud, BI, Analytics, Data, Data Lake, Data Strategy, data driven

Gartner Summit 2018: Scale the Value of Data and Analytics, Texas Style!

Posted by Ashley Huang on Feb 15, 2018

 

The annual Gartner Data & Analytics Summit is just around the corner.  As in past years, we are all anticipating the overwhelming sessions and agenda throughout this exciting week. This time in Grapevine, Texas (again)! In between sessions, catch a breath of fresh air and check out the exhibit hall to collect a bag of goodies to bring home. While T-shirts always make good pajamas, we may also wonder which sessions and vendors we should not miss. With all of the sessions available at the Summit, here are our suggestion on the ones you don’t want to miss!

Join the BI on Datalake Checklist Webinar

Read More

Topics: Big Data, GartnerBI, Cloud, BI, Analytics, Data, Data Lake, Data Strategy, data driven

It’s Complicated: The Love Hate Relationship with Data.

Posted by Lucio Daza on Feb 12, 2018

Data, Data, Data! All the facts, numbers, and everything in between that, when collected, can be analyzed and decisions made based upon them. Can’t live with it! Can’t live without it! This year over $14 billion will be spent globally on flowers, chocolate, and jewelry on Valentine’s Day alone. According to CIO Research, big data analytics investments are expected to be close to $187 billion. We are definitely investing more on data than expensive chocolates! The big question is, are your data investments nurturing your relationship with data the same way a bouquet of flowers and a box of chocolates can nurture your relationships?Hear Success Stories From Enterprises like yours, Check Out This Webinar!

Read More

Topics: Hadoop, Tableau, Performance, bi-on-hadoop, Analytics, Webinar, BI on Big Data

Learn about BI & Hadoop

The AtScale Blog is the one-stop shop for cutting edge news and insights about BI on Hadoop and all things AtScale.

Subscribe to Email Updates

Recent Posts