In a recent blog post, Amazon has introduced a new market data publisher/subscriber service called AWS Data Exchange. This service is an add-on to the existing AWS Marketplace and contains more than 1000 licensable data products from more than 80 data providers. These data feeds include both free and paid offerings that span industries such […]
Machine Learning Drastically Curtails Mobile App Development Mistakes
Machine learning is disrupting the mobile app development industry. Although mobile app developers have used machine learning in some way or another for years, they are finding new applications for it. Machine learning is particularly useful when it comes to avoiding many of the biggest mistakes that app developers make. The Growing Importance of Machine […]
Simplifying ETL in the Cloud, Microsoft Releases Azure Data Factory Mapping Data Flows
In a recent blog post, Microsoft announced the general availability (GA) of their serverless, code-free Extract-Transform-Load (ETL) capability inside of Azure Data Factory called Mapping Data Flows. This tool allows organizations to embrace a data-driven culture without the need to manage large infrastructure footprints while having the ability to dynamically scale data processing workloads. By […]
Presentation: Big Data’s Ethical Drought: The Thirst for More Data Has Led to a Lapse in Ethics and Privacy
Katharine Jarmul provides examples of data (mis)use and asking how we can work with data without violating the trust and privacy of users, producing an ethical product? By Katharine Jarmul
Artificial Intelligence and Big Data in Higher Education: Promising or Perilous?
What exactly is artificial intelligence (AI) and what business does it have in higher education? Simply put, AI is an attempt to emulate human knowledge by programming extensive rules into computers. Through machine learning and expert systems, machines can produce patterns within mass flows of data and pinpoint correlations that couldn’t possibly be immediately intuitive […]
Investigating The Scalability Issues Of Bitcoin In Blockchain
Scalability is a crucial factor that is talked about even before an application has been created. Yet, even after the application is deployed, the application will be required to be updated frequently to scale based on the application load. While scalability has been one of the concerns even in the blockchain space, it wasn’t the […]
Google Releases Cloud Dataproc for Kubernetes in Alpha
Google Cloud Dataproc is an open-source data and analytic processing service based on Hadoop and Spark. Google has now announced the alpha availability of Cloud Dataproc for Kubernetes to provide customers with more efficiency to process data across platforms. By Steef-Jan Wiggers
Big Data Paves The Road For A New Generation Of Investing Apps
Big data is changing the financial industry in a truly astounding way. Countless financial professionals are looking towards machine learning and other new tools to improve the quality of the services that they offer to their customers. K. Hussain of Atos Spain published a white paper on the growing relevance of big data in the […]
Jagadish Venkatraman on LinkedIn’s Journey to Samza 1.0
At the recent ApacheCon North America, Jagadish Venkatraman spoke about how LinkedIn developed Apache Samza 1.0 to handle stream processing at scale. He described LinkedIn’s use cases involving trillions of events and petabytes of data, then highlighted the features added for the 1.0 release, including: stateful processing, high-level APIs, and a flexible deployment model. By […]
5 Reasons Why You Should Store Big Data In The Cloud
Gone are the days when storage of information can only be done with the traditional remote servers which are located in a secluded location. Today, the in-thing is cloud data storage where information and data are stored electronically online. With this approach, you can store unlimited data online (in the cloud) and access it anywhere. […]