craigreadcloud – Page 3 – Cloud, IS & Business Alignment

Data Platform Automation

2nd November 2024 craigreadcloud

The problems with Data Pipelines and the hydration of a Data Lake include: Data teams often end with technical debt surrounding…

Continue Reading →

Data Ops, a summary

25th October 2024 craigreadcloud

In essence Data Operations is based on DevSecOps or DevOps and applies these same ideas to the life cycle of…

Continue Reading →

DataLake and Challenges

21st October 2024 craigreadcloud

DataLake The entire concept of a Data Operations Platform rests on top of a Data Lake. There is no simple…

Continue Reading →

Data Operations + Agile

2nd October 2024 craigreadcloud

Data Operations ‘DataOps’ has been inspired by the Agile-premised ‘Development Operations’ model. The ‘DevOps’ model which usually includes security (DevSecOps),…

Continue Reading →

Apache Iceberg: an overview

27th September 2024 craigreadcloud

The icebergth is hereth. Apache Iceberg is an open-source table format for large-scale data systems, designed to provide efficient and…

Continue Reading →

Data Partitions

14th September 2024 craigreadcloud

Data files or tables are parsed into smaller units. This is also called ‘partitioning’. A partition is usually performed against…

Continue Reading →

Parquet file format for Data Lakes

5th September 2024 craigreadcloud

Parquet is a file format standard used in many enterprises. It allows the standardisation of files and provides a common…

Continue Reading →

Databricks and Snowflake: Summary

29th August 2024 craigreadcloud

Databricks and Snowflake overlap in many areas. Firms deploying both need to clearly demarcate the epics and use case journeys…

Continue Reading →

Automating S3 to Redshift with Glue

15th August 2024 craigreadcloud

A straightfoward method to automate data ingestion from S3 buckets (data lake) to a Redshift (data warehouse) cluster; by using…

Continue Reading →

Data Ingestion and AWS Data Lake

9th August 2024 craigreadcloud

Data Ingestion Challenges Data ingestion can be complicated. There are usually a variety of data sources, including both SQL and…

Continue Reading →