Blog | Perardua Consulting

Apache Iceberg

https://medium.com/data-engineer-things/apache-iceberg-the-hadoop-of-the-modern-data-stack-c83f63a4ebb9

3 days ago · 1 min read

Apache Spark Best Practices: Optimizing Data Processing

Apache Spark is a powerful, open-source distributed computing system that can process big data. It is known for its speed and ease of use, which makes it popular among software engineers and data scientists.

Claude Paugh

4 days ago · 3 min read

Gathering Statistics with PySpark: A Comparative Analysis with Scala

Data processing and statistics gathering are essential tasks in today's data-driven world. Engineers frequently find themselves choosing between tools like PySpark and Scala when embarking on these tasks.

Claude Paugh

4 days ago · 4 min read

Benefits of Data Engineering and Its Impact on Business Costs

Data architecture refers to the design and organization of data structures and systems within an organization. It defines how data is collected, stored, and used, serving as a blueprint for managing data assets.

Claude Paugh

4 days ago · 5 min read

ETF, Mutual Fund, and Stock Data: Accessing Analytical Content

The analytics console looks very much like the query console with the exception of the panels on the right. This is where you can map data structures from the local or remote Couchbase collections as sources. The analytics service makes a copy of the original data, and provides the ability to index it separately from the original source.

Claude Paugh

4 days ago · 2 min read

ETF, Mutual Fund, and Shareholder Data: Retrieving Content

If you're a software engineer, there are various SDKs and connectors available. On the other hand, if you just want to look at document content, you can use either the built-in "Query" section of the Couchbase console or a third-party tool that has a driver to connect.

Claude Paugh

4 days ago · 2 min read

Spark Data Engineering: Best Practices and Use Cases

In today's data-driven world, organizations are generating vast amounts of data every second. This data can be a goldmine for insights when processed and analyzed effectively. One of the most powerful tools in this realm is Apache Spark.

Claude Paugh

4 days ago · 5 min read

ETFs, Mutual Funds, and Asset Data Analysis: Introduction

Several years ago, I started a side project that I thought would be fun: collecting and loading SEC filings for ETF and Mutual Fund Holdings on a monthly basis. I wanted to essentially automate the collection of the SEC filings

Claude Paugh

4 days ago · 5 min read
