We are excited to share a significant achievement at PeerDB: we have achieved full compliance with the General Data Protection Regulation (GDPR). This milestone represents our unwavering dedication to data protection and privacy, further strengthenin...| PeerDB Blog
At PeerDB, we are building a fast and cost-effective way to replicate data from Postgres to Data Warehouses such as BigQuery, Snowflake and ClickHouse. When building PeerDB UI, we wanted it to be minimal but effective. Features were driven by what th...| PeerDB Blog
Learn 5 key considerations for building a scalable, cost-effective GCP data lake. Optimize performance, storage, and analytics on Google Cloud.| HatchWorks AI
A new dataset of ndt7 measurements is now available in BigQuery, offering access to measurements from a set of servers whose data had not been previously published.| M-Lab
IP Route Survey (IPRS) data published in M-Lab | www.measurementlab.net
How Kuda leveraged dbt incremental models to reduce costs, speed up pipelines, and scale confidently.| dbt Developer Hub Blog
Data analysis of millions of GitHub events to track developer activity and tech trends driving the evolution of open-source WebRTC| webrtcHacks
Many companies face the challenge of efficiently processing large datasets for analytics. Using an operational database for such purposes can lead to performance issues or, in extreme cases, system failures. This highlights the need to transfer data from operational databases to data warehouses. This approach allows heavy analytical queries without overburdening transactional systems and supports shorter retention periods in production databases.| blog.allegro.tech
In today’s digital landscape, data is king. Google Search Console (GSC) serves as a treasure trove of data, offering valuable insights into your website’s performance in the Google Search Engine Results Pages (SERPs). However, for in-depth analysis and uncovering hidden trends, integrating GSC data with BigQuery unlocks a whole new level of SEO (digital marketing) power. […] The post Analysing Google Search Console Data with BigQuery appeared first on Omi Sido.| Omi Sido
はじめに こんにちは!今回はANDPADの各種ログを分析するためのデータ基盤を担当しているエンジニアからデータ基盤の変遷について紹介させていただきます。ANDPADのデータ基盤に興味がある方はぜひ過去の記事も合わせてご覧ください。 tech.andpad.co.jp tech.andpad.co.jp 本記事では過去のデータ基盤が抱えていた課題と、チームがどうやってその課題を解決してきたか*1につ...| ANDPAD Tech Blog
Origin trials are a way for developers to get early access to experimental web platform features. They’re carefully controlled “beta tests” run by browsers to ensure that the feature works and is worth more time on implementation and standardization. Check out Getting started with origin trials to learn more. What’s interesting to me is seeing […] The post Origin trials and tribulations appeared first on rviscomi.dev.| rviscomi.dev
Using regular expressions to parse HTML in BigQuery is a nightmare. Instead, we can use Cheerio in SQL to extract insights about the web.| rviscomi.dev
Analytics Toolkit was conceived in 2012 as a set of tools that automate essential Google Analytics-related tasks and augment the GA functionalities in various ways. This goal was achieved in the years since with the release of over a dozen tools utilizing the Google Analytics API. These were accompanied by dozens of in-depth technical articles on the same topic posted on this very blog which gathered hundreds of thousands of views over time. The toolkit served hundreds of digital agencies and...| Blog for Web Analytics, Statistics and Data-Driven Internet Marketing | Analy...
Raise your hand 🙋♂️if you write SQL but are bad about saving your queries and wish they were source controlled in git. This was me for the last 10 years. I used a variety of tools to help but none got the job done. It’s actually amazing how many “rogue” SQL queries there are out …Source control your SQL queries with Git. Use a SQL Runner + IDE hybrid! Read More »| Hashpath
Striim Cloud for Application Integration: Stream real-time CRM, ERP, Billing, and Payment data from cloud apps to data warehouses in minutes with zero coding.| Striim
FiveThityEight recently released a dataset of what is believed to be ~3 million tweets associated with “Russian trolls”. These tweets are designed to spread misinformation (let’s not mince words: lies), and ultimately influence voters. If you haven’t read the linked article, I highly suggest you do that before continuing on. Exploring a ~700MB+ CSV file isn’t hugely practical (it’s since been sharded into < 100MB chunks), and so I’ve made the tweets available as a public dataset...| questionable services
The DemandSphere team has been hard at work this Summer and we are excited to announce a bunch of new features focused on holisitic visibility.| DemandSphere
Google's team clarified that their Search Console import to BigQuery does not support historical import. We have a solution for historical data.| DemandSphere
Wehe data is now available in BigQuery | www.measurementlab.net