PythonHub Logo Python Hub Weekly Digest for 2024-08-25

๐Ÿ’– Most Popular

DeltaDB
DeltaDB is a lightweight, fast, and scalable database built on polars and deltalake.

Some more batteries to do stuff with Mapping related data structures
This library provides utility functions for manipulating and transforming data structures which have or include Mapping-like characteristics.

Optimizing PyTorch Docker images: how to cut size by 60%

emval
emval is a blazingly fast Python email validator written in Rust.

From Boring Object-Oriented to INSANE Functional Code
This video demonstrates that there's a place for both object-oriented and functional code. In Python, these two approaches can be combined effectively, allowing you to leverage the strengths of each for the best results.


๐Ÿ“– Articles

Python Hub Weekly Digest for 2024-08-18

FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention

Crawling Pages with Infinite Scroll using Scrapy and Playwright
This post provides a detailed guide on how to scrape infinite scroll websites using Scrapy and Playwright in Python. It covers the setup process, explains how to implement a custom downloader middleware to handle JavaScript rendering, and demonstrates how to extract data from dynamically loaded content, offering a practical solution for web scraping challenges posed by modern web applica...

CSVs Are Kinda Bad. DSVs Are Kinda Good.
The article argues that CSVs (Comma-Separated Values) are problematic due to various edge cases involving delimiters, quotes, and newlines, and proposes using Delimiter-Separated Values (DSV) with ASCII control characters as a more robust alternative. It demonstrates how DSVs can handle complex data without escaping or quoting issues, but acknowledges that the lack of widespread tool sup...

Automating Boolean-Based SQL Injection with Python
How to be efficiently lazy at finding hidden gems in predictable places โ€“ Database Edition.

Cloudflare R2 x Django. Static Files. User uploads, css, images, js and more. Production-ready.
The video covers setting up and managing Django files, including static and user-uploaded files, using Cloudflare's R2 object storage. It emphasizes best practices for configuring environment variables, securing API keys, and managing static and media files in Django with advanced validation and customization options.

Django: create sub-commands within a management command

goauthentik / authentik
The authentication glue you need.


โš™๏ธ Projects

Deep-Live-Cam
Real time face swap and one-click video deepfake with only a single image.

LinkedIn_AIHawk
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.

llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

pgai
Bring AI models closer to your PostgreSQL data.

Unstract
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents.

AgentK
An autoagentic AGI that is self-evolving and modular.

EDSL
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.


๐Ÿ‘พ Reddits

What are data structures anyway?


← Previous Next →

Project by Ruslan Keba. Since 2012. Powered by Python. Made in ๐Ÿ‡บ๐Ÿ‡ฆUkraine.