computer vision

Explore the New PixLab Vision Platform VLM API Endpoints

PixLab’s Vision VLM Platform introduces a groundbreaking set of Vision Language Model (VLM) endpoints that combine natural language processing and computer vision in a simple, developer-friendly API suite.

PixLab Vision Platform

From querying images and parsing complex documents to generating PDFs and extracting ID data, the PixLab VLM API makes it easy to integrate intelligent image and document analysis into your own apps.

Vision Language Model Endpoints

These endpoints allow your application to understand images and video frames with natural language intelligence.

/query – Ask natural language questions about images and receive contextual answers
/describe – Get a natural language description of an image
/tagimg – Retrieve tags describing the image content
/detect – Detect and localize objects in the image
/vocr – OCR via vision models for printed text
/nsfw – Detect explicit content in media

Unstructured Document Parsing

Parse unstructured documents like invoices, receipts, and contracts using VLM-powered tools.

/llm-parse – Extract data from complex document layouts using a user-defined schema

Embedding APIs

Turn your content into machine-understandable vectors for search, indexing, and matching.

/txt-embed – Generate semantic embeddings from raw text
/img-embed – Generate vector embeddings from images

These endpoints are perfect for building your own AI-powered similarity search or recommendation systems.

Image Processing & Background Removal

PixLab also provides classic computer vision capabilities enhanced by AI.

/bg-remove – Remove background or unwanted objects from images
/docscan – Scan and extract JSON data from over 11,000 supported ID document types from 200+ countries
/nsfw – Pixelate or block NSFW content automatically

PDF Generation & Conversion

Create and manipulate PDF documents programmatically using these SDK-free endpoints:

/pdfgen – Generate media-rich PDFs from HTML or Markdown
/pdftoimg – Convert PDF files into image previews

LLM Tool Calling Infrastructure

PixLab provides built-in tools for enhancing your LLM pipeline:

/llm-tool – Get a list of tools callable from your LLM
/tool-call – Execute a tool call based on LLM output

These endpoints enable your LLM agent to execute functions dynamically and return results within the same context.

System Tools & Metadata

Helpful utility endpoints for checking server health and supported formats:

/status – View current system status
/about – Get PixLab version & license info
/extension – Retrieve supported file extensions

Explore the Full API Suite

PixLab offers over 150 RESTful endpoints for vision, media, and document automation tasks. Visit the following links to dive deeper:

Final Thoughts

Whether you're working on an AI productivity suite, eKYC onboarding, or document automation pipeline, PixLab’s VLM API delivers powerful, production-ready tools in minutes. All endpoints are accessible via secure HTTP requests and require no proprietary SDKs.

Get started by signing up for an API key at PixLab Console and explore what's possible with Vision Language Models.

Build smarter apps — faster.

Introducing the New DOCSCAN API - Vision-Powered, SDK-Free, and Easier Than Ever

The PixLab Development Team is thrilled to announce the release of the next-generation DOCSCAN API, the core engine behind the newly rebranded PixLab ID Scan Platform.

Built from the ground up with Vision Language Models and hosted on the powerful PixLab Vision infrastructure, this update brings unmatched simplicity, security, and intelligence to identity document processing.

🌍 A Platform Re-imagined

Say goodbye to complex SDK integrations. The new DOCSCAN API requires no client-side SDKs, just a single HTTPS-enabled REST endpoint that supports both GET and POST requests. This means you can call DOCSCAN from any programming environment, whether you're using Python, Java, PHP, Go, or even a shell script.

New Home Page: pixlab.io/id-scan-api/
New Documentation: pixlab.io/id-scan-api/docscan

⚡ What's New in This Version?

✅ Powered by Vision Language Models

The new DOCSCAN API harnesses the full power of PixLab’s Vision Language Models to extract structured, high-quality data from scanned documents with increased accuracy and robustness.

✅ No SDK Required

Forget installing SDKs or maintaining device-specific libraries. DOCSCAN is pure REST—simple, fast, and universal.

✅ Single Endpoint Simplicity

Use a single, unified API endpoint for both document scanning and data extraction. No need to juggle multiple APIs or chained requests.

✅ Supports GET & POST

Whether you prefer URL-based GET requests or multipart POST uploads, DOCSCAN supports both with full flexibility.

✅ TLS 1.3 Secured

All API traffic is encrypted end-to-end using TLS 1.3, ensuring maximum security and compliance from the first byte.

🚀 Built for Developers

The updated documentation at pixlab.io/id-scan-api/docscan has been completely restructured to be developer-first. Clear examples, copy-ready code snippets, and real-world integration guides will help you get up and running in minutes.

🧩 Use Cases

ID verification in finance, healthcare, or government
Digital onboarding for apps and services
Automated customer registration flows
Global document scanning with consistent output formats

🛠 Start Building Today

Head to the PixLab Console to generate your API key and begin integrating the new DOCSCAN API in minutes.

Whether you're modernizing your on-boarding flow or automating ID verification at scale, the new DOCSCAN API offers unmatched speed, simplicity, and intelligence—without the SDK overhead.

🔗 Learn more: pixlab.io/id-scan-api/
📚 REST API Documentation: pixlab.io/id-scan-api/docscan

🌍 Universal Document Support

The DOCSCAN API provides robust support for a wide array of officially issued identification documents. This includes, but is not limited to:

Passports
ID Cards (Citizen ID, Resident ID, Immigration Card, etc.)
Driving Licenses
Visas
Birth & Death Certificates

The API covers documents from nearly all UN-recognized countries, offering unparalleled versatility. This release expands the API's capabilities to handle over 11,094 ID document variations originating from more than 197 countries. Below is a list of supported countries by DOCSCAN :

Afghanistan
Albania
Algeria
Andorra
Angola
Antigua and Barbuda
Argentina
Armenia
Australia
Austria
Azerbaijan
Bahamas
Bahrain
Bangladesh
Barbados
Belarus
Belgium
Belize
Benin
Bhutan
Bolivia
Bosnia and Herzegovina
Botswana
Brazil
Brunei
Bulgaria
Burkina Faso
Burundi
Cabo Verde
Cambodia
Cameroon
Canada
Central African Republic
Chad
Chile
China
Colombia
Comoros
Congo (Brazzaville)
Congo (Kinshasa)
Costa Rica
Cote d'Ivoire
Croatia
Cuba
Cyprus
Czechia
Denmark
Djibouti
Dominica
Dominican Republic
Ecuador
Egypt
El Salvador
Equatorial Guinea
Eritrea
Estonia
Eswatini
Ethiopia
Fiji
Finland
France
Gabon
Gambia
Georgia
Germany
Ghana
Greece
Grenada
Guatemala
Guinea
Guinea-Bissau
Guyana
Haiti
Honduras
Hungary
Iceland
India
Indonesia
Iran
Iraq
Ireland
Israel
Italy
Jamaica
Japan
Jordan
Kazakhstan
Kenya
Kiribati
Kuwait
Kyrgyzstan
Laos
Latvia
Lebanon
Lesotho
Liberia
Libya
Liechtenstein
Lithuania
Luxembourg
Madagascar
Malawi
Malaysia
Maldives
Mali
Malta
Marshall Islands
Mauritania
Mauritius
Mexico
Micronesia
Moldova
Monaco
Mongolia
Montenegro
Morocco
Mozambique
Myanmar
Namibia
Nauru
Nepal
Netherlands
New Zealand
Nicaragua
Niger
Nigeria
North Korea
North Macedonia
Norway
Oman
Pakistan
Palau
Panama
Papua New Guinea
Paraguay
Peru
Philippines
Poland
Portugal
Qatar
Romania
Russia
Rwanda
Saint Kitts and Nevis
Saint Lucia
Saint Vincent and the Grenadines
Samoa
San Marino
Sao Tome and Principe
Saudi Arabia
Senegal
Serbia
Seychelles
Sierra Leone
Singapore
Slovakia
Slovenia
Solomon Islands
Somalia
South Africa
South Korea
South Sudan
Spain
Sri Lanka
Sudan
Suriname
Sweden
Switzerland
Syria
Taiwan
Tajikistan
Tanzania
Thailand
Timor-Leste
Togo
Tonga
Trinidad and Tobago
Tunisia
Turkey
Turkmenistan
Tuvalu
Uganda
Ukraine
United Arab Emirates
United Kingdom
United States
Uruguay
Uzbekistan
Vanuatu
Vatican City
Venezuela
Vietnam
Yemen
Zambia
Zimbabwe

New Release: PixLab Bulk Background Removal App for Creators, Marketers & Developers

We’re excited to announce the launch of the PixLab Bulk Background Removal App — a powerful web-based tool built for creators, e-commerce teams, marketers, and developers who need to remove backgrounds from multiple images at once, quickly and effortlessly.

bg remove in action

Try it now at: https://bg-remove.pixlab.io

⚡ What Is It?

The PixLab Bulk Background Removal App is a fast and secure online utility that allows you to upload and process dozens of images in one go. Whether you're preparing product photos, social media content, or working on visual assets for apps, this tool saves hours of manual work by automatically removing image backgrounds with pixel-level accuracy.

🎯 Who Is It For?

This app is ideal for: - Content Creators: Prepare polished assets for thumbnails, posts, and visuals. - E-commerce Teams: Batch process product shots for online stores or catalogs. - Marketing Agencies: Generate clean marketing creatives quickly and consistently. - Developers & Engineers: Integrate background removal into custom workflows via API.

🧰 Key Features

Bulk Uploads: Process dozens of images at once — drag and drop your entire folder.
Fast & Secure: Optimized for performance with automatic deletion of files after 24 hours.
Pixel-Perfect Removal: Automatically detects and removes backgrounds with precision — no need for manual masking.
Multiple Format Support: Works with JPG, PNG, WEBP, and other popular image types.
Free to Start: Try it instantly, no signup required for basic usage.

🔗 Need Programmatic Integration?

For developers who want to integrate background removal into their applications, PixLab offers a fully documented REST API that supports: - Single image or batch processing - Custom output sizes and formats - Seamless integration with your existing codebase

📘 Explore the Background Removal API: pixlab.io/endpoints/background-remove-api

🖥 Try It Now

Use the app directly in your browser:
Launch Bulk Background Removal App →

Or learn more on the PixLab product page:
https://pixlab.io/bulk-image-background-removal-tool-apis

Whether you're editing a gallery of photos or automating your media pipeline, the new PixLab BG-Remove App brings high-performance image background removal to your fingertips — at scale.

— The PixLab Team

Unlocking the Power of DOCSCAN API for Developers: A Unified Solution for ID Scanning

In an era where digital identity verification is critical across industries, PixLab's DOCSCAN API stands out by providing an unparalleled ID scanning solution.

With support for over 11,000 types of identification documents from 197+ countries, DOCSCAN unifies ID Scan into a single, seamless REST API endpoint that is unmatched by other eKYC platforms. Let’s dive into the core capabilities of this powerful tool and why it’s a game-changer for developers.

DOCSCAN ID

Why DOCSCAN is a Must-Have for Developers

The DOCSCAN API Endpoint is more than just an ID scan API. Designed to be developer-friendly with RESTful API architecture, it simplifies the complex process of identifying, extracting, and validating personal data from IDs, passports, driver's licenses, visas, birth & death certificates and more.

Single Endpoint Access
With just one API endpoint, https://api.pixlab.io/docscan, developers can integrate ID scanning capabilities effortlessly into their applications. This endpoint unifies ID scanning, eliminating the need for multiple services or platforms to handle various document types.
Unmatched Document Coverage
DOCSCAN supports 11,097+ documents, including both national and international travel documents. Whether structured with Machine Readable Zones (MRZ) or not, the API efficiently processes them. No other eKYC platform on the market today provides this breadth of coverage.

Key Capabilities of DOCSCAN API Endpoints

Global ID Support
DOCSCAN is designed to handle IDs from nearly every country in the world, making it ideal for global businesses. From passports and residence permits to birth certificates, the API ensures that organizations can onboard users regardless of their location.
RESTful API Architecture
The API’s REST architecture ensures easy integration across platforms and programming languages, including Python, Java, PHP, and JavaScript. With a well-documented REST endpoint, developers can quickly set up the solution without extensive learning curves.
Intelligent Image Handling
DOCSCAN integrates intelligent image correction, automatically adjusting for skew, distortion, or layout variations to enhance scanning accuracy. This ensures that even low-quality images yield accurate extraction of data.
Built-in Face Detection
For applications that require additional security, DOCSCAN offers face detection and extraction.
Privacy-First Processing
PixLab processes all data in-memory, ensuring no user data is stored on servers. This privacy-first design aligns with regulatory compliance, including GDPR, making it a trustworthy choice for sensitive applications.

Streamlined Integration with Code Samples

DOCSCAN offers ready-to-use code samples that are accessible here making integration a breeze for developers of all skill levels. Whether you’re working on a financial service, e-commerce, healthcare, or travel platform, the code examples available in multiple languages help you quickly adopt DOCSCAN into your existing project.

Conclusion

PixLab’s DOCSCAN API Endpoint sets a new standard for identity verification, offering a unified, powerful, and developer-friendly platform. With comprehensive global document coverage and advanced features like face detection, it helps businesses scale their eKYC operations effortlessly.

Whether you're building solutions for financial services, healthcare, travel, or e-commerce, the DOCSCAN API endpoint offers scalable, privacy-first, and seamless ID scanning capabilities. Start leveraging DOCSCAN today to simplify identity verification across your digital platforms.

Explore Further:

PixLab’s Document Scanner now able to scan Driving License issued by any U.S. state

The PixLab Optical Character Recognition team is thrilled to announce that its document scanning API endpoint /DOCSCAN, is now able to scan U.S. Driver’s licenses and driving permits issued by jurisdictions from all the 50 U.S. states.

DOCSCAN API endpoint now supports scanning US driver’s license from all 5O states

The /DOCSCAN API endpoint now allows any Website that is presented with a U.S Driver’s License, International Passport or ID Card to verify that the inputted information by the end user matches those present on the submitted or uploaded ID document image.

Usage & Code Samples

Given an input U.S driver’s license image issued by any of the 50 U.S. states, crop the license holder face, and extract fields of interest as follow:

Input U.S driver’s license image Car Vectors by Vecteezy

Extracted Fields Showcase extracted fields from the submitted driver's image

The extracted fields after successful call to the /DOCSCAN API endpoint are:

License holder cropped face. This image will be stored on an AWS S3 bucket of your choice if you connect your target bucket from the PixLab Console.
Issuing Country (USA obviously).
Issuing State Name.
Issuing State Two-Letter Code.
License Number.
License Holder’s Full Name.
License Holder’s Address.
License Holder’s Date of Birth (yyyy-mm-dd).
License Issuing Date (yyyy-mm-dd).
License Expiry Date (yyyy-mm-dd).
License Holder’s Gender.

The code samples that is used to achieve such results are available via the following Gists:

Python Gist. Also available on the PixLab Gitub Repository:
PHP Gist. Also available on the PixLab Gitub Repository:

Algorithms Under the hood

Face extraction is automatically performed using the /FACEDETECT API endpoint.
/DOCSCAN already supports GET & POST HTTP methods so you can upload your document images directly from your application without relying on a foreign server. Refer to this Gist on how to do so.
Upon the image processed on our server, it is automatically deleted. We do not keep trace or any log of your input images.
Internally, we mainly rely on PP-OCR which is a practical ultra-lightweight OCR system that is mainly composed of three parts: Text Detection, Bounding Box Isolation, & Text Recognition. This combination produces highly accurate results in less than 5 seconds of processing.

Full Scan Support for United Arab Emirates (UAE) ID/Residence Cards

The PixLab Document Scanner, development team is pleased to announce that is now fully support scanning Emirates (UAE) ID & Residence Cards via the /DOCSCAN API endpoint at real-time using your favorite programming language.

When invoked, the /DOCSCAN HTTP API endpoint shall Extract (crop) any detected face and transform the raw UAE ID/Residence Card content such as holder name, nationality, ID number, etc. into a JSON object ready to be consumed by your app.

Below, a typical output result of the /DOCSCAN API endpoint for an Emiratis (UAE) ID card input sample:

Input Emirates (UAE) ID Card

UAE ID card specimen

Extracted UAE ID Card Fields

UAE extracted fields

The code samples used to achieve such result are available to consult via the following gists:

Python Code Samples for Scanning UAE ID Card: uae_emirates_id_card_scan.py
PHP Code Samples for Scanning UAE ID Card: uae_emirates_id_card_scan.php
PixLab Github Repository: github.com/symisc/pixlab

The same logic applies to scanning official travel documents like Visas, Passports, and ID Cards from many others countries in an unified manner, regardless of the underlying programming language used on your backend (Python, PHP, Ruby, JS, etc.) thanks to the DOCSCAN API endpoint as shown in previous blog posts:

Passports & Travel Document Scan: Blog Announcement & Code Sample.
Malaysia & Singapore ID Card Scan: Blog Announcement & Code Sample.
Aadhar India ID Card Scan: Blog Announcement & Code Sample.

Algorithm Details

Internally, PixLab's document scanner engine is based on PP-OCR which is a practical ultra-lightweight OCR system, mainly composed of three parts: DB text detection, detection frame correction, and CRNN text recognition. DB stands for Real-time Scene Text Detection.

PP-OCR: A Practical Ultra Lightweight OCR System - Algorithm Overview

PP-OCR Algorithm Overview

The system adopts 19 effective strategies from 8 aspects including backbone network selection and adjustment, prediction head design, data augmentation, learning rate transformation strategy, regularization parameter selection, pre-training model use, and automatic model tailoring and quantization to optimize and slim down the models of each module.

In PP-OCR, Differentiable Binarization (DB) is used as text detector which is based on a simple segmentation network. It integrates feature extraction and sequence modeling. It adopts the Connectionist Temporal Classification (CTC) loss to avoid the inconsistency between prediction and label.

The algorithm is further optimized in five aspect where the detection model adopts the CML (Collaborative Mutual Learning) knowledge distillation strategy and CopyPaste data expansion strategy. The recognition model adopts the LCNet lightweight backbone network, U-DML knowledge distillation strategy and enhanced CTC loss function improvement, which further improves the inference speed and prediction effect.

New Gender/Age Classification Model Deployed

Here at PixLab, we recently deployed on production, a brand new gender/age classification model available to our customers via the FACEMOTION API endpoint.

gender age detection

The new model implementation is based on the ResNet-50 convolutional neural network (CNN) that is 50 layers deep. The network can easily classify images into 1000 object categories, such as keyboard, mouse, pencil, and many animals.
The reference, implementation paper is from: Jiankang Deng, Jia Guo, Niannan Xue, Stefanos Zafeiriou: Additive angular margin loss for deep face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019 (https://arxiv.org/abs/1801.07698).
The Python/PHP code samples listed below should be able to easily output the age estimation, gender, and emotion pattern by just looking at the facial shape of any present human face in a given picture or video frame using our new classifcation model.

Python Code

The PHP usage sample can be viewed here or on the PixLab Github repository.

FACEMOTION is the sole endpoint needed to perform such a task. It should output the rectangle coordinates for each detected human face that you can pass verbatim if desired to other processing endpoints like CROP or MOGRIFY plus the age estimation, gender and emotion pattern of the target face based on its facial shape.
Finally, all of our production ready, code samples are available to consult at our samples page or the PixLab Gihtub repository.

Introducing the Pixel Generate API Endpoint

PixLab Logo

The PixLab Computer Vision Team is pleased to introduce the Pixel Generate API endpoint (/pixelgenerate) which let you in a single call, generate on the fly, images filled with random pixels of desired width & height using a mix of standard Image Processing and soon Machine Learning algorithms.

This endpoint is similar to /newimage except that the image contents is filled with random pixels. This is very useful for generating background (negative) samples for feeding Machine Learning training algorithms for example.

By default, this endpoint return a JSON object holding a link to the generated image output. But, you can set it via the Blob parameters to return the image binary contents instead.

Below, a Python snippet which generate on the fly a new image of height & width of 300x300 filled with random pixels using a single call to /pixelgenerate:

The code sample used to achieve such result is available to consult via the following Github link:

Python code for generating random pixels: pixel_generate.py.
The endpoint documentation is available to consult at pixlab.io/cmd?id=pixelgenerate.
Others code samples including Passports & ID Scanning ,Face Blurring, etc. and for a general introduction to the PixLab API, refer to our Github Repository and Examples Page.

Modern Passport Structure & Bulk Scan APIs

A Passport is a document that almost everyone has at some point in their lives. It is issued by the country’s government to its citizens and mainly being used for traveling purposes. It also serves as proof of nationality, name, and more importantly an Universally Unique ID for its owner.

Modern Passport Structure

Passport Specimen

Many services have been long-time accepting passports as identification documents from their customers to complete their KYC (Know Your Customer) form as required by the legislation in force. This is especially true and enforced for the Finance, HR or Travel sectors. In most cases, a human operator will verify the authenticity of the submitted document and grant validation or reject it.

Things can get really complicated if you have hundreds of KYC forms to checks, but also if your clients differ in nationality. Quickly, you will find yourself drowning in physical copies of passports in different languages that you can not even understand. Let alone the potential legal problems you can face with passport copies laying around the office. This is why, an automated & safe solution for Passports processing is required!

Modern Passport Structure

From the 1980s on wards, most countries started issuing passports containing an MRZ. MRZ stands for the Machine Readable Zone and is usually located at the bottom of the Passport as shown below:

Modern Passport Specimen

Passports MRZ Sample

Passports that contain an MRZ are referred to as MRPs, machine-readable passports (Almost all modern issued Passports have one). The structure of the MRZ is standardized by the ICAO Document 9303 and the International Electro-technical Commission as ISO/IEC 7501-1.

The MRZ is an area on the document that can easily be read by a machine using an OCR Reader Application or API. It’s not important for you to understand how it works, but if you look at it carefully, you will see that it contains most of the relevant information on the document, combined with additional characters and a checksum that can be extracted programmatically and automatically via API as we will see in the next section.

Once parsed, the following information are automatically extracted from the target MRZ and made immediately available to your app, thanks to the /docscan API endpoint:

issuingCountry: The issuing country or organization, encoded in three characters.
fullName: Passport holder full name. The name is entirely upper case.
documentNumber: This is the passport number, as assigned by the issuing country. Each country is free to assign numbers using any system it likes.
checkDigit: Check digits are calculated based on the previous field. Thus, the first check digit is based on the passport number, the next is based on the date of birth, the next on the expiration date, and the next on the personal number. The check digit is calculated using this algorithm.
nationality: The issuing country or organization, encoded in three characters.
dateOfBirth: The date of the passport holder's birth in YYMMDD form. Year is truncated to the least significant two digits. Single digit months or days are perpended with 0.
sex: Sex of the passport holder, M for males, F for females, and < for non-specified.
dateOfExpiry: The date the passport expires in YYMMDD form. Year is truncated to the least significant two digits. Single digit months or days are perpended with 0.
personalNumber: This field is optional and can be used for any purpose that the issuing country desires.
finalcheckDigit: This is a check digit for positions 1 to 10, 14 to 20, and 22 to 43 on the second line of the MRZ. Thus, the nationality and sex are not included in the check. The check digit is calculated using this algorithm.

Automatic Passport Processing

PixLab Logo

Fortunately for the developer wishing to automate Passports scanning, PixLab can automatically scan & extract passport MRZ but also help to detect possible fraudulent documents. This is made possible thanks to the /docscan API endpoint which let you in a single call scan government issued documents such as Passports, Visas or ID Cards from various countries.

Besides extracting MRZ, the /docscan API endpoint shall automatically crop any detected face and transform binary Machine Readable Zone into stream of text content (i.e. full name, issuing country, document number, date of expiry, etc.) ready to be consumed by your app in the JSON format.

Below, a typical output result of the /docscan endpoint for a passport input image:

Input Passport Specimen (JPEG/PNG/BMP Image)

Input Image URL

Extracted MRZ Fields

MRZ Fields

What follow is the gist used to achieve such result:

Other document scanning code samples are available to consult via the following Github links:

Python code for scanning Passports: passport_scan.py.
PHP code for scanning Passports: passport_scan.php.

Face extraction is automatically performed using the /facedetect API endpoint. For a general purpose Optical Character Recognition engine, you should rely on the /OCR API endpoint instead. If you are dealing with PDF documents, you can convert them at first to raw images via the /pdftoimg endpoint.

Conclusion

The era we are in is more digitized than ever. Tasks that are repetitive are slowly being replaced by computers and robots. In many cases, they can perform these tasks faster, with a smaller amount of mistakes and in a more cost-effective manner. At PixLab we focus on building software to replace manual repetitive labor in administrative business processes. The processing and checking of passports can be very time-consuming. Using /docscan to automate your passport processing will enable you to save cost, on-board customers faster and reduce errors in administrative processes.

Detect & Blur Faces Programmatically using PixLab

Our colleague Vincent just published an interesting blog post on how to automatically detect and blur faces at real-time using the PixLab API with a nice introduction on how modern face detection algorithms works under the hood and the privacy concerns related to such use of technology!

Article Link: dev.to/unqlite_db/detect-blur-faces-programmatically-2cbp.
DZone Article: dzone.com/articles/detect-and-blur-faces-programmatically.

Blurred Faces

facedetect API Endpoint Documentation: pixlab.io/cmd?id=facedetect.
morgify API Endpoint Documentation: pixlab.io/cmd?id=mogrify.
Python Code Gist.
PHP Code Gist.
Github Repository: github.com/symisc/pixlab.

Vision Language Model Endpoints

Unstructured Document Parsing

Embedding APIs

Image Processing & Background Removal

PDF Generation & Conversion

LLM Tool Calling Infrastructure

System Tools & Metadata

Explore the Full API Suite

Final Thoughts

🌍 A Platform Re-imagined

⚡ What's New in This Version?

✅ Powered by Vision Language Models

✅ No SDK Required

✅ Single Endpoint Simplicity

✅ Supports GET & POST

✅ TLS 1.3 Secured

🚀 Built for Developers

🧩 Use Cases

🛠 Start Building Today

🌍 Universal Document Support

⚡ What Is It?

🎯 Who Is It For?

🧰 Key Features

🔗 Need Programmatic Integration?

🖥 Try It Now

Why DOCSCAN is a Must-Have for Developers

Key Capabilities of DOCSCAN API Endpoints

Streamlined Integration with Code Samples

Conclusion

Explore Further:

Usage & Code Samples

Algorithms Under the hood

Further Reading

Algorithm Details

Python Code

Modern Passport Structure

Automatic Passport Processing

Conclusion