PixLab Document Scanner Scores Number 1 as Malaysia's Premier KYC ID Verification (MyKAD) Provider

Kuala Lumpur, Malaysia - In a significant industry recognition, PixLab's Document Scanner has been awarded the title of the top KYC ID Verification provider for MyKAD in Malaysia. This honor underscores the company's commitment to excellence and its continuous drive to provide the best identity verification solutions in the market.

PixLab Document Scanner

PixLab's Document Scanner has set itself apart with its cutting-edge machine learning technology, user-friendly interface, and stringent data security measures. Businesses across Malaysia have lauded the product for its accuracy and efficiency, making it the go-to solution for KYC ID verification.

Why PixLab Stands Out

PixLab's Document Scanner has been recognized as the number one KYC ID Verification provider for MyKAD in Malaysia. This accolade is not just a testament to the product's superior technology but also its commitment to ensuring a seamless user experience.

  • Advanced Technology: Leveraging state-of-the-art machine learning algorithms, PixLab offers unparalleled accuracy in scanning and verifying MyKAD documents. This ensures that businesses can trust the authenticity of the ID being presented, reducing the risk of fraud.
  • User-Centric Design: Understanding the importance of a smooth user journey, PixLab has designed its scanner to be intuitive. This means quicker onboarding for customers and less friction in the verification process.
  • Data Security: In today's world, data breaches are a growing concern. PixLab prioritizes user data security, ensuring that all scanned information is encrypted and stored securely.

Related Articles

To learn more about PixLab's DOCSCAN API endpoint and its comprehensive features, please refer to the following articles & code samples:

  1. Passports, Travel Documents & ID Cards Scan API Endpoint Available.
  2. Full Scan Support for Malaysia and Singapore ID Cards.
  3. Scanning Malaysian ID Cards with PixLab API.
  4. Memindai Kad Pengenalan Malaysia dengan API PixLab: Panduan Lengkap.

Code Samples

Press Release

"We are immensely proud of this recognition," said Mrad Chams, CTO of PixLab. "It reaffirms our dedication to providing the best solutions to our users. We understand the critical role identity verification plays in today's digital landscape, and PixLab strive to offer a product that is both reliable and easy to use".

As PixLab continues to innovate, the industry and its users can expect even more advanced features and enhanced user experience in the future. With its eyes set on global expansion, PixLab is on a trajectory to redefine identity verification standards worldwide.

For more information about PixLab and its award-winning Document Scanner, please visit PixLab's website.

Press Release - PixLab Introduces Groundbreaking Document Scanning API for KYC and ID Verification

FOR IMMEDIATE RELEASE

PixLab Symisc

PixLab, a leading provider of advanced image and video analysis solutions, is thrilled to unveil its newly redesigned and highly innovative Machine Learning-based document scanner engine, developed in-house.

This groundbreaking technology is specifically designed for Know Your Customer (KYC) and ID verification tasks, offering customer onboarding solutions that go beyond standard KYC and Anti-Money Laundering (AML) checks. The PixLab DOCSCAN API Endpoint empowers organizations to boost conversions, reduce fraud, and maintain global compliance effortlessly.

The PixLab DOCSCAN API Endpoint revolutionizes the way government-issued documents, including Passports, Visas, U.S Driver License, and ID cards from various countries including but not limited to Malaysia, Singapore, India, and Emirates, are scanned and verified. With a single call, organizations can effortlessly scan and extract critical information from these documents. The API endpoint also features automatic face extraction, enhancing the accuracy and completeness of the scanning process.

Below, the DOCSCAN API endpoint output for a typical Input Passport Image:

Input Passport Specimen (JPEG/PNG/BMP Image or PDF Upload) Input Passport Specimen

And, the extracted Passport (MRZ) Fields Extract Output

One of the key highlights of the DOCSCAN API endpoint is its ability to transform binary data, such as Passport Machine Readable Zone (MRZ), into a stream of textual content in JSON format. This includes extracting crucial details such as the Full Name, Issuing Country, Document Number, Date of Birth & Expiry, etc.. This seamless integration of the extracted information into your application allows for streamlined and efficient processes, reducing manual effort and eliminating errors.

PixLab takes document scanning and verification to the next level by offering additional features that help identify possible fraudulent documents. The DOCSCAN API endpoint's automated face scanning capabilities, combined with its MRZ extraction functionality, enable developers to automate passport scanning while maintaining stringent security standards. This empowers organizations to protect against fraudulent activities and maintain the integrity of their processes.

Input ID Card Specimen from Malaysia (MyKAD) (JPEG/PNG/BMP Image or PDF Upload) MyKAD Specimen

And, the extracted MyKAD Fields including Face Extraction, Date Of Birth, Full Name, Address, Religion, and ID Number

Extracted MyKAD Fields

In today's increasingly digitized era, the need for automation and efficiency is paramount. Manual and repetitive administrative tasks can be time-consuming, error-prone, and costly. By leveraging the power of the PixLab DOCSCAN API endpoint, organizations can automate passport processing, resulting in substantial cost savings, accelerated customer on-boarding, and enhanced accuracy in administrative processes.

Finally, to learn more about PixLab's DOCSCAN API endpoint and its comprehensive features, please refer to the previous blog posts & code samples:

  1. Passports, Travel Documents & ID Cards Scan API Endpoint Available
  2. Full Scan Support for Malaysia and Singapore ID Cards
  3. Full Scan Support for United Arab Emirates (UAE) ID/Residence Cards
  4. PixLab’s Document Scanner now able to scan Driving License issued from any U.S. state
  5. Implement a Minimalistic KYC Form & Identify Verification Check
  6. Step-by-step guide to do e-KYC in your app
  7. Streamlining KYC with PixLab Document Scanner API

Code Samples

  1. Scan a government issued Passport document using the PixLab API. Extract the user's face and display all MRZ fields (PHP Code)
  2. Scan a government issued Passport document using the PixLab API. Extract the user's face and display all MRZ fields (Python Code)
  3. Scan Malaysia ID Card (MyKAD) (PHP Code)
  4. Scan government issued ID card from Malaysia (MyKAD, MyKID)., extract the user face and parse all fields (Python Code)

About PixLab

PixLab is a leading provider of advanced image and video analysis solutions, leveraging state-of-the-art technologies such as Machine Learning and Artificial Intelligence. With a commitment to innovation, PixLab empowers organizations across industries to automate and streamline their image and video processing workflows. The company's robust APIs and developer-friendly tools enable businesses to extract valuable insights, perform accurate face recognition, analyze emotions, detect objects, and much more.

PixLab’s Document Scanner now able to scan Driving License issued by any U.S. state

The PixLab Optical Character Recognition team is thrilled to announce that its document scanning API endpoint /DOCSCAN, is now able to scan U.S. Driver’s licenses and driving permits issued by jurisdictions from all the 50 U.S. states.

DOCSCAN API endpoint now supports scanning US driver’s license from all 5O states

The /DOCSCAN API endpoint now allows any Website that is presented with a U.S Driver’s License, International Passport or ID Card to verify that the inputted information by the end user matches those present on the submitted or uploaded ID document image.

Usage & Code Samples

Given an input U.S driver’s license image issued by any of the 50 U.S. states, crop the license holder face, and extract fields of interest as follow:

Input U.S driver’s license image Car Vectors by Vecteezy Input U.S driver’s license image

Extracted Fields Showcase extracted fields from the submitted driver's image

The extracted fields after successful call to the /DOCSCAN API endpoint are:

  • License holder cropped face. This image will be stored on an AWS S3 bucket of your choice if you connect your target bucket from the PixLab Console.
  • Issuing Country (USA obviously).
  • Issuing State Name.
  • Issuing State Two-Letter Code.
  • License Number.
  • License Holder’s Full Name.
  • License Holder’s Address.
  • License Holder’s Date of Birth (yyyy-mm-dd).
  • License Issuing Date (yyyy-mm-dd).
  • License Expiry Date (yyyy-mm-dd).
  • License Holder’s Gender.

The code samples that is used to achieve such results are available via the following Gists:

Algorithms Under the hood

  • Face extraction is automatically performed using the /FACEDETECT API endpoint.
  • /DOCSCAN already supports GET & POST HTTP methods so you can upload your document images directly from your application without relying on a foreign server. Refer to this Gist on how to do so.
  • Upon the image processed on our server, it is automatically deleted. We do not keep trace or any log of your input images.
  • Internally, we mainly rely on PP-OCR which is a practical ultra-lightweight OCR system that is mainly composed of three parts: Text Detection, Bounding Box Isolation, & Text Recognition. This combination produces highly accurate results in less than 5 seconds of processing.

Further Reading

  • The official endpoint documentation is available to consult here, and the reader is more than encouraged to take a look at our production ready, code samples, available in various programming languages on the PixLab Code Samples Page.

Full Scan Support for United Arab Emirates (UAE) ID/Residence Cards

The PixLab Document Scanner, development team is pleased to announce that is now fully support scanning Emirates (UAE) ID & Residence Cards via the /DOCSCAN API endpoint at real-time using your favorite programming language.

When invoked, the /DOCSCAN HTTP API endpoint shall Extract (crop) any detected face and transform the raw UAE ID/Residence Card content such as holder name, nationality, ID number, etc. into a JSON object ready to be consumed by your app.

Below, a typical output result of the /DOCSCAN API endpoint for an Emiratis (UAE) ID card input sample:

Input Emirates (UAE) ID Card

UAE ID card specimen

Extracted UAE ID Card Fields

UAE extracted fields

The code samples used to achieve such result are available to consult via the following gists:

The same logic applies to scanning official travel documents like Visas, Passports, and ID Cards from many others countries in an unified manner, regardless of the underlying programming language used on your backend (Python, PHP, Ruby, JS, etc.) thanks to the DOCSCAN API endpoint as shown in previous blog posts:

Algorithm Details

Internally, PixLab's document scanner engine is based on PP-OCR which is a practical ultra-lightweight OCR system, mainly composed of three parts: DB text detection, detection frame correction, and CRNN text recognition. DB stands for Real-time Scene Text Detection.

PP-OCR: A Practical Ultra Lightweight OCR System - Algorithm Overview

PP-OCR Algorithm Overview

The system adopts 19 effective strategies from 8 aspects including backbone network selection and adjustment, prediction head design, data augmentation, learning rate transformation strategy, regularization parameter selection, pre-training model use, and automatic model tailoring and quantization to optimize and slim down the models of each module.

In PP-OCR, Differentiable Binarization (DB) is used as text detector which is based on a simple segmentation network. It integrates feature extraction and sequence modeling. It adopts the Connectionist Temporal Classification (CTC) loss to avoid the inconsistency between prediction and label.

The algorithm is further optimized in five aspect where the detection model adopts the CML (Collaborative Mutual Learning) knowledge distillation strategy and CopyPaste data expansion strategy. The recognition model adopts the LCNet lightweight backbone network, U-DML knowledge distillation strategy and enhanced CTC loss function improvement, which further improves the inference speed and prediction effect.

Modern Passport Structure & Bulk Scan APIs

A Passport is a document that almost everyone has at some point in their lives. It is issued by the country’s government to its citizens and mainly being used for traveling purposes. It also serves as proof of nationality, name, and more importantly an Universally Unique ID for its owner.

Modern Passport Structure

Passport Specimen

Many services have been long-time accepting passports as identification documents from their customers to complete their KYC (Know Your Customer) form as required by the legislation in force. This is especially true and enforced for the Finance, HR or Travel sectors. In most cases, a human operator will verify the authenticity of the submitted document and grant validation or reject it.

Things can get really complicated if you have hundreds of KYC forms to checks, but also if your clients differ in nationality. Quickly, you will find yourself drowning in physical copies of passports in different languages that you can not even understand. Let alone the potential legal problems you can face with passport copies laying around the office. This is why, an automated & safe solution for Passports processing is required!

Modern Passport Structure

From the 1980s on wards, most countries started issuing passports containing an MRZ. MRZ stands for the Machine Readable Zone and is usually located at the bottom of the Passport as shown below:

Modern Passport Specimen

Passports MRZ Sample

Passports that contain an MRZ are referred to as MRPs, machine-readable passports (Almost all modern issued Passports have one). The structure of the MRZ is standardized by the ICAO Document 9303 and the International Electro-technical Commission as ISO/IEC 7501-1.

The MRZ is an area on the document that can easily be read by a machine using an OCR Reader Application or API. It’s not important for you to understand how it works, but if you look at it carefully, you will see that it contains most of the relevant information on the document, combined with additional characters and a checksum that can be extracted programmatically and automatically via API as we will see in the next section.

Once parsed, the following information are automatically extracted from the target MRZ and made immediately available to your app, thanks to the /docscan API endpoint:

  • issuingCountry: The issuing country or organization, encoded in three characters.
  • fullName: Passport holder full name. The name is entirely upper case.
  • documentNumber: This is the passport number, as assigned by the issuing country. Each country is free to assign numbers using any system it likes.
  • checkDigit: Check digits are calculated based on the previous field. Thus, the first check digit is based on the passport number, the next is based on the date of birth, the next on the expiration date, and the next on the personal number. The check digit is calculated using this algorithm.
  • nationality: The issuing country or organization, encoded in three characters.
  • dateOfBirth: The date of the passport holder's birth in YYMMDD form. Year is truncated to the least significant two digits. Single digit months or days are perpended with 0.
  • sex: Sex of the passport holder, M for males, F for females, and < for non-specified.
  • dateOfExpiry: The date the passport expires in YYMMDD form. Year is truncated to the least significant two digits. Single digit months or days are perpended with 0.
  • personalNumber: This field is optional and can be used for any purpose that the issuing country desires.
  • finalcheckDigit: This is a check digit for positions 1 to 10, 14 to 20, and 22 to 43 on the second line of the MRZ. Thus, the nationality and sex are not included in the check. The check digit is calculated using this algorithm.

Automatic Passport Processing

PixLab Logo

Fortunately for the developer wishing to automate Passports scanning, PixLab can automatically scan & extract passport MRZ but also help to detect possible fraudulent documents. This is made possible thanks to the /docscan API endpoint which let you in a single call scan government issued documents such as Passports, Visas or ID Cards from various countries.

Besides extracting MRZ, the /docscan API endpoint shall automatically crop any detected face and transform binary Machine Readable Zone into stream of text content (i.e. full name, issuing country, document number, date of expiry, etc.) ready to be consumed by your app in the JSON format.

Below, a typical output result of the /docscan endpoint for a passport input image:

Input Passport Specimen (JPEG/PNG/BMP Image)

Input Image URL

Extracted MRZ Fields

MRZ Fields

What follow is the gist used to achieve such result:

Other document scanning code samples are available to consult via the following Github links:

Face extraction is automatically performed using the /facedetect API endpoint. For a general purpose Optical Character Recognition engine, you should rely on the /OCR API endpoint instead. If you are dealing with PDF documents, you can convert them at first to raw images via the /pdftoimg endpoint.

Conclusion

The era we are in is more digitized than ever. Tasks that are repetitive are slowly being replaced by computers and robots. In many cases, they can perform these tasks faster, with a smaller amount of mistakes and in a more cost-effective manner. At PixLab we focus on building software to replace manual repetitive labor in administrative business processes. The processing and checking of passports can be very time-consuming. Using /docscan to automate your passport processing will enable you to save cost, on-board customers faster and reduce errors in administrative processes.

PixLab API 1.9.72 Released!

The PixLab development team is pleased to announce the immediate availability of the PixLab API 1.9.72.

PixLab Logo

Since its launch on 2017, PixLab have already processed over 450 Millions of users contents whether static images, GIF or Videos Frames. This milestone release introduces new API endpoints, various minor bug fixes, processing speed improvements by up to 5% and many innovative features. Let's start with the one we are existed about in no particular order:

  • Passports & ID Cards Scan: While documents scanning were introduced in earlier version of the PixLab API via the /docscan endpoint. This release pushes further the accuracy of the OCR engine. A 5MB raw Passport sample now takes less than 3 seconds to execute including face detection & extraction, MRZ (Machine Readable Zone) extraction and finally transformation of the Raw MRZ data into textual content. You can try out the accuracy of the Passport scanning engine using these Python and PHP scripts to see it in action.
  • DNS infrastructure moved to Cloudflare for faster than ever response times.
  • Full support for HTTP/2 and HTTP/3 (QUIC).
  • Up to three layers of redundancy for the standard PixLab OCR engine for faster, accurate & guaranteed scan results.
  • A fresh update of the adult & gore content detection ML model which is used to power the famous PixLab /NSFW API endpoint that have already analyzed over 100 millions of user contents with high accuracy.
  • Face Detection (including facial landmarks extraction) & Emotion Pattern (including gender & age) extraction are now using the RetinaFace Model which scores the highest on the LFW dataset.
  • The /docscan API endpoint now fully support scanning ID cards from Malaysia & Singapore and many other countries (at users request) as well the brand new India Aadhar ID card. Find out more information about Aadhard fully support via our blog announcement here.
  • Finally, a brand new, high performance custom image processing layer written in C/C++ and powered by ImageMagick and our Embedded computer Vision Library SOD is integrated directly into our cloud API.

Pixlab customers are more than advised to take a look at The official API endpoints documentation, The Samples Page, The Github repository for additional information.

Finally, for potentially interested users, you are more than welcome to start a 7 days free trial to see the API in action. Simply head to the PixLab Dashboard and activate your free trial from there.

PixLab Logo