Browsed by
Author: Afzal Badshah, PhD

Introduction to Natural Language Processing (NLP)

Introduction to Natural Language Processing (NLP)

Natural Language Processing (NLP) is a subfield of artificial intelligence (AI) that focuses on the interaction between computers and humans through natural language. The ultimate goal of NLP is to enable computers to understand, interpret, and generate human language in a way that is both meaningful and useful. Detailed tutorial on Data Science can be visited here. Key areas of NLP Text Analysis: Text analysis involves understanding and processing textual data to extract meaningful information. For example, in social media…

Read More Read More

Using Google Sheet API with Python

Using Google Sheet API with Python

Here is the step-by-step guide to connecting Google Sheets to any Python program. In this tutorial, we connect it to Google Collab, but you can connect it to any application with the following the below points. Requirement: 1. Create a New Google Cloud Platform (GCP) Project 2. Enable Google Sheets API 3. Create a New Service Account 4. Download the JSON File Program side: Installed the required libraries Upload the JSON file, which you downloaded from the google API to…

Read More Read More

Ethics and Privacy in Data-Driven Applications

Ethics and Privacy in Data-Driven Applications

The world is driven by Data. From our social media interactions to our fitness trackers, an ever-growing stream of information is generated every second. This vast ocean of data, known as big data, holds immense potential for innovation and progress. Businesses use it to personalize advertising, develop new products, and optimize operations. Researchers leverage it to understand diseases, track climate change, and make scientific discoveries. A detailed tutorial can be found here. However, the power of big data comes with…

Read More Read More

Parallel Programming Models: SIMD and MIMD

Parallel Programming Models: SIMD and MIMD

With the ever-changing landscape of computing, the demand for faster and more efficient processing of big data has become necessary. Traditional sequential programming paradigms are often insufficient to meet these demands, demanding parallel programming techniques. Parallel programming programs the system to allow multiple tasks to be executed simultaneously, leveraging the capabilities of modern parallel hardware architectures. Visit the detailed tutorial on parallel and distributed computing here. Among these models, SIMD (Single Instruction, Multiple Data) and MIMD (Multiple Instruction, Multiple Data)…

Read More Read More

Real-Time Data Processing: Taming the Flood of Data from a Connected World

Real-Time Data Processing: Taming the Flood of Data from a Connected World

Traditional data processing methods, which involve collecting and analyzing data in batches at specific intervals, are simply struggling to keep pace with this ever-increasing data flow. This is where real-time data processing comes into play. It’s a method of processing data streams at near-instant rates, enabling organizations to gain insights and make decisions based on the latest information as it becomes available. Visit the detailed tutorial here. Real-Time Data Processing Real-time data processing is a method of analyzing and interpreting…

Read More Read More

Matrix Multiplication on Multi-Processors: MPI4PY

Matrix Multiplication on Multi-Processors: MPI4PY

In this scenario, each processor handles a portion of the matrices, performing computations independently, and then the results are combined to obtain the final result. This parallelization technique leverages the capabilities of multiple processors to expedite the overall computation time.  Code: Explanation Import MPI Module and Initialize MPI Environment This line imports the MPI module from the mpi4py package, enabling the use of MPI functionalities. These lines initialize the MPI environment. MPI.COMM_WORLD creates a communicator object representing all processes in…

Read More Read More

Data Modeling and Schema Design in MongoDB

Data Modeling and Schema Design in MongoDB

Data modelling and schema design are pivotal aspects of MongoDB database management, crucial for structuring data effectively to meet application requirements. In this tutorial, we’ll explore the fundamentals of data modelling and schema design in MongoDB through practical examples set in a Pakistani context. Visit the detailed tutorial here. Data Model Design Modeling in NoSQL refers to the process of designing how data will be structured and organized within a NoSQL database. Unlike traditional relational databases, NoSQL databases offer more…

Read More Read More

Parallel Summation using MPI in Python with mpi4py

Parallel Summation using MPI in Python with mpi4py

Parallel summation involves distributing the task of summing a large set of numbers across multiple processors or computing nodes, enabling simultaneous computation and aggregation of partial results. Each processor handles a portion of the data, performs local summation, and then communicates its partial sum to a designated root processor. The root processor collects and combines these partial sums to compute the global sum, thereby leveraging parallelism to accelerate the computation process and efficiently handle large-scale data sets. In this tutorial,…

Read More Read More

Parallel Programming Languages and Tools: MPI, OpenMPI, OpenMP, CUDA, TBB

Parallel Programming Languages and Tools: MPI, OpenMPI, OpenMP, CUDA, TBB

In the age of ever-growing devices, massive data and complex computations, the power of multiple processors simultaneously has become crucial. Parallel programming languages and frameworks provide the tools to break down problems into smaller tasks and execute them concurrently, significantly boosting performance. This guide introduces some of the most popular options: MPI, OpenMPI, CUDA, TBB, and Apache Spark. We’ll explore their unique strengths, delve into learning resources, and equip you to tackle the exciting world of parallel programming. Message Passing…

Read More Read More

MPI: Concurrent File I/O for by Multiple Processes

MPI: Concurrent File I/O for by Multiple Processes

In this tutorial, we’ll explore an MPI (Message Passing Interface) program using mpi4py to demonstrate how multiple processors can collectively write to and read from a shared file. The detailed tutorial of MPI with a python can be visited here. Code Code Explanation Imports the necessary MPI module from mpi4py which provides bindings for MPI functionality in Python. Initializes MPI communication (comm) for all processes (MPI.COMM_WORLD). rank is assigned the unique identifier (rank) of the current process, and size represents…

Read More Read More