Author: Minhaz

Efficiently writing large objects to Azure Blob Storage – JSON, Compression, Caching, Streaming in .Net

Azure Storage Blob Azure Storage Blob is an Azure Storage offering that allows you to store GigaBytes of data in from hundreds to billions of objects in hot, cool, or archive tiers, depending on how often data access is needed. Store any type of unstructured data—images, videos, audio, documents and more—easily and cost-effectively. Reference: https://azure.microsoft.com/en-in/services/storage/blobs/ In the

Compress sequence of UNIX timestamps with microseconds accuracy to ~10bits/timestamp

Problem Setup: There is a stream of timestamps that need to be transferred across some network stream; Goal is to compress this sequence of timestamps (with microseconds accuracy, lossless). Also the encoding and decoding process should be very fast so that it can scale for time critical processes; Some assumptions: The data to be transmitted

Introducing minor improvements to CSRF Protector PHP

The OWASP CSRFProtector project started with an aim to develop a solution that can mitigate Cross Site Request Forgery in web applications without much developer effort. The most common solution for mitigating CSRF is using a token which cannot be retrieved by the attacker, thus ensuring the authenticity of the incoming request to the server.

A fault tolerant distributed key value store from scratch

We had a B-Tech course on Distributed Systems and I took a course on on Cloud Computing Concepts 1 by Dr Indranil Gupta (UIUC) a year back and for long, I have been thinking about trying out different concepts explained in the course together as something meaningful. There are assignments in the course which you need to finish, but they

Experiment : Supervised learning model to classify a Github Issue as enhancement or bug based purely on issue title

Quick Summary: I Mined more that 1,00,000 Issue data from Github open source repositories. Mined data included { issue title – string}, {issue description – paragraph } and {labels – discrete strings}. Most of them were enhancement or bug. So started with a simple classifier which classifies an issue as Enhancement or Bug based on issue title.