Massoud Mazar | All posts by admin

Taking on Gemma 3 27B with a Titan RTX

5. April 2025 23:26 / Administrator / / Comments (0)

Google's recent Gemma 3 model release is definitely making waves. While the LLM hype train never stops, what really grabbed my attention wasn't just the claimed power, but the benchmarks showing Gemma 3 rivaling competitors like DeepSeek. Seeing it hold its own, especially the smaller parameter versions, felt like a real step towards more accessible AI power.

The 27B parameter version particularly caught my eye. Could my existing hardware actually handle it? Challenge accepted! More...

7480be49-d298-4bd0-9dcd-23f04017226a|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Simple Ansible filter plugin

2. March 2022 10:36 / Administrator / / Comments (0)

For a recent project, I used Ansible to configure a bunch of servers. One of the servers needed to be configured as NIS server and part of that configuration is to execute ypinit. When running ypinit script, it asks for some user input and requires users enter CTRL-D when all server names are entered. To automate this through Ansible, I had to somehow pass the CTRL-D (character x04) to stdin, but jinja2 or Ansible did not have a an easy way to do that, so I used a filter plugin.More...

dd0eb913-55d3-4bc2-8374-65de4ecb4cb6|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Setting Postgres transaction isolation level through Flask-SqlAlchemy

24. February 2021 12:45 / Administrator / / Comments (0)

Recently I had to deal with a concurrency issue in a legacy Flask application which did not enforce uniqueness. Multiple API clients trying to create a record in the database with the same value in 'name' column would end up creating duplicate entries. Applying a unique constraint to the name column required cleaning the duplicates first, and cleaning up data was not easy, so I decided to first implement a workaround to prevent more duplicates, and then go about cleaning up the data.

The solution was to use SET TRANSACTION ISOLATION LEVEL SERIALIZABLE statement in Postgres to lock the table and first check to see if the record exists, before inserting a new one.More...

402ab7ea-8727-440b-b2b8-25527660d72a|2|5.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Ship Prefect logs using filebeat

29. January 2021 14:51 / Administrator / / Comments (0)

After few years of working with Airflow for job scheduling, I have settled with Prefect these days. It fixes all problems I have had with Airflow and is the best upgrade in my opinion.

After implementing our logging and monitoring using Logz.io which is a hosted ELK solution, I found it useful to ship the logs from our Prefect flow executions to Logz.io to benefit from centralized logging, monitoring and alerting. More...

cb7001b3-f227-4093-a51f-c4f57b3f37f6|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

VPN DNS resolution problem with CNAME

24. September 2020 14:48 / Administrator / / Comments (0)

This problem took me a day to fix and it worth documenting here for others.

On my Mac laptop, I connect to a VPN, which has its own DNS server defined to resolve internal host names. For example, I could use

nslookup host.foobar.net

and get a good response. So far so good. But when I tried

nslookup host.sub.foobar.net

I did not get a response. After talking to IT team, it turned out this address was a CNAME record and not a A record. Still, I expected the DNS resolution to correctly return an IP address for this host, but it didn't. More...

5a3d083d-a44b-4517-8b1d-c1315803738a|2|5.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Raspberry Pi SMB server to use with Time Machine

23. August 2020 14:17 / Administrator / / Comments (0)

Last week I had a bit of free time (which is very rare these days), and decided it is finally time to build a file server to be used for backing up my laptops (both Mac and PC), and also as a general purpose shared drive. After doing some research I learned Apple supports SMB protocol for Time Machine, and SMB is obviously compatible with Windows as well.

My criteria to select the hardware was simple:

Gigabit Ethernet
USB 3
Support for Ubuntu

More...

95a6c6db-955e-4bd4-822c-bb080dc9a551|2|5.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Mobile Sensors: Easy data collection, labeling and model deployment

17. October 2019 10:11 / Administrator / / Comments (0)

Disclaimer: My targets for this article are data scientists which may not be necessarily coming from a software engineering background. Pardon me if you find this over simplified.

Mobile devices provide a rich set of sensors to allow us get a feel of where the device is being used and how. Sensors can tell us about environment, motion and orientation of the device, among other things. A list of sensors supported by Android can be found here.

There are a lot of cool applications for data coming from these sensors and a lot of those applications could benefit from machine learning models to infer more meaning from the sensor data. A famous example is the classic Human Activity Detection using mobile phones which can be found easily on the internet. But what if you need to collect data and label it for a different purpose? Here I show an easy way to build a mobile app which can run on both Android and iOS for both data collection and testing of the trained model. More...

ec2ab4d2-3ccc-4496-af6d-d4ebe54004c3|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Kafka stream processing: lookup against hive data

10. September 2019 15:55 / Administrator / / Comments (0)

Here is a scenario which in my opinion should be very common:

Suppose you need to build an ETL kafka stream which read data from one stream and checks it against a blacklist before writing to destination stream. This blacklist gets updated daily, andhas the same key as your source stream. One way to implement this is to use a Kafka Table (ktable) and join your stream with the table to find the matches.More...

2f613b01-daa5-4cbe-8aba-1fb971ab4df5|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Ingesting 250 million daily IoT messages with Hadoop and Hive 3.0 in Azure: Lessons Learned

16. December 2018 11:10 / Administrator / / Comments (0)

250 Million records per day may not be a lot of data for large environments with billions of users, but those companies have huge budgets and countless servers to do it. It's a different story in startup world and you have to squeeze the resources to get the job done with less budget. I will highlight what I learned during optimization of an analytics backend which was designed based on Azure HDInsight. Is Hadoop+Hive most suitable for this purpose is a question for another time and I'm not advocating these technologies, but if you are dealing with them, specially on Azure cloud, I hope this post save you some time.More...

cb0a774c-5e82-47e5-a51e-e18a1b0447e9|4|5.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :

Custom Hadoop RecordReader to read JSON with no line breaks

19. October 2018 13:46 / Administrator / / Comments (0)

This past week I had to deal with loading few terra bytes of data into our Spark cluster. This data is stored in a JSON array, and there is no line break to separate individual JSON objects. Spark can easily deal with JSON, but your JSON must be one object per line. I had to write a custom Hadoop RecordReader to work around this issue.More...

81b504c0-b61a-4110-866f-a7429a8735c4|0|.0|96d5b379-7e1d-4dac-a6ba-1e50db561b04

Tags :