Databases Administration

Back to Basics: ORM and Its Impact on Database and Data Architecture

Posted on June 3, 2026 by webgeek

TL;DR: Object-Relational Mapping (ORM) makes it easier for developers to work with databases by automatically translating between code and data. ORMs boost productivity and code maintainability but can introduce performance and data integrity challenges if not managed carefully. Using complementary tools (such as caching, connection pooling, and migration frameworks) and following best practices helps teams…

MySQL Error: 1062 'Duplicate entry' Error

Posted on January 5, 2018September 20, 2025 by webgeek

The all too common MySQL ‘Duplicate entry’ Error. mysql> show slave status\G; *************************** 1. row *************************** Slave_IO_State: Waiting for master to send event Master_Host: master-mysql.local Master_User: repl Master_Port: 3306 Connect_Retry: 60 Master_Log_File: mysql-bin.004768 Read_Master_Log_Pos: 1022786917 Relay_Log_File: relay-bin.001728 Relay_Log_Pos: 929659721 Relay_Master_Log_File: mysql-bin.004768 Slave_IO_Running: Yes Slave_SQL_Running: No Replicate_Do_DB: Replicate_Ignore_DB: information_schema,mysql Replicate_Do_Table: Replicate_Ignore_Table: Replicate_Wild_Do_Table: Replicate_Wild_Ignore_Table: Last_Errno: 1062 Last_Error:…

Installing MariaDB 10.1 on CentOS 6.8

Posted on March 3, 2017September 20, 2025 by webgeek

MariaDB is a fork of the MySQL; it is notable for being led by the original developers of MySQL and is community-developed. The original developers forked it due to concerns over its acquisition by Oracle. MariaDB intends to be a “drop-in” replacement for MySQL, ensuring capability with library binary and matching with MySQL APIs and…

Querying Apache Hadoop Resource Manager with Python.

Posted on November 14, 2016September 20, 2025 by webgeek

Querying Apache Hadoop Resource Manager with Python. I was recently asked to write a script that would monitor the running application on the Apache Hadoop Resource Manager. I wonder over to the Apache Hadoop Cluster Application Statistics API. The API allows to query most of the information that you see in the WEB UI. Information…

Resizing InnoDB Logs

Posted on August 26, 2016September 20, 2025 by webgeek

If you have already created your database and you change the setting for “innodb_log_file_size=###M” and restart here you database and get an error that looks something like InnoDB: Error: log file ./ib_logfile0 is of different size 0 5242880 bytes Here what you need to do. 1.) Make sure your database shutdown clean. 2.) Move(not delete)…

MongoDB Script for counting records in collections in all the databases

Posted on April 6, 2016September 20, 2025 by webgeek

Here is a quick script. I wrote for a co-worker. var host = “localhost” var port = 27000 var dbslist = db.adminCommand(‘listDatabases’); for( var d = 0; d < dbslist.databases.length; d++) { var db = connect(host+”:”+port+”/”+dbslist.databases[d].name); var collections = db.getCollectionNames(); for(var i = 0; i < collections.length; i++){ var name = collections[i]; if(name.substr(0, 6) !=…

Apache Oozie – Shell Script Example.

Posted on October 20, 2015September 20, 2025 by webgeek

Recently I needed the ability to allow a user to submit jobs that required them to pass arguments to a shell script. While it’s easy enough to submit a job using a Web UI like HUE. I wanted to tie it to a homegrown SaaS solution that we were developing to allow developers to load datasets…

Installing MonetDB on CentOS/RedHat.

Posted on July 22, 2015September 20, 2025 by webgeek

Hello world!

Connecting Tableau to DataStax Cassandra with Cassandra CQL ODBC.

Posted on July 16, 2015September 20, 2025 by webgeek

Recently, I did some testing with Tableau Desktop connecting to DataStax Cassandra using their newly released DataStax ODBC driver. Before the release of the DataStax ODBC driver, the only way to connect Tableau Desktop to DataStax was the DataStax Enterprise Connector (a.k.a. Hive Thrift Server). While Hive is a great analytic tool, it is somewhat slow. When…

Connecting Tableau to Google Cloud SQL

Posted on July 14, 2015September 20, 2025 by webgeek

Before connecting your Tableau Application up to your Google Cloud SQL Instance, you will need to make sure that you have assigned an IP Address to the instance. You will also need to allow the network in which your Tableau application is located, access to the Google Cloud SQL Instance. First, I recommend that you…