27 Feb 2021»
In this article, I wrote my LeetCode algorithm with Rust, a wonderful programming language.
01 May 2020»
A stable and scalable data pipeline is important in modern software platform. In this article, a computational data pipeline is built with Apache Spark and Apache Airflow.
27 Apr 2020»
In this blog, Hash table, as an implementation of associative array, is described in details. Several associative array's Hash table implementations are also described in details as examples.
10 Apr 2020»
Shuffle is a crucial component in the distributed computing system. In this article, multiple shuffle algorithms applied in Apache Spark are described briefly.
26 Mar 2020»
COVID-19 is one of the most serious pandemic in the history of human. Both government and people can benefit a lot from an effective data monitoring system on this virus. In this blog, a COVID-19 monitoring dashboard is developed based on RStudio Shiny Project.
17 Mar 2020»
In this blog, the architecture of JVM (Java virtual machine) is described along with brief introduction of Java platform.
10 Mar 2020»
In this article, the definition and usage of TCP sequence and acknowledgment numbers are described. An client-server sample sample are demonstrated with network traffic analysis using Wireshark.
03 Mar 2020»
TensorBoard, as a visualization tool, is introduced to monitor Tensorflow model training in this blog. An example of TensorBoard visualization is demonstrated with both default and self-defined features.
20 Feb 2020»
In this article, a brief introduction on gRPC and benefits of using gRPC are discussed in the first two sections. Then, a sample repo with instruction is provided to setup gRPC server and client in several programming languages.