chenge@sjtu.edu.cn
Master's Candidate (2013-2016)
Institute of Wireless Communications
I am currently doing research on mining heterogeneous data flows, including data from online social networks. I have some preliminary experience in data mining with Python and Apache Spark.
-- This project aims to extract collaborative behavior features from massive records of messages published on Sina Weibo. We developed several network data analysis tools based on the MapReduce framework and implemented them on Apache Spark. Selected sample statistics are displayed.
-- This project provides a flexible web data crawler for social network studies. It is written in Python with the lightweight framework "Scrapy". Processing is accelerated by a multi-threaded backend and Redis, a free NoSQL database. Source code is included.