With the rough introduction from previous post, I’d like to elaborate the project I did over the summer and past two weeks.
Started in July, me and other four college students putted our hands on the planning of a Credit Investigation System which is a system that helps banks to determine a specific person’s credibility when banks are asked to lend money to he or she. In the traditional Credit investigation system, the credit degree is mainly depended on the evaluation of the combination of users’ occupation, family situation, social relationship and other elements. Our team, however, decided to cut into users’ daily consume from analyzing their location information and the location’s average cost, and then build the credit system from those analyses. Because, we believed, one’s daily consumption could effectively reflect one’s consuming ability and, correspondingly, one’s loan repayment ability. The estimate target customers of our project are banks and private organizations that need relevant data to manage their potential customers (like Alibaba or Tencent). Basically we are selling data and the way to manipulate data. The system is called TraceCloud.
The basic working model TraceCloud is presented as the following graph:
We grab location information from users’ cell phones to servers, run our algorithms to generate the credit degrees and then send them to our database. The algorithm is the core part of the whole project. Being part of the algorithm team, I started by analyzing weights of different types of consuming. We divided basic consumes into seven parts: car service, diner service, shopping service, sports and relaxation service, medical service, travel, and living service. Since it’s hard for us to grape precise data for medical, and travel service based on users’ location, we mainly depended on the rest five to determine the weight. For example, for diner service, we applied data released by government about the average cost’s division and average salaries’ division to generate the final weight. Data table is presented below:
|Cost on meal per person per month (yuan)||Average salary (yuan)||Weight after normalizing|
With those weights, we were allowed to roughly compute the credit degree of a person based on his or her dinning manner and frequent dinning location.
The function we used to calculate weights was:
This project opened a relative new way for me to start a project since TraceCloud is a product that need many data to support its operation. Used to be a person who played around database, I discovered during the project that I was actually lack of many knowledge in data managing and researching. Hope during the time when we start running the product and collecting enormous data sets, I can learn more about database. This is what I have for this week.
Citation: No work cited. All images are from the real project.