The University of Science and
Technology of China
22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
2016/8/13-8/17, San Francisco
Portfolio selections in P2P lending: A Multi-Objective Perspective
Hongke Zhao1
1School
Qi Liu1โ
Guifeng Wang1
Yong Ge2
Enhong Chen1
of Computer Science and Technology, University of Science and Technology of China, {zhhk, wgf1109}@mail.ustc.edu.cn {qiliuql, cheneh}@ustc.edu.cn;
2Eller
1. Abstract
2.2. Multi-objective Assessments for Loans
P2P lending is an emerging wealth-management service for individuals, which allows lenders to directly bid and invest on the loans
created by borrowers. In these platforms, lenders often pursue multiple objectives (e.g., non-default probability, fully-funded
probability and winning-bid probability) when they select loans to invest. How to automatically assess loans from these objectives
and help lenders select loan portfolios is a very important but challenging problem. To that end, in this paper, we present a holistic
study on portfolio selections in P2P lending. Specifically, we first propose to adapt gradient boosting decision tree, which combines
both static features and dynamic features, to assess loans from multiple objectives. Then, we propose two strategies, i.e., weighted
objective optimization strategy and multi-objective optimization strategy, to select portfolios for lenders. For each lender, the first
strategy attempts to provide one optimal portfolio while the second strategy attempts to provide a Pareto-optimal portfolio set.
Further, we design two algorithms, namely DPA and EVA, which can efficiently resolve the optimizations in these two strategies,
respectively. Finally, extensive experiments on a large-scale real-world data set demonstrate the effectiveness of our solutions.
๏ฐ Non-default Probability (๐
๐ ) is the estimated probability that one loan may repay the principal and interest to lenders in
time.
๏ฐ Fully-funded Probability (๐๐ ) is the estimated probability that one loan may receive enough bids in its auction.
๏ฐ Winning-bid Probability (๐ถ๐ ) is the estimated probability that a lenderโs bid on this loan under current loan status will finally
success or participate after the loanโs entire auction.
Problem Statement
Given the lendersโ historical bidding records, and current being-auctioned loans VA={v1, ..., v|V A|}
in market, our goal is to select loan portfolios from VA for each active lender UA={u1, ..., u|UA|}. A
portfolio x=(x1, ..., x|V A|) is an optimal combination of multiple loans, if xj=1, the j-th loan in VA is
selected and put into the portfolio. For each lender ui, the selected portfolios should satisfy her
preference on rate expectation ๐
๐ ๐ข ๐ with minimum risk (maximal non-default probability) and
maximum transaction efficiency (fully-funded probability, winning-bid probability).
Framework Overview
โข
Auction Data
Historical Data
Loan
Historical Loans
All Lenders
Being-auctioned Loans
Recent Bids
Lender
Dynamic Feature Extraction
Model Training
Multi-objective
Assessments
โฆโฆ
โฆโฆ
Assessment Models
Portfolio
Selections
Weighted Objective Strategy
A Portfolio
DPA
โฆ
GBDT
Offline
Flow
Multi-objective Optimization Strategy
A Portfolio Set
EVA
โข
Active Lender Identification
โฆ โฆโฆ
Online
Flow
โข
Identifying active lenders
UA and learning their
preferences
on
rate
expectation,
๐
๐ ๐ข ๐ ,iโ{1,...,|UA|};
Assessing each beingauctioned loan vj โ ๐๐ด on
multiple objectives, (i.e.,
Non-default probability ๐
๐ ,
Fully-funded probability ๐๐ ,
Winning-bid probability ๐ถ๐ );
Selecting portfolios for all
active lenders.
2.1. Active Lenders and Rate Preferences
In summary, for each being-auctioned loan ๐ฃ๐ , we can adopt a three-element vector ๐ท๐ to denote its profile:
๐ท๐ = [๐
๐ , ๐๐ , ๐ถ๐ ]
How to estimate these profile
terms for given loans?
For borrowing money, a borrower will
๏ฌrst create a listing to solicit bids from 2.2.1. Features for Assessments
lenders by describing herself, the reason ๏ฐ Static features are given explicitly before auction, which are
of lending, the required amount and the extracted from the loanโs properties and the associated borrowerโs
maximal interest rate. Then, if a lender properties. These static features directly reflect the basic properties
wants to lend to this loan within its of loans and borrowers.
soliciting duration, a bid is created by ๏ฐ Dynamic features are extracted from the temporal auction and
describing both how much money she incremental bidding lenders of a loan, which are changing from time
wants to lend and the minimum interest to time during an auction. Dynamic features can reflect the
rate. If this listing receives more than its popularity of a loan.
required amount in its soliciting
duration, those bids with lower rates 2.2.2. Loan Assessments
will succeed/win, and other bids with
Gradient boosting decision tree (GBDT )
higher rates will be outbid/fail. In Training process for a specific objective:
Assessment:
contrast, if this listing canโt receive
After we get the models ๐บ๐ต๐ท๐ ๐ , i โ {1,2,3}, on three objectives.
enough bids in time, it would be expired
For each being-auctioned loan ๐ฃ๐ , we can get its estimations on
and all the previous bids would also failใ
three objectives, i.e., the terms in profile ๐ท๐ by the model output
probabilities on positive classes of these objectives
How to identify active
lenders in current market?
Active lenders are those who are most likely to
lend in the following period.
College of Management, University of Arizona, [email protected]
Computational complexity:
๐
O(max{๐ ๐ฝ๐จ ๐ฒ๐ ๐น๐๐๐, |๐ผ๐จ|๐ฒ๐ ๐น๐๐๐});
First loop: considers the second constraint and gets all the
1
optimal selections under any p, q; O( 2 ๐๐ด ๐พ 2 ๐
๐๐๐ฅ);
Second loop: gets the final optimal solution; ๐(๐พ 2 ๐
๐๐๐ฅ) for
each lender.
4. Experiments
4.1.Data analysis
4.2.Evaluations of loan assessments
4.3.Evaluations of portfolio selections
4.1.analyze lendersโ lending behaviors from multi-objective, portfolio perspective and identification of active lender
3. Portfolio Selections
How to select loan portfolios
for each active lender?
Selection strategies๏ผweighted objective strategy v.s. multi-objective optimization strategy
Selection algorithms๏ผDynamic programming v.s. Evolutionary algorithm
Selections for each lender๏ผone optimal portfolio v.s. Pareto-optimal portfolio set
Identifying active lenders in advance can achieve
accurate service of portfolio selection, also
improve the service efficiency and user
experience.
4.2. Methods: GBDT, GBDT_S; LR, LR_S; DT, DT_S
Results: (1)dynamic features can improve the
performances of all models significantly, especially on
the fully-funded objective prediction, i.e., about 10%
improvements on AUC; (2) GBDT and GBDT_S
outperform other models on three assessment tasks,
especially on the non-default objective with more than
10% improvements.
Indeed, most lenders often bid periodically, and
invest Lenders who bid in recent days (a time
range, TR, before current time CT) have higher
probabilities to invest in the following period.
Lender ui โs preference on rate expectation ๐
๐ ๐ข ๐ can be calculated as:
4.3. Methods: DPA series, EVA series, EVAR series, SELF
where ๐๐๐ is the loan set that lender ui invested in the past, ๐
๐ ๐ข ๐ is the declared interest rate of loan vj.
Complexity: ๐ถ(|๐ผ๐จ|๐ฎ๐ต๐ ), ๐ฎ :iteration generations.
Initialization: If ๐ดโ ๐ is the Pareto-optimal solution set for
lender ๐ข๐ , each solution ๐ฅ โ ๐ดโ ๐ must be a feasible solution
for lender uiโฒ , iโฒ โ {i + 1,...,|UA|}, and ๐ดโ ๐ is a good
approximation of the Pareto-optimal solution set of ui โs next
neighbor lender, i.e., ui+1 .
Repair: random greedy strategy
A weight vector ๐๐ = (๐๐1 , ๐๐2 , ๐๐3 ) from
lender ๐ข๐ to balance the importance of
three objectives
Results: EVA series perform best on Non-default, Fully-funded, Winningbid metrics respectively; EVA could provide lenders with more economic
profits (11.5%, 10.3% returns on Return-0.3 and Return-0.6 metrics).
© Copyright 2026 Paperzz