You talk like a delinquent

This is interesting. Not sure how robust the finding is, but according to an analysis of LendingClub data on all past loans, including descriptions of the use for the money, applicants using certain words in their descriptions are much more likely to default.

For our purposes define a Delinquency as either being late in your payments or having defaulted completely. The 10 words with the greatest p-values are below. […]

Word Loans With P(Delinquency|No word) P(Delinquency|Word) p-value
also
215
0.067
0.140
0.0004
need
608
0.062
0.105
0.0015
business
233
0.069
0.116
0.0038
live
91
0.070
0.154
0.0057
already
64
0.071
0.156
0.0059
other
285
0.068
0.112
0.0081
bills
223
0.067
0.135
0.0082
bill
279
0.066
0.125
0.0117
interest
660
0.081
0.053
0.0136

“Words and Credit Scores”, Social Science Statistics Blog
Not something I’ve studied, but I wonder if a neural network could successfully classify these loans?

3 thoughts on “You talk like a delinquent

  1. You should be able to use an off-the-shelf Baysian spam classifier for this. That said, there are certain spam words with much higher P-values than any of these!

Comments are closed.