Towards July 8 I tried remapping ‘Unused Offer’ so you’re able to ‘Accepted’ inside the `previous_software

Towards July 8 I tried remapping ‘Unused Offer’ so you’re able to ‘Accepted’ inside the `previous_software

csv` however, spotted zero update in order to local Curriculum vitae. In addition experimented with creating aggregations depending just into Bare offers and you can Canceled now offers, but spotted no escalation in regional Cv.

Atm distributions, installments) to see if the client is growing Atm withdrawals due to the fact day went on, or if perhaps visitors is decreasing the minimum payment once the date went to your, an such like

I was reaching a wall structure. To your July 13, We decreased my training rates to help you 0.005, and you can my personal local Cv decided to go to 0.7967. The general public Pound was 0.797, and also the individual Pound is actually 0.795. It was the highest regional Cv I became able to find that have just one design.

Following model, We spent really time seeking to tweak the brand new hyperparameters here there. I attempted lowering the reading rates, choosing greatest 700 or 400 has actually, I attempted playing with `method=dart` to rehearse, dropped particular articles, replaced specific opinions having NaN. My personal rating never ever enhanced. I additionally tested dos,step three,4,5,six,eight,8 season aggregations, but nothing aided.

With the July 18 I created a separate dataset with more has to try and improve my personal score. There are they of the pressing right here, therefore the password to produce it from the clicking right here.

Into the July 20 I got an average out-of a couple of activities you to definitely were instructed on more big date lengths having aggregations and you may had public Lb 0.801 and personal Lb 0.796. I did so even more combines next, and several had highest for the individual Lb, but not one previously defeat individuals Pound. I tried and Genetic Programming keeps, target security, modifying hyperparameters, however, little helped. I tried utilising the situated-from inside the `lightgbm.cv` in order to re-show for the complete dataset which don’t let both. I attempted improving the regularization due to the fact I thought that we got too many provides it did not help. I attempted tuning `scale_pos_weight` and discovered it don’t help; in reality, often growing pounds away from low-confident advice perform boost the local Curriculum vitae more than expanding pounds from positive advice (avoid user friendly)!

I additionally thought of Bucks Finance and Individual Fund due to the fact same, therefore i managed to get rid of an abundance of the enormous cardinality

Although this is actually taking place, I found myself messing up to a lot with Neural Channels because We got intentions to create it as a fusion to my model to find out if my score enhanced. I am grateful Used to do, because We discussed individuals sensory communities on my people later. I must give thanks to Andy Harless getting guaranteeing everyone in the competition to develop Sensory Sites, with his very easy-to-pursue kernel you to definitely motivated me to say, “Hello, I can do this as well!” The guy simply put a rss feed forward sensory circle, however, I had plans to use an organization embedded sensory community which have another normalization system.

My personal higher private Lb rating performing alone is actually 0.79676. This would have earned me rank #247, adequate to possess a silver medal nonetheless extremely respected.

August thirteen We authored a special current dataset that had plenty of new possess that we is assured create simply take me also higher. The newest dataset exists of the clicking right loans Pleasant Groves AL here, and also the password to produce it may be found because of the pressing right here.

Brand new featureset had has that we imagine have been extremely novel. This has categorical cardinality cures, conversion from ordered groups so you can numerics, cosine/sine conversion of hour regarding application (so 0 is nearly 23), ratio amongst the reported money and you may median income for your occupations (if the stated income is significantly highest, maybe you are sleeping making it look like the application is better!), earnings split up of the complete part of domestic. I got the sum of the `AMT_ANNUITY` you only pay out per month of the productive earlier applications, immediately after which divided that by your income, to find out if their ratio was suitable to look at an alternative mortgage. I got velocities and you can accelerations out of certain articles (elizabeth.grams. This might tell you if customer are begin to score quick on the money and that very likely to default. I also checked velocities and you may accelerations from those days due and amount overpaid/underpaid to see if they certainly were which have present trends. Instead of other people, I was thinking the brand new `bureau_balance` desk was very useful. We lso are-mapped the newest `STATUS` column so you can numeric, erased all the `C` rows (since they contains no extra guidance, these were simply spammy rows) and you can out of this I became capable of getting aside and therefore agency programs was indeed effective, that have been defaulted to your, etc. In addition, it helped from inside the cardinality prevention. It was bringing regional Curriculum vitae regarding 0.794 even though, thus possibly I tossed aside too much suggestions. Basically got more hours, I’d not have smaller cardinality really and might have only kept the other of use keeps I authored. Howver, they most likely aided too much to the fresh new variety of group pile.

user_post