We collected our results through a pattern of text started by a seed phrase and then another word chosen as the mostly likely word to follow, and then another word after that, etc. Unfortunately, but understandably, our initial results showed repeated patterns of common words like so:
“of the i i of the i i of the i i of the i i of the i i of the i i of the i i of the i i of the i i of the i i...”
To reconcile this, the program was modified to make a random choice out of a top selection of words, the number of which could be modified. Adjusting this value led to some very interesting and different results. On the extreme low end, this repeated pattern could still be seen picking from the top 5:
“i you you know you i i to i i you the to i the to and and i that you to in to to and you to be and to the i you the a i and a visionary i you i you i i that to and i the you i you it to you i i you the i to to you i to the you know to the you know in i i a you the you you you to you i i i you to to the the the you i i the the and and and and you the be the i you to the to you to you yeah i the it of you the and you to you to the the i i you i a you i you the the the i i a hoodie...”
More sporadic and unexpected words like “visionary” and “hoodie” start to emerge. However, the same repeated pattern still remains. Cranking the number up to ten, more interesting things start to happen:
“my the to my i to it it like and morisette was a sharin' i my to you i'm i that it you know m you and t me i compare in you the touchin' is the and in in like that i and a that go a the like you to you you you yeah woo to yeah yeah yeah woo woo...”
The relative frequencies of words start matching what one might expect to see in regular sentences and this seems the most natural of any of the tests so far. Progressing on to picking from the top 20 choices, we get something like this:
“it you with reference how that i to a if but you i'm and benz and i'm you be me my it and lil' a it was go a the with is and be joke the we you to the s**** i'm it to be in it that go very in i to value in that it was about we when the need that the s**** to you in it's was in it it be million a highlights i'm just woah in lockdown to and and the in a to monster to i'm be god...”
Interestingly, this is where you can really tell that the data originated from Kanye. His bravado comes through in words like ‘benz,’ ‘milllion,’ and ‘god;’ the first swears are introduced; and some of his song titles end up filtering into the lyrics, such as “Highlights,” a song from The Life of Pablo, ‘lockdown,’ a potential product of the song “Love Lockdown” off of 808s & Heartbreak, “Monster” from My Beautiful Dark Twisted Fantasy. Additionally, the word ‘god’ appears, which is both a prominent subject in his music and reflective of another title, “I am a God” from the album Yeezus. While it is not a particularly coherent sample, it is distinct in its origin. Past this point, the direct influences are less obvious, but the coherence gets even stronger. My favorite was the sample we took at picking from the top 100 words:
“your i is what of oh leave though price they go am f*** like go mcdonald homie in up two give was fast been all i'll swish from i'm are you new that can a courtesy marc rod paris they it cashmere i'm though s*** your more what say need but can your in cam if is people one s*** that's by i'm kanye honest your name you or get had do with it money for we up psychologically a million one make because over love for clothing made rather the one your s*** p*sses yeah and woman comparisons keep vega co love but composite i when classic me is s why all rap it your look he at smart a he all jobs say know gonna recorded say his you put s is off it's they night when about that all the democrats s at so my just so lifetimes at men and at from but well have me was evening one clinton i'm really them motherf***er...”
This section almost seems like some sort of twitter rant that Kanye might pull off, starting with some advice to “like go [to] mcdonald homie,” proceeding into a discussion of high fashion with “Marc (Jacobs),” “Paris,” and “cashmere.” The text then reasserts “I’m Kanye, honest,” in a very expected way, comments on money, and goes back to talking about a “love for clothing.” The simulated rapper finishes talking a bit about being an artist with words like “classic,” “rap,” and “recorded,” before launching into a political rant, mentioning “democrats” and “Clinton.” While not a perfect coherent work, I would consider this sample to be a major success, as it conveys so many of the talking points Kanye is known for and emulates the rapper and his opinions very well. You can see all of our results in full by
clicking here . Please note that as our results come from raw data of Kanye's lyrics, there is vulgar language.