[ 3 / biz / cgl / ck / diy / fa / ic / jp / lit / sci / vr / vt ] [ index / top / reports ] [ become a patron ] [ status ]
2023-11: Warosu is now out of extended maintenance.

/biz/ - Business & Finance


View post   

File: 188 KB, 847x722, file.png [View same] [iqdb] [saucenao] [google]
26255075 No.26255075 [Reply] [Original]

Any progress on this project, anon?
http://fireplu.me/2020/11/15/wojakindex.html

>> No.26255190
File: 247 KB, 933x835, image.jpg [View same] [iqdb] [saucenao] [google]
26255190

>>26255075
>Update: The Wojak Index has been discontinued. Because I don’t wanna pay $5 a month for a server and $5 a month for the screenshooter service

fucking jew

>> No.26255193

>>26255075
>comment on the bottom
Aniki...

>> No.26255290

I can make this if you fund me
or help me decide how to monetize this?

>> No.26255387

>>26255290
it's not always about the money. it's about sending a message. i'll do it.

>> No.26255419

>>26255387
what message?
inb4 jeets just manipulate it

>> No.26255469
File: 39 KB, 484x484, image.jpg [View same] [iqdb] [saucenao] [google]
26255469

>>26255290
i'm not funding anything you can literally throw up om your VPS anon

>>26255387
based anon
let me know if you need a domain
i have one reg'd for a year for a project that didn't pan out

>> No.26255526

>>26255469
>i'm not funding anything you can literally throw up om your VPS anon
what is r&d

>> No.26255597

>>26255387
>>26255290
>>26255469

How about measuring the ratio of pink/green and chart it? Strong pink = sell signal, strong green = buy?

>> No.26255651

>>26255597
Also needs to measure black/brown for bobo.

Perhaps a better approach is to train neural networks to identify wojak/pepe/bobo? I'm sure some autists have sorted reaction image folders already that can be used to training.

>> No.26255661
File: 527 KB, 926x915, image.jpg [View same] [iqdb] [saucenao] [google]
26255661

>>26255526
OK, i'll bite
$100 in ETH to whoever shits out this CS 101 tier project

>> No.26255703

>>26255651
post your wojacks/bobos/etc. here for training data /biz/fags

>> No.26255705

>>26255075
sounds like ml image clasification would be a really good choice for this and I was looking for something to deepen my understanding maybe I will do something like this these days

>> No.26255736

>>26255703
memeatlas should be fine

>> No.26255808

>>26255705
>sounds like ml image clasification
nigga just take prominent color as indicator how retardedly fucked is your brain with this buzzword technology

>> No.26255953
File: 12 KB, 189x266, bobo-choked-by-pink-wojak.jpg [View same] [iqdb] [saucenao] [google]
26255953

>>26255703
Cases like this one might pose a challenge for a neural net. What message does it convey? How to train a network that can correctly classify this example?

Perhaps remove all examples where more than one character is detected?

>> No.26256013
File: 9 KB, 189x267, bobo-game-of-thrones.jpg [View same] [iqdb] [saucenao] [google]
26256013

>>26255953
More difficult cases

>> No.26256044
File: 228 KB, 526x424, bobo-pepe-whispers.png [View same] [iqdb] [saucenao] [google]
26256044

>>26256013

>> No.26256081
File: 96 KB, 560x564, image.jpg [View same] [iqdb] [saucenao] [google]
26256081

>>26255597
how i would do it

1. scrape images from /biz/
2. classify images into wojack/bobo/pepe/nothingburger
3. track count and frequency history for each image category (hour, day, week, month, etc.)
4. score sentiment using the counts + LMSR or some other scoring rule for each historical period

>> No.26256096
File: 166 KB, 754x427, bobo-faking-death-wiping-off-blood.jpg [View same] [iqdb] [saucenao] [google]
26256096

>>26256044
Maybe we could classify them by hand?

A sentiment analysis on the first post could perhaps increase the accuracy?

>> No.26256151

>>26255387
Based

>> No.26256164

>>26256081
It's simple and possibly effective, but there will be significant noise/errors with this approach. And it's all in vain if the sentiment on /biz/ lags behind changes in price, no?

>> No.26256214
File: 6 KB, 224x225, bobo-green-confused-face.jpg [View same] [iqdb] [saucenao] [google]
26256214

Look at this one, it's as if someone tried to make one that would fool a neural net.

>> No.26256235

>>26255597
>Strong pink = sell signal, strong green = buy?

This is how I know you're a newfags and you'll never make it. It's literally the other way around.

Consider this free life advice:
You buy when biz goes AAAAAAAA
You sell when biz goed OOOOOOO

>> No.26256271
File: 561 KB, 860x758, bobo-pepe-half-face.png [View same] [iqdb] [saucenao] [google]
26256271

>>26256214

>> No.26256287

>>26256096
it should be pretty quick to train an accurate classifier for characters using OpenCV or some other toolkit

we should indeed consider post sentiment (all caps, slurs, etcs) since the character in each post does not necessarily correlate witb the sentiment

>> No.26256351
File: 109 KB, 473x311, image.jpg [View same] [iqdb] [saucenao] [google]
26256351

>>26256214
that is a fucked case

but we don't need to be 100% accurate
+ we can weigh counts by confidence

>> No.26256373

>>26256235
I agree, but I would add that the AAAA and OOOO must have been consistent for a while so it's a trend. /biz/ is flaky, sentiment turns on a dime even with small price fluctuations.

>> No.26256416

>>26256287
A naive classifier will surely not be accurate though, given the examples I've posted so far?

>> No.26256433

>>26255387
Sometimes it's not even about sending a message. I'll pay everyone in this thread to make this project. It's a great opportunity.

>> No.26256447

>>26256164
maybe we could do said approach all at once for each period to create multiple indexes? each increasingly more volatile + tracking a smaller period of time (down to like 10 minutes or something)

>> No.26256463

>>26255190
lol people claim to be millionaires on this board but dont pay 10 dollars a month

>> No.26256505

>>26255075
I spend all day here anyway, don't need an app or whatever to tell me if it's pink or green in the catalog.

>> No.26256514

>>26256433
ok bos on it will create bread when done

>> No.26256526
File: 37 KB, 842x624, pepe-beaver-canada.jpg [View same] [iqdb] [saucenao] [google]
26256526

>>26256447
Yeah. On the other hand if we sample frequently and track continuously and serve the data through an API, anyone can analyze it and use it however they want.

>> No.26256572

>>26255953
>>26256013
Doesn't need ML, just a threshold for proportion of pixels in the 'pink range' for each image to call 'pink images', then proportion of 'pink images'/total images. Bobos don't count and shouldn't be counted.

>> No.26256615
File: 167 KB, 768x1024, image.jpg [View same] [iqdb] [saucenao] [google]
26256615

>>26256416
i'm thinking it might be accurate enough? (since we only need to track general sentiment + not everyone posts sped versions) but it would definitely need some testing and experimentation

>> No.26256686

>>26255597
ngmi

>> No.26256722
File: 123 KB, 1599x999, pink-wojak-todo-list-done.jpg [View same] [iqdb] [saucenao] [google]
26256722

>>26256572
Pic related, positive sentiment that would be classified as "pink".

>> No.26256729
File: 149 KB, 600x859, image.jpg [View same] [iqdb] [saucenao] [google]
26256729

>>26256526
it looks like if we're doing the windowed approach we would have the data anyways to expose this api
so why not do both :)

>> No.26256869
File: 1.78 MB, 3572x2087, wojaks.jpg [View same] [iqdb] [saucenao] [google]
26256869

>>26255075

I already have one made. As others have mentioned before me, this is a fairly CS101 type project.

Not sure why I would share it with lazy fucks on here though

>> No.26256881

What you need is the hash signature of each pink wojack images. Then you can compare the prc. Easy

>> No.26256907

>>26256881
^ this is a valid approach as well

>> No.26256930

>>26256463
You can get 0.5 of LINK for this kind of money.

>> No.26256935

>>26256615
You're probably right. I'm pretty sure that 90%+ of the reaction images that are posted are reposts, so if they were classified once the the maintenance work needed to classify original variations would probably be minimal. Then again for a project like this it's probably not even worth the effort.

>> No.26257014

>>26256722
Yes. However, if the index runs continually then the false-positives provide baseline data. It's the change compared to this baseline in a pink-fields scenario that's significant. Calling of each image doesn't have to be 100% accurate.

>> No.26257015

>>26256881
https://en.wikipedia.org/wiki/PhotoDNA

Maybe something like this can be added to increase the robustness against small alterations.

>> No.26257167

make a discord for the projects ill fund too

>> No.26257172
File: 125 KB, 490x640, image.jpg [View same] [iqdb] [saucenao] [google]
26257172

>>26256935
i'm sure there is some cs tard lurking here on /biz/ that needs the (((experience))) to look for a wageslave job

if you put some modicium of sophistication into this project you can put this on your buzzword resume as 'Data Engineering' or some shit like that

to all college students: i know you're not spending that much effort at Zoom University, so why not work on this meme project and further some skills? :)

>> No.26257216

>>26256869

Also, it was mostly because I was curious about which wojaks were out there. If you think this will make you money you're fairly retarded and I'll save you some time: set a price alert with the coingecko app for when BTC moves 10% in a certain amount of time, and voila, same effect, 0 effort.

>> No.26257223

>>26257172
yes u really hit this is me :)

>> No.26257236

>>26257014
Fair enough, my gut feeling is that it would probably be accurate enough.

>> No.26257436

>>26256869
>I already have one made.

Can you share your data set? (:

>> No.26257549

>>26256463
>I didn't get rich by writing a lot of checks

>> No.26257831
File: 78 KB, 423x456, image.jpg [View same] [iqdb] [saucenao] [google]
26257831

>>26257216
yeah, this will 100% not make you money
if you are doing this it will be purely for the lulz

>> No.26257909

>>26256081
They all seem to use the same shade of pink, maybe you could do something with that to avoid image classification. Memes evolve enough that many would slip through the cracks of your scraper