Quantcast
[ 3 / biz / cgl / ck / diy / fa / g / ic / jp / lit / sci / tg / vr / vt ] [ index / top / reports / report a bug ] [ 4plebs / archived.moe / rbt ]

Due to resource constraints, /g/ and /tg/ will no longer be archived or available. Other archivers continue to archive these boards.Become a Patron!

/g/ - Technology


View post   

[ Toggle deleted replies ]
File: 43 KB, 650x190, 05-09-2014-powerpoint-logo.jpg [View same] [iqdb] [saucenao] [google] [report]
61134822 No.61134822 [Reply] [Original] [archived.moe] [rbt]

Is there a way to count all the words in a powerpoint document?

Google only points to the native """solution""" which doesn't actually count all the words.

>> No.61134864

I have hundreds of these shitty powerpoints to count.
I'd rather avoid having to count them manually.

>> No.61137387

bunb

>> No.61137395

>>61134822
VBA

>> No.61137404

Go the bruteforce way and parse the xml in the scripting language of your choice.

>> No.61138533

So what you're saying is there's really no simple way of doing this?

I either need external software or coding tools?

>> No.61138846

tried exporting the content into a program or format?

>> No.61138865

>>61138846
What kind of program or format?

>> No.61138949

>>61138865
doc=test_doc; libreoffice --impress --headless --convert-to pdf $doc.odp; pdftotext $doc.pdf - | wc

>> No.61138985

>>61138949
where do i put this?

>> No.61139083

>>61138985
in the terminal.
Windows have one now right?

>> No.61139098

>>61139083
I have no idea what that is.

Also, that code seems to say "libreoffice", but I have legit microsoft office.

>> No.61139101

>>61138985
In your shell session duh.

>> No.61139195

>>61139098
I'll break it down for you
doc=test_doc;

This stores the name of the file, in my case, the name of the file is test_doc and I want to store it in the $doc variable.

libreoffice --impress --headless --convert-to pdf $doc.odp;

Here I use libreoffice to convert the presentation to a pdf.
If you want to use microsoft office, you have to use the built in function.

pdftotext $doc.pdf - | wc

Here I take the resulting pdf and convert it to text, the "-" tells the program to simply print it rather than write it to a file.
The wc program (Word Count) counts the words.

You could also open the pdf file in a reader and export the text in another way and count the words in a text file.

You could also just look down in the corner and see how many words there is.

>> No.61139276

select all, copy to notepad, copy again from notepad to word. read the properties. done

>> No.61139303

>>61139276
>select all
Impossible right there.

It either only selects all the text in a certain box, on a certain slide, or everything except the boxes.

>> No.61139541

I think I found it.

I save the fucking things as PDFs, then convert them to word using some shitty third-party converter, and then make sure no text boxes are skipped in the count.

What a load of shit, but it beats counting in the PPT itself.

>> No.61139559

>>61138949
there should be a && between libreoffice and pdftotext senpai, you only want to run pdftotext after libreoffice succeeds right

>> No.61139674
File: 2.98 MB, 1474x1600, 1497821486800.jpg [View same] [iqdb] [saucenao] [google] [report]
61139674

>>61139098
>>61138985
Please KYS now
after fucking off back to /v/
technology board looks more like pajeet support skype by the second

>> No.61139723

>>61139559
There is a ";"
fuck safety

>>
Name (leave empty)
Comment (leave empty)
Name
E-mail
Subject
Comment
Password [?]Password used for file deletion.
Captcha
Action