bdewilde.github.io - While I Was Away









Search Preview

While I Was Away

bdewilde.github.io
data scientist / physicist / filmmaker
.io > bdewilde.github.io

SEO audit: Content analysis

Language Error! No language localisation is found.
Title While I Was Away
Text / HTML ratio 50 %
Frame Excellent! The website does not use iFrame solutions.
Flash Excellent! The website does not have any flash contents.
Keywords cloud data I’ve Python social Burton analysis papers —– interactive DeWilde web hackathon applied cool epic time damn Bible months end
Keywords consistency
Keyword Content Title Description Headings
data 8
I’ve 5
Python 4
social 4
Burton 3
analysis 3
Headings
H1 H2 H3 H4 H5 H6
1 0 0 0 0 0
Images We found 3 images on this web page.

SEO Keywords (Single)

Keyword Occurrence Density
data 8 0.40 %
I’ve 5 0.25 %
Python 4 0.20 %
social 4 0.20 %
Burton 3 0.15 %
analysis 3 0.15 %
papers 3 0.15 %
—– 3 0.15 %
interactive 3 0.15 %
DeWilde 3 0.15 %
web 3 0.15 %
hackathon 3 0.15 %
applied 2 0.10 %
cool 2 0.10 %
epic 2 0.10 %
time 2 0.10 %
damn 2 0.10 %
Bible 2 0.10 %
months 2 0.10 %
end 2 0.10 %

SEO Keywords (Two Word)

Keyword Occurrence Density
of the 4 0.20 %
of data 3 0.15 %
Burton DeWilde 2 0.10 %
as a 2 0.10 %
deal of 2 0.10 %
great deal 2 0.10 %
a great 2 0.10 %
come out 2 0.10 %
data munging 2 0.10 %
Python package 2 0.10 %
A Python 2 0.10 %
and the 2 0.10 %
An interactive 2 0.10 %
my next 2 0.10 %
—– and 2 0.10 %
in the 2 0.10 %
Harmony Institute 2 0.10 %
and fascinating 1 0.05 %
partofspeech tagging 1 0.05 %
builds upon 1 0.05 %

SEO Keywords (Three Word)

Keyword Occurrence Density Possible Spam
A Python package 2 0.10 % No
of data munging 2 0.10 % No
a great deal 2 0.10 % No
great deal of 2 0.10 % No
Burton DeWilde About 1 0.05 % No
the alreadyimpressive NLTK 1 0.05 % No
It builds upon 1 0.05 % No
builds upon the 1 0.05 % No
upon the alreadyimpressive 1 0.05 % No
alreadyimpressive NLTK and 1 0.05 % No
phrase extraction It 1 0.05 % No
NLTK and pattern 1 0.05 % No
and pattern packages 1 0.05 % No
pattern packages bibviz 1 0.05 % No
packages bibviz An 1 0.05 % No
bibviz An interactive 1 0.05 % No
An interactive resource 1 0.05 % No
interactive resource for 1 0.05 % No
extraction It builds 1 0.05 % No
and noun phrase 1 0.05 % No

SEO Keywords (Four Word)

Keyword Occurrence Density Possible Spam
a great deal of 2 0.10 % No
Burton DeWilde About Me 1 0.05 % No
the alreadyimpressive NLTK and 1 0.05 % No
noun phrase extraction It 1 0.05 % No
phrase extraction It builds 1 0.05 % No
extraction It builds upon 1 0.05 % No
It builds upon the 1 0.05 % No
builds upon the alreadyimpressive 1 0.05 % No
upon the alreadyimpressive NLTK 1 0.05 % No
alreadyimpressive NLTK and pattern 1 0.05 % No
tagging and noun phrase 1 0.05 % No
NLTK and pattern packages 1 0.05 % No
and pattern packages bibviz 1 0.05 % No
pattern packages bibviz An 1 0.05 % No
packages bibviz An interactive 1 0.05 % No
bibviz An interactive resource 1 0.05 % No
An interactive resource for 1 0.05 % No
interactive resource for exploring 1 0.05 % No
and noun phrase extraction 1 0.05 % No
partofspeech tagging and noun 1 0.05 % No

Internal links in - bdewilde.github.io

About Me
About Me
Archive
Archive
Intro to Automatic Keyphrase Extraction
Intro to Automatic Keyphrase Extraction
On Starting Over with Jekyll
On Starting Over with Jekyll
Friedman Corpus (3) — Occurrence and Dispersion
Friedman Corpus (3) — Occurrence and Dispersion
Background and Creation
Friedman Corpus (1) — Background and Creation
Data Quality and Corpus Stats
Friedman Corpus (2) — Data Quality and Corpus Stats
While I Was Away
While I Was Away
Intro to Natural Language Processing (2)
Intro to Natural Language Processing (2)
a brief, conceptual overview
Intro to Natural Language Processing (1)
A Data Science Education?
A Data Science Education?
Connecting to the Data Set
Connecting to the Data Set
Data, Data, Everywhere
Data, Data, Everywhere
← previous
Burton DeWilde

Bdewilde.github.io Spined HTML


While I Was Away Burton DeWilde About Me Archive CV While I Was Away 2013-10-05 hackathon Harmony Institute top links treasury.io I’ve not posted in scrutinizingly six months, but I was, like, totally busy. Here’s what I’ve been up to: Way when in February, I participated in a hackathon with a few data friends from CSV Soundsystem; we made a Federal Management Service symphony, and it won Best in Show. Rather than let the project die at the end of the hackathon, we unromantic for —– and received! —– aLawmakingSprint grant from the Knight Foundation to build it out. I performed an epic, damn near uncounted feat of data munging, and the other guys did everything else. The end result was treasury.io (and its companion tweetbot, @TreasuryIO). It provides the first-ever electronically-searchable database of the federal government’s daily revenues, spending, and borrowing. It lets you do lots of tomfool things, like plot public debt versus the debt ceiling over time: I’ve moreover been working nonflexible at Harmony Institute on (among other things) a massive interactive web app that maps the landscape of films virtually social issues, positioning them withal the issues’ conversational zeitgeist, and permitting for deep viewing and comparison of films’ social impacts. It’s tabbed ImpactSpace… until we decide on a name that wasn’t recently personal by someone else –— damn! I’ve washed-up a unconfined deal of data mining from dozens of sources via web crawls, web scrapes, API access, and structured data dumps; performed still increasingly epic feats of data munging; dived into cutting-edge NLP research and come out with fancy algorithms that I then implemented in Python; and plane gotten my feet wet in social and semantic network analysis. Much work remains, but we’re making good progress! :) I’ve tried to alimony up with developments in data science… Some seriously tomfool code, projects, and papers have come out in the past few months. In specimen you missed them: Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach: In a nutshell, the words we use to express ourselves on social media are strongly indicative of our personality, age, and gender. Or, as Gawker put it, “science shows men and women are both villainous stereotypes on Facebook.” prettyplotlib: A Python package built on top of the de-facto plotting standard matplotlib that produces pretty plots by default, saving yourself a unconfined deal of trouble. Inspired by Tufte! TextBlob: A Python package that simplifies and improves a number of vital natural language processing tasks like part-of-speech tagging and noun phrase extraction. It builds upon the already-impressive NLTK and pattern packages. bibviz: An interactive resource for exploring some of the increasingly negative aspects of holy books, such as Bible contradictions, biblical inerrancy, and the Bible as a source of morality. Fun and fascinating. Paperscape: An interactive tool to visualize the arXiv, an open, online repository for scientific research papers, as a network of papers unfluctuating by citations. NLP with Deep Learning: Google went superiority and unromantic deep learning techniques to language wringer with pretty spectacular results —– and they open-sourced it! Python ports appeared quickly. Oh man, there’s so much more… but you’ll have to search through my Twitter feed. :) Where else has the time gone? Well, I went to a handful of weddings, moved into an suite in Chelsea, spent ten days in Scandinavia with my boyfriend, got 241 out of 242 power stars in Super Mario Galaxy 2, and resumed regular gym-going. Finally, my on-again, off-again data side-project, the megacosm and wringer of a Thomas L. Friedman corpus, will be the subject of my next few blog posts. And no, it won’t be years until my next entry –— I’m no George R.R. Martin. ← previous ↑ next → Please enable JavaScript to view the comments powered by Disqus. comments powered by Disqus Burton DeWilde data scientist / physicist / filmmaker © 2014 Burton DeWilde. All rights reserved.