1) @ProjectJupyter Extension of the Day: Spellchecker!

This #nbextension uses a @CodeMirror overlay mode to highlight incorrectly-spelled words in Markdown and Raw cells. The typo.js library does the actual spellchecking, and is included as a dependency.

…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
.@ProjectJupyter Extension of the Day #2: Codefolding!

This extension adds codefolding functionality from @CodeMirror to each code cell in your notebook. The folding status is saved in the cell metadata, so reloading a notebook restores the folded view.

…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
.@ProjectJupyter Extension of the Day #3: ExecuteTime!

This extension displays when the last execution of a code cell occurred and how long it took. The timing information is stored in the cell metadata, restored on notebook load, & can be togged on/off.

…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
.@ProjectJupyter Extension of the Day #4: nbTranslate!

This extension converts markdown cells in a notebook from one language to another & enables one to selectively display cells from a given language in a multilanguage notebook. LaTeX is also supported.
…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
.@ProjectJupyter Extension of the Day #5: Hinterland!

This extension enables a code autocompletion menu for every keypress in a code cell, instead of only calling it with tab. It also displays helpful tooltips based on customizable timed cursor placement.
…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
PS: if you've ever wondered how Intellisense works, or how search engines are able to autocomplete so quickly - it's all due to tries!

Advantages: speed, space, & partial matching. Finding a word in this structure is O(m), where m is the length of the word you’re trying to find.
.@ProjectJupyter Extension of the Day #6: highlighter!

This extension provides several toolbar buttons for highlighting text within markdown cells. Highlights can also be preserved when exporting to HTML or #LaTeX, and color schemes are customizable.

…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…
...and because I am horrific at Twitter threads:

.@ProjectJupyter Extension of the Day #7: 2to3!

This nbextension converts python2 in notebook cells to python3 code.
Under the hood, it uses a call to the notebook kernel for reformatting; & the conversion run by the kernel uses the stlib lib2to3 module.

…r-contrib-nbextensions.readthedocs.io/en/latest/nbex…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with 👩‍💻 DynamicWebPaige @ 127.0.0.1 🏠

👩‍💻 DynamicWebPaige @ 127.0.0.1 🏠 Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @DynamicWebPaige

Oct 8, 2018
TIL:

💃Good folks at @UniofOxford tagged+categorized pose categories in several episodes of "Buffy the Vampire Slayer"

📃"2D pose estimation in TV shows" is a body of academic work

🙌I've a heretofore unrealized desire to determine which BuffyPose has the highest % of frames
✨🧠

robots.ox.ac.uk/~vgg/data/buff…

(1) Train a model on the annotated example BuffyPoses;

(2) have it cycle through every episode of every season of "Buffy";

(3) determine BuffyPose with the greatest percentage of frames;

(4) don't forget to take ample time in S01 and S02 to:
💁‍♀️(so, so happy rn)
Read 7 tweets
Sep 27, 2018
📓 Am rereading my class notes from grad school, as well as from mentoring students for @Coursera and @EdX courses on statistics - and thought I'd share the most common mistakes when doing data analysis.

✨Have counted 8 of 'em, with examples - please feel free to add your own!
MISTAKE #1:
Garbage in, garbage out.

🤦‍♀️Failing to investigate your input for data entry or recording errors.

📊Failing to graph data and calculate basic descriptive statistics (mean, median, mode, outliers, etc.) before analyzing it in-depth.
👉EXAMPLE #1:
It's easy to make bad decisions on shoddy input! Here you see an outlier's impact on descriptive statistics.

Also: always consider the uncertainty in your measuring instruments. Just because you've gotten an *accurate* value doesn't mean it's *actually* correct.
Read 14 tweets
Sep 18, 2018
🗣Some recommendations for budding machine learning engineers:

(1) Make sure your sample dataset is representative of your entire population - and remember that more data is usually - but not necessarily! - better.

Also consider using image preprocessing tools, like Augmentor.
(2) Use small, random batches to train rather than the entire dataset.

⏳Reducing your batch size increases training time; but it also decreases the likelihood that your optimizer will settle into a local minimum instead of finding the global minimum (or something closer to it).
(3) Make sure the data that you're using is standardized (mean and standard deviation for the training data should match that of the test data). 📊

If you're using @TensorFlow, standardization can be accomplished with something like tf.nn.moments and tf.nn.batch_normalization.
Read 11 tweets
Feb 12, 2018
Inspired by the big ol' long list of deep learning models I saw this morning, and @SpaceWhaleRider's love of science-y A-Z lists, I've decided to create an A to Z series of tweets on popular #MachineLearning and #DeepLearning methods / algorithms.

Ready? Here we go:
A is for... the Apriori Algorithm!

Intended to mine frequent itemsets for Boolean association rules (like market basket analysis). Ex: if someone purchases the same products as you, in general, then you'd probably purchase something they've purchased.

cran.r-project.org/web/packages/a…
B is for... Bootstrapped Aggregation (Bagging)!

This is an ensemble meta-algorithm designed to improve the stability and accuracy of machine learning algorithms used in statistical classification+regression. Reduces variance, helps to avoid overfitting.

Example: Random Forests.
Read 28 tweets
Dec 1, 2017
So, time to drop some knowledge bombs. Most data scientists aren't taught:

- TCP/IP Protocol architectures
- how to deploy a server
- RESTful vs SOAP web services
- Linux command line tools
- the software development life cycle
- modular functions + the concept of writing tests
- distributed computing
- why GPU cores are important
- client-side vs server-side scripting

..and that's just a subset. If you meet a data scientist who has familiarity with those concepts, it's because they either have a CS or IT background, or they taught themselves.
So be thankful if folks are following along! 😀

And be mindful that sometimes more detailed, patient, lower-level explanations are necessary - especially when writing docs.

R is fantastic at this: for example, @hadleywickham's httr vignette.

cran.r-project.org/web/packages/h…
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(