…Dataset and Dataloader abstractions and experimented…

I read 3.4.2: The Lasso from The Elements of Statistical Learning and read about proximal gradient methods. This technique sits in the domain of convex optimization, which I have yet to take down systematically. I added some notes to an optional module that I may well pursue on this subject.
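As a note-to-self, the proximal step for the lasso penalty is just element-wise soft-thresholding; a toy ISTA sketch in numpy (my own notation and step-size choice, not something from the book) looks roughly like:

```python
import numpy as np

def soft_threshold(z, kappa):
    # Proximal operator of kappa * ||.||_1: shrink each coordinate towards zero.
    return np.sign(z) * np.maximum(np.abs(z) - kappa, 0.0)

def lasso_ista(X, y, lam, n_iters=500):
    """Proximal gradient (ISTA) for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    # Step size from the Lipschitz constant of the smooth part's gradient.
    t = 1.0 / np.linalg.eigvalsh(X.T @ X / n).max()
    b = np.zeros(p)
    for _ in range(n_iters):
        grad = X.T @ (X @ b - y) / n               # gradient of the smooth term
        b = soft_threshold(b - t * grad, t * lam)  # gradient step, then prox
    return b
```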
I read 4.5: Separating Hyperplanes and 12.2: Support Vector Classifier from The Elements of Statistical Learning with the aspiration of writing an SVM notebook. I was thwarted by some unfamiliar mathematics - which was frustrating, but I didn't feel too bad about it as the section is associated with an Edvard Munch "The Scream" icon in the book. I'll need to grok Lagrangian/Wolfe duality before I return to it.

Back to the books: chapters 4 and 5 of An Introduction to Statistical Learning.
Back from holiday! I spent the day playing around with some applied project ideas that I hope to share once I've articulated them better. I intend to spend one day a week on the applied track from now on.
A half day: I finished the applied exercises from chapter 3 of An Introduction to Statistical Learning. Now I'm off to Bruges for the rest of the week for my birthday, then I'm away next week in Greece for a holiday.
I continued the applied exercises from chapter 3 of An Introduction to Statistical Learning, which led me to read this page from the statsmodels docs about diagnostic regression plots. I also played around with patsy, which I discovered via the statsmodels formula API, and skimmed the scipy.stats documentation.
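For instance, patsy's formula syntax is what powers the statsmodels formula API; a tiny sketch with a made-up data frame:

```python
import pandas as pd
from patsy import dmatrices

# A small hypothetical frame, just to show the formula syntax.
df = pd.DataFrame({
    "mpg": [21.0, 22.8, 18.7, 24.4],
    "hp": [110, 93, 175, 62],
    "wt": [2.62, 2.32, 3.44, 3.19],
})

# "hp * wt" expands to the main effects plus the interaction term.
y, X = dmatrices("mpg ~ hp * wt", data=df, return_type="dataframe")
print(X.columns.tolist())  # ['Intercept', 'hp', 'wt', 'hp:wt']
```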
I started the applied exercises from chapter 3 of An Introduction to Statistical Learning, which I am porting from R to Python. I reviewed some of the material from the chapter and reached out to Wikipedia and The Elements of Statistical Learning for more detail. This article was useful in reproducing the diagnostic regression plots that come for free with R.
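The residuals-vs-fitted panel, for example, only takes a few lines once the model is fit. A sketch on synthetic stand-in data (the column names and data here are made up, not from the exercises):

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import statsmodels.formula.api as smf

# Synthetic stand-in data (hypothetical), just to make the sketch runnable.
rng = np.random.default_rng(1)
df = pd.DataFrame({"x": rng.uniform(0, 10, 100)})
df["y"] = 2.0 + 0.5 * df["x"] + rng.normal(scale=1.0, size=100)

model = smf.ols("y ~ x", data=df).fit()

# Residuals vs fitted values - the first of R's plot.lm() diagnostic panels.
plt.scatter(model.fittedvalues, model.resid, alpha=0.6)
plt.axhline(0, color="grey", linewidth=1)
plt.xlabel("Fitted values")
plt.ylabel("Residuals")
plt.title("Residuals vs fitted")
plt.show()
```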
I completed reading chapter 3 from An Introduction to Statistical Learning and did the conceptual exercises. I reviewed some of the material from the chapter and reached out to Wikipedia and The Elements of Statistical Learning for more detail.
I completed the Ch3 lectures and questions from Hastie and Tibshirani's Statistical Learning course. I started reading the associated chapter from An Introduction to Statistical Learning.
I completed the Ch1 and Ch2 lectures and questions from Hastie and Tibshirani's Statistical Learning course. I read the associated chapters from An Introduction to Statistical Learning and did the exercises.
I met with my study buddy to discuss our experience of the House Prices: Advanced Regression Techniques competition, work through some sticking points and discuss our next steps. We resolved to complete Hastie and Tibshirani's Statistical Learning course.
I completed modules 5.1 and 5.2 (cross-validation) from Hastie and Tibshirani's Statistical Learning course. I cross referenced with the corresponding sections of The Elements of Statistical Learning.
I completed modules 6.6, 6.7 and 6.8 (shrinkage methods, ridge and lasso regression, finding lambda) from Hastie and Tibshirani's Statistical Learning course. I cross referenced with the corresponding sections of The Elements of Statistical Learning.
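For my own reference, the two penalised least-squares objectives (standard notation, not copied from the book; lambda is the tuning parameter chosen by cross-validation):

```latex
\hat{\beta}^{\text{ridge}}
  = \arg\min_{\beta} \sum_{i=1}^{N}\Bigl(y_i - \beta_0 - \sum_{j=1}^{p} x_{ij}\beta_j\Bigr)^{2}
    + \lambda \sum_{j=1}^{p} \beta_j^{2}
\qquad
\hat{\beta}^{\text{lasso}}
  = \arg\min_{\beta} \sum_{i=1}^{N}\Bigl(y_i - \beta_0 - \sum_{j=1}^{p} x_{ij}\beta_j\Bigr)^{2}
    + \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert
```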
I checked out the scikit-learn cross validation, model evaluation and pipeline docs and used them in my House Prices: Advanced Regression Techniques notebook. Using lasso regression yielded a competition score in the middle of the table.
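Roughly the shape of the pipeline, simplified and with the feature preparation omitted (X and y stand in for the prepared training data; this is a sketch, not the actual competition notebook):

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import Lasso
from sklearn.model_selection import GridSearchCV

# X, y are assumed to be the prepared (numeric, imputed) features and target.
pipeline = Pipeline([
    ("scale", StandardScaler()),      # lasso is sensitive to feature scale
    ("model", Lasso(max_iter=10000)),
])

# Choose the regularisation strength by cross-validated grid search.
search = GridSearchCV(
    pipeline,
    param_grid={"model__alpha": np.logspace(-4, 1, 20)},
    cv=5,
    scoring="neg_mean_squared_error",
)
# search.fit(X, y)
# print(search.best_params_, np.sqrt(-search.best_score_))
```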
I worked on the Kaggle House Prices: Advanced Regression Techniques competition.
I read Chapter 10 (Predicting Continuous Target Variables with Regression Analysis) of Python Machine Learning.
I met up with my study buddy, as we'd agreed last week.
We reviewed some data sets and decided to spend one week working on Kaggle's House Prices: Advanced Regression Techniques competition.
We spent more time comparing notes from Mathematics for Machine Learning and helped each other through a few sticking points.
We pontificated on various topics including: death, software and statistics.
I read Andrej Karpathy's deep reinforcement learning blog post. I made a mental note to spend a couple of weekends sometime getting an agent running in an OpenAI Gym environment.
I had a look online for a dataset that would be suitable for the week. I looked at:
I took a closer look at Jupyter notebooks, which I've been using a fair amount:
I signed up for and poked around on Kaggle:
I played with some Python libraries:
I met up with my study buddy.
I finished my work on the Statistical Learning Theory section of the Bloomberg Concept Check 1. I fared well on the subjects I've seen recently, like fitting linear and quadratic functions to data using the normal equation. I fared less well, though not catastrophically, on the probability material, which I've yet to review methodically as part of this sabbatical - I suspect I may need to; one to discuss with my study buddy next time we meet.
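The normal-equation fits in question are only a couple of lines in numpy; a sketch on made-up data:

```python
import numpy as np

# Hypothetical noisy quadratic data.
rng = np.random.default_rng(0)
x = np.linspace(-3, 3, 50)
y = 1.0 - 2.0 * x + 0.5 * x**2 + rng.normal(scale=0.3, size=x.shape)

# Design matrices for a linear and a quadratic fit.
X_lin = np.column_stack([np.ones_like(x), x])
X_quad = np.column_stack([np.ones_like(x), x, x**2])

# Normal equation: theta = (X^T X)^{-1} X^T y (solved, rather than inverted).
theta_lin = np.linalg.solve(X_lin.T @ X_lin, X_lin.T @ y)
theta_quad = np.linalg.solve(X_quad.T @ X_quad, X_quad.T @ y)
print(theta_lin)
print(theta_quad)
```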
I did a unit of work in the practice track - proving this. I watched a couple of short Pavel Grinfeld lectures on the null space, which enhanced my intuition.
I started working on the Statistical Learning Theory section of the Bloomberg Concept Check 1.
I did some monumental admin which included booking: dentist, hygienist, haircut and car MOT. I went through the links, miscellaneous notes and TODOs I had left over from the Mathematics for Machine Learning course I finished yesterday and pruned/consolidated them. I modified the curriculum to include a "Practice track" - a "little and often" track of small programming exercises and mathematical problems, intended to maintain and enhance practical skills. I created the track to try to retain and enhance what I've learned so far, whilst still making headway in the foundational track.
I ordered another book - Hands-On Machine Learning with Scikit-Learn and TensorFlow, which is a reference text for the Bloomberg course.
I attended Lecture 2 from Bloomberg's Foundations of Machine Learning, which was a case study: we were asked to frame the problem of customer churn for a mobile network operator as a machine learning problem - predict when a user will churn. Again, the students made this an entertaining and informative session; some suggestions included a probability distribution over the days in the future that a user may churn, a binary classification of churn/no-churn in some specified window in the future, and a 'number of days until churn' prediction. The objective of this activity was to demonstrate that choosing an outcome measure when approaching a business problem is often non-trivial.
I attended Lecture 3 from Bloomberg's Foundations of Machine Learning, which was an introduction to Statistical Learning Theory, topics included:
A fair amount of this language was new to me, so I took the opportunity to read the introduction to The Elements of Statistical Learning and the Statistical Learning Theory Wikipedia page before writing up my notes from today's lectures.
I'll take down the concept check and homework problems tomorrow.
I worked through the Principal Components Analysis proof on pages 392-3 of Murphy's MLPP. Feeling confident following this proof was a satisfying capstone to Mathematics for Machine Learning as it required the application of much of the knowledge I've acquired over the past few weeks. The proof begins by constructing an expression for the projection error and shows that it is minimized when the projection onto the subspace is orthonormal, before demonstrating that minimising the projection error is equivalent to maximising the variance of the projected data. This allows one to write an expression for the variance of the projected data in terms of the covariance matrix of the high dimensional data, which we can then maximize in a constrained optimization - making use of a Lagrange multiplier. This maximization yields an expression for the variance of the projected data that can be recognised as an eigenproblem - we arrive at an expression identifying the vector in the direction of maximal variance as the eigenvector of the covariance matrix with the largest eigenvalue.
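In symbols, the final step runs roughly as follows (my own notation, not Murphy's: S is the data covariance matrix and b the unit vector spanning the one-dimensional subspace):

```latex
\begin{aligned}
&\text{maximise } \operatorname{Var}(b^{\top}x) = b^{\top} S\, b
   \quad\text{subject to}\quad b^{\top} b = 1,\\
&\mathcal{L}(b,\lambda) = b^{\top} S\, b - \lambda\,(b^{\top} b - 1),\\
&\frac{\partial \mathcal{L}}{\partial b} = 2 S b - 2\lambda b = 0
   \;\Longrightarrow\; S b = \lambda b,\\
&\operatorname{Var}(b^{\top}x) = b^{\top} S b = \lambda\, b^{\top} b = \lambda .
\end{aligned}
```

So the projected variance equals the eigenvalue, and it is maximised by choosing the eigenvector with the largest eigenvalue.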
I wrote up my notes from the last module of Mathematics for Machine Learning: PCA, which I finished on Saturday morning, and incorporated the proof from Murphy's MLPP.
Though I still have some loose ends to tie up with Mathematics for Machine Learning, I permitted myself to watch the first lecture from Bloomberg's Foundations of Machine Learning, which is the next item in the curriculum. The lecture was a gentle introduction to machine learning, though we are assured the learning curve is due to steepen. The content was a survey of the basics and the material was familiar from Andrew Ng's Machine Learning course: classification and regression, bias and overfitting, training/validation/test sets etc. The figures on polynomial curve fitting with a power series were familiar from Mathematics for Machine Learning and the introduction to Bishop's PRML. Most noteworthy were the good questions from the students in the lecture; I suspect some were software engineers, as they shared my discomfort with the nature of deploying the non-deterministic artefacts of ML to production - the Q and A on this subject was interesting. The instructor mentioned the upcoming homeworks; I looked ahead and they look like they are pitched at the right level - I'm looking forward to getting to them and writing some code.
I completed week 4 of Mathematics for Machine Learning: PCA, topics included:
This unit was the most detailed so far. I didn't have time to write up my notes and I ended up finishing the programming exercise on Saturday morning. I'll write up my notes on Monday morning and that'll conclude the Mathematics for Machine Learning series.
I completed week 3 of Mathematics for Machine Learning: PCA, topics included:
I completed week 2 of Mathematics for Machine Learning: PCA, topics included:
During the week's programming exercise I took a detour to review broadcasting with numpy and read the relevant chapter from the Python Data Science Handbook. I had an understanding of broadcasting that served me well in performing binary operations between scalars and arrays, and between pairs of arrays. My intuition was, however, not robust enough to generalise well to broadcasting pairs of matrices, which requires an intuition fit for three dimensions. After some playing around I grokked it.
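The case that needed the three-dimensional picture was along these lines (a small made-up example):

```python
import numpy as np

X = np.arange(12.0).reshape(4, 3)    # 4 points in 3 dimensions (made up)

# The familiar cases: scalar-array and array-array.
X_scaled = X / 10.0                  # (4, 3) with a scalar
X_centred = X - X.mean(axis=0)       # (4, 3) - (3,) -> (4, 3)

# The case that needs a 3-D picture: pairwise differences between rows.
# X[:, None, :] has shape (4, 1, 3) and X[None, :, :] has shape (1, 4, 3);
# both stretch to (4, 4, 3), giving the difference between every pair of rows.
diffs = X[:, None, :] - X[None, :, :]
pairwise_sq_dists = (diffs ** 2).sum(axis=-1)   # (4, 4)
print(pairwise_sq_dists)
```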
I completed week 1 of Mathematics for Machine Learning: PCA which covered some elementary statistics material, topics included:
I started week 2 of Mathematics for Machine Learning: PCA, which began with a refresher of the dot product before moving on to the more general definition of an inner product.
I completed week 6 of Mathematics for Machine Learning: Multivariate Calculus, topics included:
This concluded Mathematics for Machine Learning: Multivariate Calculus. I did a quick review of the course.
I completed week five of Mathematics for Machine Learning: Multivariate Calculus. The week's focus was numerical optimisation, topics covered:
I completed the second half of week four of Mathematics for Machine Learning: Multivariate Calculus and wrote up my notes for the week, new topics in the second half of the week were:
I registered for a meetup next month that my study buddy discovered; from the description: "There is NO speaker at Journal Club. We split into small groups of 6 people and discuss the papers. For the first hour the groups are random to make sure everyone is on the same page. Afterwards we split into blog/paper/code groups to go deeper". Some swotting up required to avoid blushes here.
I met up with my study buddy. We discussed how we were getting on with the curriculum: we are both having too much of a good time in the foundational track and have been neglecting the applied and interview training tracks. Fair enough - the interview training track is pretty dull and we agreed the applied track can wait until the Mathematics for Machine Learning unit is wrapped up, which it should be within the next couple of weeks.
I completed the first half of week four of Mathematics for Machine Learning: Multivariate Calculus, topics covered:
I worked through weeks two and three of Mathematics for Machine Learning: Multivariate Calculus, topics covered:
I reviewed and consolidated my notes from Mathematics for Machine Learning: Linear Algebra before moving onto the next course in the specialisation. I completed week one of Mathematics for Machine Learning: Multivariate Calculus, which was a univariate differential calculus review covering:
I completed the assignments for week 5 of Mathematics for Machine Learning: Linear Algebra, which included a quiz on diagonalization and an implementation of power iteration. This concluded the course; I had a flick back through it. I will do a review of the material on Monday before moving onto the next course in the series.
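A minimal power-iteration sketch in numpy (my own toy version, not the course's starter code; it assumes the dominant eigenvalue is positive so the sign doesn't flip between iterations):

```python
import numpy as np

def power_iteration(A, num_iters=1000, tol=1e-10):
    """Estimate the dominant eigenvalue/eigenvector of a square matrix A."""
    v = np.ones(A.shape[0]) / np.sqrt(A.shape[0])
    for _ in range(num_iters):
        w = A @ v
        v_next = w / np.linalg.norm(w)
        if np.linalg.norm(v_next - v) < tol:   # convergence check (no sign flips)
            v = v_next
            break
        v = v_next
    eigenvalue = v @ A @ v    # Rayleigh quotient
    return eigenvalue, v

A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
lam, vec = power_iteration(A)
print(lam, vec)   # should approach A's largest eigenvalue and its eigenvector
```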
I completed the assignments for week 4 and worked through week 5 of Mathematics for Machine Learning: Linear Algebra. Week 5's topic was eigenvectors/eigenvalues.
I completed the timetabling exercise on InterviewCake.
I worked through week 4 of Mathematics for Machine Learning: Linear Algebra, which continued yesterday’s linear algebra review. Topics included:
I started my systematic transit through InterviewCake. I did the readings in the first two sections: “Algorithmic Thinking” and “Array and string manipulation”, which were a back-to-basics CS101-style intro to the rest of the material on the site.
I added some more thoughts to the applied track doc.
I completed the first three weeks of Mathematics for Machine Learning: Linear Algebra. This was a nice back-to-basics linear algebra review, which I didn't mind as it felt good to stake out some ground - topics included:
I signed up for and had a poke around on InterviewCake to get a feel for it. I'll start a more systematic transit through the material tomorrow.
I created a document to track project ideas for the applied track.
I met up with my study buddy, we compared notes and constructed our curriculum.
I reviewed:
This concluded my review of my notes from Andrew Ng's ML course.
I reviewed the curricula from some masters courses and made notes here.
Review of Andrew Ng's Machine Learning topics:
In the afternoon I continued working through chapter 3 of "Python Machine Learning".
Review of Andrew Ng's Machine Learning topics:
In the afternoon I started working through chapter 3 of "Python Machine Learning".
I took a look at the UCL machine learning masters syllabus and made notes here.
Revisited my notes from Andrew Ng's course and cross referenced some topics with some more advanced resources. I found the book I was a little afraid of - Bishop's PRML - challenging but well written and accessible.
I reviewed:
Morning working on my curriculum. Activities have included:
I hope to get a first draft out today and solicit some feedback.
I met up with a colleague in the afternoon. He's interested in pursuing a machine learning sabbatical of his own, which is fantastic news. After talking for a few hours I'm convinced that I should take a slightly more measured approach to planning my curriculum. I'm going to take a step back and explore the space of possible curricula a little further before seeking wider feedback. I plan to systematically review curricula offered by masters programmes and look at the requirements on ML job listings. I'll also continue to thumb through more resources and feel out what looks promising. I'm going to meet up with said colleague early next week; we intend to compare notes on what to include in what we plan on learning. We've agreed it makes sense to share a set of 'core modules' but to have the freedom to also pursue 'optional modules', so that we're not shackled to each other and can still pursue our own interests.
I'm going to spend the rest of this week engaging in the following activities:
A day off at Hampstead Heath. Felt out the Talking Machines Podcast, listening to the first 3 episodes.
In the first episode we were privy to a chat with Yann LeCun, Yoshua Bengio and Geoff Hinton. I'm aware of LeCun having downloaded the MNIST dataset from his website a while ago to do clj_mnist. I was also aware of Bengio as a co-author of the deep learning bible, and I've seen Hinton before in an interview with Andrew Ng in a Coursera course - these guys are the Deep Learning Mafia. We also met Kevin Murphy, who's a head honcho at Google and the author of Machine Learning: A Probabilistic Perspective, which is fighting it out with Pattern Recognition and Machine Learning to be the canonical advanced-level machine learning reference - I hope to graduate onto these books in the not too distant future (I bought PRML last week for the odd flick through, to gauge how deep the pool is).
In the second episode we met Ilya Sutskever, a deep learning fanboy working at Google. Amongst other things, he said he felt it was not well understood why it should be that gradient descent empirically appears to be an appropriate algorithm for finding good parameters for deep NNs, given the high-dimensional non-convex surface of the function they are tasked with optimizing. He linked this to the AI winter, saying that in the 80s/90s people had failed to train deep neural networks for other reasons (badly initialized weights in particular) and incorrectly concluded that the optimisation problem posed by deep NNs was intractable. In the third episode the host dug out a relevant paper from Yoshua Bengio's lab entitled "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization". The paper contains empirical and theoretical evidence that saddle points are more frequently the cause of slow training in large NNs than local minima, and it also proposes some approaches for tackling this problem.
Summary
I spun up this website and began feeling out some resources that I may decide to include in the curriculum.