...

Full Bio

Artificial Intelligence Bias And The People Factor In AI Development

today

Awesome Programming Language Practices Every Programmer Should Follow

today

How Does Data Science Influence Everyday Life For The Better

yesterday

How Does Programming Language Influence Daily Life - Life Of A Programmer

yesterday

10 Things Only A Programming Language Programmer Can Understand

2 days ago

Which Programming Languages in Demand & Earn The Highest Salaries?

382077 views

Top 10 Best Countries for Software Engineers to Work & High in-Demand Programming Languages

311235 views

50+ Data Structure, Algorithms & Programming Languages Interview Questions for Programmers

228648 views

100+ Data Structure, Algorithms & Programming Language Interview Questions Answers for Programmers - Part 1

202767 views

Why I Studied Full-time 8 Months For A Google Programming Language Interview

153858 views

### Why We Need to Forget 'For-Loop' for Data Science Code And Embrace Vectorization

- ndarray, a fast and space-efficient multidimensional array providing vectorized arithmetic operations and sophisticated broadcasting capabilities
- Standard mathematical functions for fast operations on entire arrays of data without having to write loops

You will often come across this assertion in the data science, machine learning, and Python community that Numpy is much faster due to its vectorized implementation and due to the fact that many of its core routines are written in C (based on CPython framework).

- Create a list of a moderately large number of floating point numbers, preferably drawn from a continuous statistical distribution like a Gaussian or Uniform random. I chose 1 million for the demo.
- Create a ndarray object out of that list i.e. vectorize.
- Write short code blocks to iterate over the list and use a mathematical operation on the list say taking logarithm of base 10. Use for-loop, map-function, and list-comprehension. Each time use time.time() function to determine how much time it takes in total to process the 1 million records.

- Do the same operation using Numpy's built-in mathematical method (np.log10) over the ndarray object. Time it.

- Store the execution times in a list and plot a bar chart showing the comparative difference.