{"id":67425,"date":"2022-09-13T12:24:15","date_gmt":"2022-09-13T10:24:15","guid":{"rendered":"https:\/\/eiposgrados.com\/?p=67425"},"modified":"2022-09-14T15:39:37","modified_gmt":"2022-09-14T13:39:37","slug":"the-best-python-libraries-for-data-science","status":"publish","type":"post","link":"https:\/\/eiposgrados.com\/eng\/blog-python\/las-mejores-librerias-de-python-para-ciencia-de-datos\/","title":{"rendered":"Best Python Libraries for Data Science"},"content":{"rendered":"<p>In this article, we will talk about the most popular Python libraries for working as a data scientist and for developing machine learning algorithms that solve complex real-world problems:<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>1.<\/strong> <strong>pandas<\/strong><\/h2>\n\n\n\n<p><strong>pandas <\/strong>is a free, open-source Python library for data analysis and processing. It was created as a community library project and was originally published around 2008.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13.png\" alt=\"Image\" class=\"wp-image-67427\" width=\"886\" height=\"358\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13.png 886w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13-300x121.png 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13-768x310.png 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13-123x50.png 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-13-200x81.png 200w\" sizes=\"(max-width: 886px) 100vw, 886px\" \/><\/figure>\n\n\n\n<p>It provides powerful, easy-to-use data structures and operations for processing numerical tables and time series. pandas also has several tools for reading and writing data between in-memory data structures and different file formats. 
<\/p>\n\n\n\n<p>In short, it is well suited to quick and easy data manipulation, data aggregation, data reading and writing, and data visualization. pandas can also load data from various file types such as CSV and Excel, or from a SQL database, into a Python object called a DataFrame. A DataFrame contains rows and columns and can be manipulated with operations such as <em>join<\/em>, <em>merge<\/em>, and <em>group by<\/em>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>2. NumPy<\/strong><\/h2>\n\n\n\n<p><strong>NumPy <\/strong>is a free, open-source Python library for numerical computing on data that is usually represented as large, multidimensional arrays.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"886\" height=\"399\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15.png\" alt=\"Image\" class=\"wp-image-67429\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15.png 886w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15-300x135.png 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15-768x346.png 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15-123x55.png 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-15-200x90.png 200w\" sizes=\"(max-width: 886px) 100vw, 886px\" \/><\/figure>\n\n\n\n<p>These multidimensional arrays are the main objects of NumPy. Their dimensions are called axes, and the number of axes is called the rank. NumPy also provides several tools for working with these arrays, along with high-level mathematical functions for linear algebra, Fourier transforms, random number generation, etc. Some of the basic array operations that can be performed with NumPy include adding, slicing, multiplying, reducing, reshaping, and indexing arrays. 
Other advanced features include stacking arrays, splitting arrays, broadcasting arrays, etc.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>3. SciPy<\/strong><\/h2>\n\n\n\n<p>SciPy is a free, open-source library for scientific and technical computing.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"400\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16.png\" alt=\"Image\" class=\"wp-image-67459\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16.png 800w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16-300x150.png 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16-768x384.png 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16-123x62.png 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-16-200x100.png 200w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<p>It was created as a community library project and was originally published around 2001. The SciPy library is built on the NumPy array object and is part of the NumPy stack, which also includes other scientific computing libraries and tools such as Matplotlib, SymPy, pandas, etc. The NumPy stack has a number of features for scientific and technical computing. <\/p>\n\n\n\n<p>The NumPy stack has a user base similar to that of applications such as MATLAB, GNU Octave, Scilab, etc. SciPy supports scientific computing tasks such as optimization, numerical integration, interpolation, and data transformation, with modules for linear algebra, Fourier transforms, random number generation, special functions, etc. 
As in NumPy, the main objects in SciPy are multidimensional arrays, which are provided by the NumPy module itself.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>4.<\/strong> <strong>Scikit-learn<\/strong><\/h2>\n\n\n\n<p>Scikit-learn is a free, open-source machine learning library for the Python programming language. It was originally developed as a Google Summer of Code project by David Cournapeau and first published in June 2007.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"886\" height=\"601\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17.jpg\" alt=\"Image\" class=\"wp-image-67460\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17.jpg 886w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-300x203.jpg 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-310x210.jpg 310w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-768x521.jpg 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-243x165.jpg 243w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-123x83.jpg 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-200x136.jpg 200w\" sizes=\"(max-width: 886px) 100vw, 886px\" \/><\/figure>\n\n\n\n<p>Scikit-learn is built on NumPy and SciPy and works alongside other Python libraries such as Matplotlib and pandas, so it interoperates well with them. Although Scikit-learn is mainly written in Python, some of its core algorithms are written in Cython to improve performance. Scikit-learn implements a variety of supervised and unsupervised machine learning models for tasks such as classification, regression, and clustering, including support vector machines, random forests, nearest neighbors, naive Bayes, decision trees, etc.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>5. 
TensorFlow<\/strong><\/h2>\n\n\n\n<p>TensorFlow is a free, open-source platform with a wide range of artificial intelligence tools, libraries and resources. It was developed by the Google Brain team and released on November 9, 2015.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"886\" height=\"297\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17.png\" alt=\"Image\" class=\"wp-image-67461\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17.png 886w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-300x101.png 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-768x257.png 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-123x41.png 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-17-200x67.png 200w\" sizes=\"(max-width: 886px) 100vw, 886px\" \/><\/figure>\n\n\n\n<p>With TensorFlow, you can easily create and train machine learning models with high-level APIs like Keras. It also offers multiple levels of abstraction so you can choose the one your model needs. TensorFlow also allows you to deploy machine learning models anywhere: in the cloud, in the browser, or on your own device. Use TensorFlow Extended (TFX) if you need a full production pipeline, TensorFlow Lite if you want to run models on mobile devices, and TensorFlow.js if you want to train and deploy models in JavaScript environments. TensorFlow offers stable Python and C APIs, as well as APIs for C++, Java, JavaScript, Go, Swift, etc. that come with no guarantee of backward compatibility. Third-party packages are also available for MATLAB, C#, Julia, Scala, R, Rust, etc.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>6. Keras<\/strong><\/h2>\n\n\n\n<p>Keras is a free and open source neural network library written in Python. 
It was created primarily by Fran\u00e7ois Chollet, a Google engineer, and published on March 27, 2015.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"297\" src=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-1024x297.png\" alt=\"Image\" class=\"wp-image-67462\" title=\"\" srcset=\"https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-1024x297.png 1024w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-300x87.png 300w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-768x223.png 768w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-123x36.png 123w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18-200x58.png 200w, https:\/\/eiposgrados.com\/wp-content\/uploads\/2022\/09\/image-18.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>It was designed to be easy to use, extensible and modular, and to support experimentation with deep neural networks. It can run on top of libraries such as TensorFlow, Theano, and Microsoft Cognitive Toolkit, and can also be used from other languages such as R. Keras has several tools that make it easier to prepare different types of image and text data for use in deep neural networks. It also provides implementations of neural network building blocks such as layers, optimizers, activation functions, objectives (loss functions), etc. 
Keras also lets you define your own components, such as custom layers.<\/p>\n\n\n\n<p>Which libraries do you work with the most?<\/p>\n\n\n\n<p>Let us know in the comments!<\/p>\n\n\n\n<p>Remember that you can learn much more with our <a href=\"https:\/\/eiposgrados.com\/eng\/programs\/master-in-python\/\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Master in Advanced Programming in Python for Big Data, Hacking and Machine Learning.<\/strong><\/a><\/p>","protected":false},"excerpt":{"rendered":"<p>In this article, we will talk about the most popular Python libraries for working as a data scientist and for developing machine learning algorithms\u2026 <a title=\"Best Python Libraries for Data Science\" class=\"read-more\" href=\"https:\/\/eiposgrados.com\/eng\/blog-python\/las-mejores-librerias-de-python-para-ciencia-de-datos\/\" aria-label=\"Read more about Best Python Libraries for Data Science\">Read more<\/a><\/p>","protected":false},"author":90,"featured_media":67755,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[407],"tags":[],"class_list":["post-67425","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog-python"],"acf":[],"_links":{"self":[{"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/posts\/67425","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/users\/90"}],"replies":[{"embeddable":true,"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/comments?post=67425"}],"version-history":[{"count":0,"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/posts\/67425\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"htt
ps:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/media\/67755"}],"wp:attachment":[{"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/media?parent=67425"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/categories?post=67425"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/eiposgrados.com\/eng\/wp-json\/wp\/v2\/tags?post=67425"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}