Dirk Hovy
Text Analysis in Python for Social Scientists: Discovery and Exploration
Text Analysis in Python for Social Scientists: Discovery and Exploration
💎 Earn 88 Points when you buy this item.
YOU SAVE £0.29
- Condition: Brand new
- UK Delivery times: Usually arrives within 2 - 3 working days
- UK Shipping: Fee starts at £2.39. Subject to product weight & dimension
Bulk ordering. Want 15 or more copies? Get a personalised quote and bigger discounts. Learn more about bulk orders.
Couldn't load pickup availability
- More about Text Analysis in Python for Social Scientists: Discovery and Exploration
Text is a valuable resource for social scientists, but it is challenging to extract information due to its abundance and variability. AI has developed methods for text analysis (natural language processing), and many of these methods are available as Python implementations. This element will teach you when to use which method, the mathematical background, and the Python code to implement it.
Format: Paperback / softback
Length: 75 pages
Publication date: 21 January 2021
Publisher: Cambridge University Press
Text is an abundant resource for social scientists, but due to its vastness and linguistic variability, extracting the desired information can be challenging. To address this, there exists a dedicated subfield within Artificial Intelligence (AI) known as text analysis (natural language processing). Numerous fundamental analysis methods have been developed and are now readily accessible as Python implementations. This Element aims to guide you in selecting the appropriate method based on specific circumstances, providing insights into its mathematical foundations and offering Python code for its implementation.
Text analysis encompasses various techniques and approaches to extracting meaningful information from text data. These methods can be categorized into different stages, such as preprocessing, tokenization, stemming, lemmatization, part-of-speech tagging, and named entity recognition. Preprocessing involves cleaning and preparing the text for further analysis, such as removing punctuation, converting words to lowercase, or removing stopwords. Tokenization involves breaking down the text into individual words or phrases, while stemming reduces words to their base or root forms. Lemmatization further simplifies words by identifying their grammatical roots. Part-of-speech tagging assigns a grammatical category to each word, while named entity recognition identifies and categorizes entities such as persons, organizations, and locations within the text.
Each of these methods has its advantages and applications. For instance, tokenization is useful for analyzing large volumes of text and identifying specific keywords or phrases, while part-of-speech tagging can be used for language translation and text mining. Named entity recognition is particularly valuable in domains such as healthcare and finance, where identifying relevant information is crucial.
Python provides a wide range of libraries and frameworks that facilitate text analysis. Some popular libraries include NLTK (Natural Language Toolkit), SpaCy, and Gensim. These libraries offer a comprehensive set of tools and functionalities for text preprocessing, tokenization, stemming, lemmatization, part-of-speech tagging, and named entity recognition. They provide easy-to-use interfaces and support for various languages, making it convenient to implement text analysis in Python applications.
In conclusion, text analysis is a crucial field within AI that has the potential to transform the way we extract and analyze information from text data. With the availability of Python implementations and the wide range of libraries and frameworks, it has become easier than ever to implement text analysis techniques and gain valuable insights from text. Whether you are a social scientist, data analyst, or developer, learning text analysis can enhance your ability to extract meaningful information from text and make informed decisions based on the data.
Weight: 172g
Dimension: 152 x 228 x 10 (mm)
ISBN-13: 9781108819824
This item can be found in:
UK and International shipping information
UK and International shipping information
UK Delivery and returns information:
- Delivery within 2 - 3 days when ordering in the UK.
- Shipping fee for UK customers from £2.39. Fully tracked shipping service available.
- Returns policy: Return within 30 days of receipt for full refund.
International deliveries:
Shulph Ink now ships to Australia, Belgium, Canada, France, Germany, Ireland, Italy, India, Luxembourg Saudi Arabia, Singapore, Spain, Netherlands, New Zealand, United Arab Emirates, United States of America.
- Delivery times: within 5 - 10 days for international orders.
- Shipping fee: charges vary for overseas orders. Only tracked services are available for most international orders. Some countries have untracked shipping options.
- Customs charges: If ordering to addresses outside the United Kingdom, you may or may not incur additional customs and duties fees during local delivery.