📚 ANSWER KEY 📚

TEACHER EDITION

Text Analysis Worksheet

Lesson 10: How AI Understands Language - NLP Basics

📝 Note for Teachers: This answer key provides sample correct answers and guidance for grading. Student responses may vary in wording but should demonstrate similar understanding. Use the rubric at the end of this document to assess student work holistically. Remember that the goal is to help students understand how NLP systems process language, not to achieve perfect technical accuracy.

📄 Text Passage (Reference)

Passage A (Grades 6-8):

Last Tuesday, Emma Rodriguez visited the new Artificial Intelligence Museum in San Francisco. She was amazed by the interactive exhibits showcasing how computers can understand human language. "This technology is incredible!" Emma exclaimed to her friend Marcus. The museum opened three months ago and has already welcomed over 50,000 visitors. One exhibit demonstrated how chatbots use Natural Language Processing to answer questions. Emma learned that AI systems break down sentences into smaller parts called tokens. She found it fascinating that computers need step-by-step instructions to understand what humans grasp naturally. Before leaving, Emma purchased a robotics kit for $45.99 at the gift shop. She can't wait to explore more about AI technology!

✅ Task Answers

1
Tokenization - Breaking Text into Words

Answers:

a) Total number of tokens: Approximately 104-110 tokens (count may vary based on how punctuation is counted)
b) Tokens with punctuation: Approximately 8-10
c) Examples of challenging tokens:
d) Why tokenization is difficult:

Sample answer: Tokenization is difficult for computers because language has many special cases. Contractions need to be split or kept together, numbers can include commas and decimals, hyphenated words might be one word or multiple, and punctuation can be part of a word (like in "$45.99") or separate. Computers need rules for every possible situation, unlike humans who understand context naturally.

Teaching Note: Accept any reasonable token count between 100-115. The exact number depends on methodology. The important concept is that students understand tokenization involves systematic word separation and that edge cases (contractions, numbers, hyphenation) create challenges.
2
Part-of-Speech (POS) Tagging - Identifying Word Types

Sample Answers (15 words):

# Word POS Explanation
1TuesdayN (Noun)Name of a day - proper noun
2visitedV (Verb)Action word - past tense of visit
3newAdj (Adjective)Describes the museum
4amazedV (Verb)Past tense verb showing action/state
5interactiveAdj (Adjective)Describes the exhibits
6howAdv (Adverb)Modifies "understand" - in what way
7incredibleAdj (Adjective)Describes technology
8exclaimedV (Verb)Action of speaking loudly
9threeAdj (Adjective)Number describing months
10alreadyAdv (Adverb)Modifies "welcomed" - when/time
11demonstratedV (Verb)Action word - past tense
12smallerAdj (Adjective)Comparative adjective describing parts
13naturallyAdv (Adverb)Modifies "grasp" - in what manner
14purchasedV (Verb)Action of buying - past tense
15roboticsN (Noun)Thing/concept - name of field

Reflection Answers:

a) Difficult words to categorize:

Sample answer: "opened" was tricky because it could be a verb (The museum opened) or an adjective (the opened box). In this sentence it's a verb. Also, "three" could be a number, but in grammar it's functioning as an adjective describing "months."

b) Same word, different parts of speech:

Sample answer: Yes! The word "light" can be different POS:

Teaching Note: Student word choices will vary - that's expected! Grade based on correct identification of the part of speech for their chosen words, not on matching this exact list. Common challenges: distinguishing between past tense verbs and past participles used as adjectives, identifying adverbs that don't end in "-ly," and recognizing numbers as adjectives.
3
Named Entity Recognition (NER) - Finding Key Information

Complete List of Named Entities:

Entity Category Importance
Last TuesdayDATE/TIMETells when the event occurred
Emma RodriguezPERSONMain character of the story
Artificial Intelligence MuseumORGANIZATIONPlace where events happen
San FranciscoLOCATIONGeographic location of museum
MarcusPERSONEmma's friend mentioned in story
three months agoDATE/TIMEWhen museum opened
50,000QUANTITYNumber of visitors
Natural Language ProcessingORGANIZATION/CONCEPTKey technology discussed
$45.99MONEYPrice of robotics kit
Emma (second mention)PERSONContinued reference to main character

Analysis Answers:

a) Total entities: 10 distinct named entities (some names appear multiple times)
b) Category with most entities: DATE/TIME and PERSON (tied with 2-3 each, depending on if "Emma" mentions are counted separately)
c) Real-world uses of NER:

Sample answers:

d) Challenges in identifying entities:

Sample answer: Computers face several challenges: (1) Names can be common words (Apple is both a fruit and a company), (2) Context matters to determine entity type, (3) New names and entities are created all the time, (4) Different cultures have different naming patterns, (5) Abbreviations and nicknames need to be linked to full names.

Teaching Note: Some students may debate whether "Natural Language Processing" is an organization or a concept/technology. Accept either interpretation with reasonable justification. The key is recognizing it as a named entity that's important to the passage.
4
Sentiment Analysis - Understanding Emotional Tone

Sentiment Assessment:

a) Overall sentiment: ☑ Positive (or Very Positive)
b) Explanation:

Sample answer: The passage is clearly positive because Emma is described as "amazed," she calls the technology "incredible," she finds things "fascinating," and she "can't wait" to explore more. There are no negative statements or complaints. The entire tone is enthusiastic and excited about AI and the museum visit.

Emotion Words:

POSITIVE Words/Phrases NEGATIVE Words/Phrases
  • amazed
  • incredible
  • exclaimed (with excitement)
  • fascinating
  • can't wait
  • explore more
  • new (in positive context)
  • interactive
  • None found in this passage

Analysis Answers:

a) Neutral statements (facts):
b) Sarcasm changing sentiment:

Sample answer: If read sarcastically, "incredible" could mean the opposite - that the technology is disappointing or unimpressive. For example, if Emma said "This technology is incredible" with an eye roll, it would actually be negative. This shows why sarcasm is so hard for AI - the same words have opposite meanings depending on tone and context.

c) Computer challenges in detecting sentiment:

Sample answer: Computers would struggle because: (1) They can't hear tone of voice, (2) They might not understand that "can't wait" is positive enthusiasm, not literally being unable to wait, (3) Context matters - "incredible" could be positive or sarcastic, (4) Cultural differences in how emotions are expressed, (5) Subtle emotional nuances that require life experience to understand.

d) Real-world sentiment analysis:

Sample answers:

5
Key Phrase Extraction & Summary

Main Topic and Key Phrases:

a) Main topic:

Sample answers (students should write something similar):

b) Five key phrases:
c) 20-word summary:

Sample answer: "Emma visited an AI museum in San Francisco and learned how computers use natural language processing to understand human language."

(Exactly 20 words)

Semantic Understanding:

a) Who is "She"?

Emma Rodriguez

b) How did you know?

Sample answer: The previous sentences are all about Emma - she was amazed, she exclaimed, Emma learned. "She" comes right after talking about what Emma learned, so it must refer to Emma. Also, Emma is the only female person mentioned who would be finding things fascinating at the museum.

c) Why is this difficult for computers?

Sample answer: Computers have to track which nouns the pronoun "she" could refer back to. In this passage, "she" appears multiple times and could theoretically refer to any female person mentioned. Computers need complex rules to understand pronoun references (called "anaphora resolution") and must consider sentence structure, proximity, and context - things humans do automatically without thinking.

Teaching Note: The pronoun resolution question is crucial - it demonstrates semantic understanding. Good answers will explain how humans use context clues and implicit reasoning, while computers need explicit rules and algorithms to make the same connections.
6
Reflection - Connecting to NLP Concepts

Sample Reflection Answers:

1. Easiest task:

Sample answers might include:

2. Most challenging task:

Sample answers might include:

3. Manual vs. computer analysis:

Sample answer: "When I analyzed manually, I used my understanding of context, grammar rules I learned, and common sense. A computer has to follow strict algorithms and rules for every step. I can 'just know' what a word means, but a computer has to check thousands of patterns and examples. Humans are flexible; computers are systematic."

4. Biggest NLP challenge:

Sample answers:

5. NLP in daily life:

Sample answers:

6. Designing an NLP application:

Sample answers (should demonstrate understanding of NLP concepts):

Teaching Note: Reflection answers will be highly personal and varied. Look for evidence that students: (1) understand the core NLP concepts, (2) recognize challenges computers face that humans don't, (3) can connect NLP to real-world applications, and (4) demonstrate critical thinking about technology and its implications.

📊 Grading Rubric

Suggested Point Distribution (Total: 100 points)

Task/Section Points Criteria
Task 1: Tokenization 15 Reasonable token count (10 pts), identified challenging tokens (3 pts), explained difficulty (2 pts)
Task 2: POS Tagging 20 Correctly identified 15 words and POS (12 pts), reasonable explanations (5 pts), reflection questions (3 pts)
Task 3: Named Entity Recognition 20 Identified 8+ entities correctly (12 pts), proper categorization (5 pts), analysis questions (3 pts)
Task 4: Sentiment Analysis 15 Correct overall sentiment (3 pts), identified emotion words (6 pts), thoughtful analysis answers (6 pts)
Task 5: Key Phrases & Summary 15 Accurate main topic (3 pts), appropriate key phrases (5 pts), concise summary (4 pts), semantic understanding (3 pts)
Task 6: Reflection 15 Thoughtful, detailed responses showing understanding of NLP concepts and challenges (2.5 pts per question)

Overall Quality Assessment:

Score Range Description
90-100 (A) Exceptional understanding of NLP concepts. Accurately completed all tasks with minimal errors. Reflections demonstrate deep thinking about how computers process language vs. humans. Clear connections to real-world applications.
80-89 (B) Strong understanding of NLP concepts. Most tasks completed accurately with few errors. Reflections show good thinking about computer language processing. Some connections to real-world applications.
70-79 (C) Basic understanding of NLP concepts. Tasks completed with several errors or omissions. Reflections are present but may lack depth. Limited connections to real-world applications.
60-69 (D) Partial understanding of NLP concepts. Many tasks incomplete or inaccurate. Reflections are minimal or superficial. Struggles to connect concepts to applications.
Below 60 (F) Limited understanding of NLP concepts. Most tasks incomplete or incorrect. Minimal effort in reflections. Does not demonstrate basic understanding of how computers process language.
📌 Additional Grading Guidelines: