Dynamic development of lexical sophistication

0 downloads 0 Views 38MB Size Report
Apr 19, 2017 - Gen der. TOEFL. ITP. # of EMI taken previously. # of words/w. (SD). VLT ... Low Range score: Domain Specific ... the mean concreteness score.
Dynamic development of lexical sophistication through a series of academic tasks: A semester-long study Masaki Eguchi Waseda University University of Hawaii at Manoa

The Seventh International Conference on Task-Based Language Teaching @ University of Barcelona 04/19/2017

Agenda 1) Literature review

A. Lexical Richness and Lexical sophistication B. Multidimensional lexical sophistication C. Usage-based lexical studies

2) Methodology 3) Results 4) Discussion 5) Conclusion

3

Literature Review 1

EMI and linguistic outcome English-Medium Instruction (EMI) • “Non-language subject taught through English” (Hellekjær, 2010, p.1) • A series of unfocused tasks (Ellis, 2003) • Language learning = “by product” (Taguchi, 2014a) Research in EMI settings q Pragmatic development q Academic literacy q Attitude, self-perception and L2 speech (Mahboob, 2014; Pessoa & Miller, 2014; Taguchi, 2014b; Suzuki et al., 2017)

Vocabulary retrieval : Most frustrating aspect for students in EMI in Japan (Suzuki et al., 2017)

4

Literature Review 2

Lexical Richness

(Daller et al., 2003; Lu, 2012; Skehan, 2009ab; see also Read, 2000)

What is rich vocabulary use?? Text-internal measures Diversity: the variety of words

Text-external measures Sophistication: the quality of words

# of different words

Corpus

(COCA,BNC etc.)

# of total words Oral/Written

Oral/Written

The two approaches seem to be independent constructs (Skehan, 2009a).

1000 75% 2000: 15% 3000: 7% Off: 3%

5

Literature Review 3

Multidimensional Lexical Sophistication Li and Lorenzo-Dus (2014): Conceptualization of “Lexical sophistication” • 25 raters commented on “sophistication” Búlte & Housen (2012) : Ø Sophistication extensively relies on

Frequency

e.g., Lexical Frequency Profile (Laufer & Nation, 1995), Lambda (Meara & Bell, 2001)

Need for investigating multidimensional lexical sophistication

Word length Technical vocabulary

Lexical sophistication

Colloquial words

Abstract concept

Descriptive vocabulary

6

Literature Review 4

Lexical Sophistication Revised • Multidimensional Lexical Sophistication has been shown to capture learners’ proficiency (Kyle & Crossley, 2015, 2016). Frequency

Range (dispersion)

Psycholinguistic property

•How infrequent a word is. •(Crossley, Cobb & McNamara, 2013)

Proportion of Multiword Units (Ngrams)

•How much a learner use Ngrams (contiguous words) in production •(Pawley & Syder, 1983; Wray, 2002)

•“How widely a word is used” •Indication of domain specific vocabulary •(Kyle & Crossley, 2015, p.4)

Association Strength of Multiword Units (Ngrams)

•How MWUs co-occur together in large corpora. •(Bestgen & Granger, 2014; Brezina et al., 2015)

•Concreteness, familiarity etc. •Crossley et al., (2011)

Among others (see Kyle & Crossley, 2015)… • Sense relations (hypernymy, polysemy) • Contextual distinctiveness

7

Literature Review 5

Usage-based, dynamic lexical development Usage-based model of L2 acquisition: (Ellis, 2002; Ellis & Cadierno, 2009; Ellis & Larsen-Freeman, 2006)

Study

Settings

Mode/Genre

Constructs

Findings

Crossley , ESL Kyle, Salsbury (1 yr) (2016)

Oral/ casual conversation in lab

Psycholinguistic Properties (e.g., Concreteness, Familiarity)

Less concrete, less familiar words produced (time and proficiency)

Bestgen & Granger (2014)

ESL (Semester )

Written

Ngram association strength (e.g., T score and MI)

• Decrease the overreliance of highly frequent Bigrams • Did not develop the use of infrequent Bigrams

Zheng (2016)

Intensive EFL (10 mths)

Written

Diversity Frequency-based Sophistication Lexical bundles

Dynamic development • Increase in D and Soph • U-shaped: Lexical bundles

Hou, Loerts & EFL Written No study in(18 mths: Few studies on Verspoor CBI classroom (2016) pre/post) oral mode

Range of chunk measures • Few study ratio measured multifaceted • Chunk lexical sophistication • Lexical/grammatical • chunks

Lexical chunks strongly correlated with holistic Significant improvement of lexical chunk and

8

Literature Review 6

Research Question How do learners develop their multidimensional lexical sophistication throughout the in-class opinion exchange/reasoning tasks in EMI, in terms of: a) b) c) d) e)

Frequency Range Concreteness Ngram proportion Ngram strength

9

Methodology 1

Participants • Three students (juniors) in an EMI course in a private university in Tokyo. • “Immersion (bilingual) education” (13 week * 2 hours = 26-28 hours/ semester) Gen TOEFL # of EMI der ITP taken previously

# of words/w (SD)

VLT 30

A F

530

6 (2)

46 (21)

113

B F

530

7 (2)

147.75 (83.47)

95

10

C M

570

8 (5)

349.18 (130.08) 112

**( ) shows the number of student-centered EMI

28 23

25 20

Vocabulary Levels Test

29 29 30

26 25

29 30 29 21

14

15

2

5

3

2

0

A

2000

B

3000

AWL

C

5000

10000

**98% of the token in textbook were within 6-7000 level

Learner A gave few opinions in the small group discussion (< 100 words/week). => Unable to process the text (due to reliability of the measures)

10

Methodology 2

Tasks in the target EMI

Pre

Beginning

Classroom

• Assigned readings (English: 10 – 15 pages / weekly)

• Weekly written quizzes (2 definitions & 1 discussion) • Two student presentations • Small group discussions (3-12 Qs) • Informal lecture by the instructor

Data collection site: 4 groups of 4 – 5 students One IC recorder in each group

11

Methodology 3

Data collection & handling • Audio recording of the in-class small discussions – 13 weeks of instruction (1 class canceled and combined).

• Transcribed by the researcher, and exclusion based on… – Verbatim responses by the participants – Repetitions of the other speakers – Direct quotation from the textbooks

• Discussion questions were coded by the researcher – Based on cognitive processes involved (Ellis, 2003) – Also allowing for open-coding to maximize the flexibility

12

Methodology 4

Types of discussion tasks in the current EMI Question types Opinion exchange (Ellis, 2003) Explaining/reasoning (Ellis, 2003) Definition of a term/ what is XX? Review questions Factual question based on the reading Causal connection/ inference by the data Compare/Contrast Role play Experiences/ preference? Past decision Explaining concepts with example Impression about the particular part of reading Impression about a model lesson Unidentifiable Total

# of Qs 34 22 16 4 3 1 1 1 1 1 1 1 4 90

Since these two tasks can elicit extended turn by the learners, production from these two tasks were combined and analyzed. => Constraints on text lengths

13

Methodology 5

Indices of Lexical Sophistication • Tool for Automatic Analysis of Lexical Sophistication 2.4 (TAALES, Kyle & Crossley, 2015) – Corpus of Contemporary American (COCA) Academic/ Spoken (Davies, 2009)

Frequency CW

• # of occurrence in Ref corpus

Ngram Proportion

•# of Ngram found in corpus/ total Ngrams •Similar to Chunk Ratio (Hou et al., 2016; )

Range CW

• # of texts that contain words in Ref corpus

Ngram Association strength

•MI2 (Evert, 2005) •(see also Bestgen & Granger, 2014, Li & Schmitt, 2010)

Concreteness CW

• How concrete mental image of a word is • (Brysbaert, 2014)

Lemmatized (run, runs, and ran gets the same score) Type counts = the same word counted only once in the same text (=> Controlled for the repetitions) çè Lexical diversity

14

Methodology 6

Indices of Lexical Sophistication| Range High Range score : General vocab know

70

Beg.

TAALES calculates

Int.

Adv.

policy term recruit formative recess

36 35 9 2 1

Low Range score: Domain Specific

the mean range score of the words for each text. e.g., I know about the policy. know => 70 policy => 36

Range score: 70 + 36 / 2 = 53

15

Methodology 7

Indices of Lexical Sophistication| Concreteness High concreteness score: Concrete

Beg.

Int.

Adv.

child graph

581 553

context impression definition normal responsibility

314 288 262 237 222

Low concreteness score: Abstract

TAALES calculates

the mean concreteness score of the words for each text.

16

Methodology 8

Indices of Lexical Sophistication| Association Strength High score: Strong association Adv.

Int.

academic achievement the kind of more emphasis on

13.73 11.25 10.01

but if they to deepen the the purpose be

6.80 5.30 4.79

Beg.

Low score: Weak association

TAALES calculates

the mean Ngram Association strength score of the words for each text.

17

Methodology 9

Analysis | Dynamic System Theory • Describing individual trajectories (= development in context) – Individual variability: “intrinsic property of the developmental process” (Verspoor et al., 2011). – “Flux and variability leading to stability signal self-organization and emergence” (LarsenFreeman, 2012, p.80; Larsen-Freeman & Cameron, 2008; Verspoor et al., 2011)

• Moving Min-Max graph (Verspoor et al., 2011; see example for Polat & Kim, 2013; Zheng, 2016) 2) Maximum among the 5 adjacent observations

10000

Increase in Maximum, but large variability ↓ Restructuring of the system

9000 8000 7000 6000

1) Observed data

5000 4000 3000 2000

3) Minimum among the 5 adjacent observations

1000 0

1

2

3

4

5 Data

6

7 Max2

8

9 Min2

10

11

12

13

Results

19

Results 1

Results | Overview A

B

C

Academic

Spoken

Academic

Spoken

Frequency

Fluctuate



Range

Shift to more general vocabulary

Slight trends to general vocab

Concreteness





Bigrams prop









Trigram prop









Bigram Strength













Trigram Strength

Fluctuate -> stable

** Excluding A from the analysis does not mean A’s development is invalid.

20

Results 2 | Range

Range ü B’s range shows a shift to more general words at the end. ü C’s range shows slight trends to general vocabulary.

B and C: Spoken Range CW

B and C: Academic Range CW

Ad v.

100

90 80 70

B

60

C

50

maxB

40

minB

30

MaxC

20 10

MinC

Discipline Specific

0 1

2

3

4

5

6

7

Time

MICASE 8

9

10

11

12

13

Percentage of texts in corpus (%)

Int.

Widely used

100

Percentage of texts in corpus (%)

Beg .

90 80 70

B

60

C

50

maxB

40

minB

30

MaxC

20

MinC

10

MICASE

0 1

2

3

4

5

6

7

Time

8

9

10

11

12

13

21

Results 3 | Bigram Proportion

Bigram Proportion ü B produces more bigrams towards the end (both academic and spoken). ü C produces relatively stable amount of bigrams. B: Spoken Bigram Proportion

Beg.

100

100

90

90

80

80

70

70

60

Data

50

Max

40

Min

30

MICASE

20 10 0

Proportion (%)

Int.

Proportion (%)

Adv

C_COCA_lemma_spoken_bi_prop_90k_TP

60

Data

50

Max2

40

Min2

30

MICASE

20 10

1

2

3

4

5

6

7

Time

8

9

10

11

12

13

0

1

2

3

4

5

6

7

Time

8

9

10

11

12

13

22

Results 4 | Trigram Proportion

Trigram Proportion ü B produces more Spoken Trigrams towards the end.. C: Spoken Trigram Proportion

Int. Beg.

100

100

90

90

80

80

70

70

60

Data

50

Max

40

Min

30

MICASE

20 10 0

Proportion (%)

Adv

Proportion (%)

B: Spoken Trigram Proportion

60

Data

50

Max2

40

Min2

30

MICASE

20 10

1

2

3

4

5

6

7

Time

8

9

10

11

12

13

0

1

2

3

4

5

6

7

Time

8

9

10

11

12

13

23

Results 5 | Trigram Association

Trigram Association Strength ü B and C experienced high variability ü C’s Trigram approaching to target like. Increase in upper boundary B and C: Academic Trigram Strength (MI2) 10

8.5

B

8

C

7.5

maxB

7

minB

6.5

Beg.

MaxC

6

Variability

5.5

MinC MICASE

5 1

2

3

4

5

6

7

Time

8

9

10

11

12

13

9.5

MI2 (Degree of associasion)

9

MI2

Int.

10

Approach more target like use

9.5

Adv

B and C: Spoken Trigram Strength (MI2)

9 8.5

B

8

C

7.5

More Stable

7 6.5

maxB minB MaxC

6

Variability

5.5

MinC MICASE

5 1

2

3

4

5

6

7

Time

8

9

10

11

12

13

24

Results 6 | Summary

Summary of Results C’s production 1) Stable production of less frequent 2) Ngram proportion: same 3) Ngram Ass Str (Spoken) increase

B’s production 1) General vocabulary 2) Ngram proportion increase 3) Ngram Ass Str more stable

A

B

C

Academic

Spoken

Academic

Spoken

Frequency

Fluctuate



Range

Shift to more general vocabulary

Slight trends to general vocab

Concreteness





Bigrams prop









Trigram prop









Bigram Strength













Trigram Strength

Fluctuate -> stable

25

Discussion 1

Discussion | Overview B’s production 1) General vocabulary 2) Ngram proportion increase 3) Ngram Ass Str more stable

C’s production 1) Stable production of less frequent 2) Ngram proportion: same 3) Ngram Ass Str (Spoken) increase

A. Individual trajectories 1) Development of Ngram proportion 2) Development of Ngram Association strength

B. The shift to general vocabulary use

26

Discussion 2

A) Individual trajectories of Ngrams TOEF L ITP B 530

VLT (5000 level)

# of past EMI

Lecture -based EMI

Studentcentered EMI

Amount of production in EMI

Ngram Proportion

Ngram Strength (MI2)

95 (14)

7

5

2

147.75 (83.47)

Increase (510%)

Less variability

C 570

112 8 3 5 349.18 (130.08) No change Increase (21) Usage-based acquisition of construction (Ellis, 2002) B’s development in Ngram proportion • increase the amount of “Formula” Low-scope Formula Construction • B learned conventionalized units of utterance patterns (Hou et al., 2016; Verspoor et al., 2012)

C’s development in Ngram Association Strength • increase strongly associated collocations (MI2) • ó Bestgen & Granger (2014) => C’s prior proficiency & Active participation in EMI

27

Discussion 3

A) Individual trajectories of Ngrams TOEF L ITP

VLT (5000 level)

# of past EMI

Lecture -based EMI

Studentcentered EMI

Amount of production in EMI

Ngram Proportion

Ngram Strength (MI2)

B 530

95 (14)

7

5

2

147.75 (83.47)

Increase (510%)

Less variability

C 570

112 (21)

8

3

5

349.18 (130.08)

No change

Increase

Teacher centered and student centered EMI 1) Initial proficiency level & receptive vocab

(Assigned Reading)

2) The type of EMI (exposure)

Lecture

Assigned Reading Student Presentation Discussion

Exam

3) The amount of production in the EMI classroom => Different usage experiences

Exam/ Term papers

Interactions between learning environment and individual trajectories can be dynamic.

28

Discussion 4

B) The shift to general vocabulary use ASSUMPTION: academic discourse = domain specific vocabulary (see Kyle & Crossley, 2015). FINDING: a slight shift to general vocabulary 1) Task types: Opinion exchange/ reasoning (interactive) – Decision making (interactive) => frequent words (Skehan, 2009ab, 2014)

2) Teacher’s role: encouragement for negotiation (e.g., confirmation checks & clarification requests etc.) – => strategic use of general vocabulary use enhanced

Type of vocabulary use required by the immediate task needs (see also Polat & Kim, 2014 for diversity measure)

29

Conclusion

Conclusion üThe unfocused academic tasks provided opportunity for the two learners to develop multifaceted lexical sophistication. 1) Relationships between individual profiles and development: ØInitial proficiency / Previous experiences in EMI ØThe amount of production in EMI

2) The interactive tasks and teacher’s encouragement for negotiation ØImmediate communicative needs => more general words towards the end of the semester.

30

Limitations

Limitations [Measurement] üWe need measures to see the student A’s trajectory, who produced less amount of production.

[Analysis] üItem analyses of the Ngrams the learner produced in each session üInferential statistics about the individual variability (e.g., Monte Carlo analysis)

[Generalizability] üThe results should be interpreted with caution because of the nature of the study (exploratory case study). To study the

generalizable pattern of development, further studies should investigate the learners with different profiles (Larsen-Freeman, 2012)

31

Further research

Further research [For the current project] ØTriangulation with interview data, learning logs, quasi-experimental speech data, as well as analysis of discourse of the classroom discussions

[Future projects] üHow different design and implementation of tasks influences: ü Learners’ usage-experiences in EMI (frequency, association, perceptual learning, interaction among the factors). ü Development of multidimensional lexical sophistication (potential interactions)

üMore longitudinal, dense learner corpora that document various aspects of learner characteristics (e.g., previous experiences, moment-by-moment experience in EMI etc.) to reveal a richer picture of how learning happens in EMI (see also Ellis et al., 2016).

Acknowledgements Dr. Tetsuo Harada (Waseda University) Dr. Kristopher Kyle (University of Hawaii at Manoa) Dr. Nicole Ziegler (University of Hawaii at Manoa) Shuhei Kudo and Ryo Moriya (Waseda University) for supporting the data collection

33

References • Bestgen, Y., & Granger, S. (2014). Quantifying the development of phraseological competence in L2 English writing: An automated approach. Journal of Second Language Writing, 26, 28–41. https://doi.org/10.1016/j.jslw.2014.09.004 • Brezina, V., Mcenery, T., & Wattam, S. (2015). Collocations in context A new perspective on collocation networks*. International Journal of Corpus Linguistics, 202(2015), 139–173. https://doi.org/10.1075/ijcl.20.2.01bre • Bulté, B., & Housen, A. (2012). Defining and operationalising L2 complexity. Dimensions of L2 Performance and Proficiency: Complexity; Accuracy and Fluency in SLA, 21–46. https://doi.org/10.1075/lllt.32.02bul • Chang, J.-Y., Kim, W., & Lee, H. (2015). A language support program for English-medium instruction courses: Its development and evaluation in an EFL setting. International Journal of Bilingual Education and Bilingualism, 50(September 2015), 1–19. https://doi.org/10.1080/13670050.2015.1080658 • Coyle, D. (2007). Content and language integrated learning: Towards a connected research agenda for CLIL pedagogies. International Journal of Bilingual Education and Bilingualism, 10(5), 543–562. https://doi.org/10.2167/beb459.0

34

References • Crossley, S. A., Cobb, T., & McNamara, D. S. (2013). Comparing count-based and bandbased indices of word frequency: Implications for active vocabulary research and pedagogical applications. System, 41(4), 965–981. https://doi.org/10.1016/j.system.2013.08.002 • Crossley, S. A., Salsbury, T., & Kyle, K. (2016). A Usage-Based Investigation of L2 Lexical Acquisition: The Role of Input and Output. The Modern Language Journal. https://doi.org/10.1111/modl.12344 • Crossley, S. A., Salsbury, T., Mcnamara, D. S., & Jarvis, S. (2011). Predicting lexical proficiency in language learner texts using computational indices. Language Testing, 28(4), 561–580. https://doi.org/10.1177/0265532210378031 • Daller, H., Van Hout, R., & Treffers-Daller, J. (2003). Lexical Richness in the Spontaneous Speech of Bilinguals. Applied Linguistics, 24(2), 197–222+267. https://doi.org/10.1093/applin/24.2.197 • Doiz, A., Lasagabaster, D., & Sierra, J. M. (2013). English-Medium Instruction at Universities: Global Challenges. Bristol, UK: Multilingual Matters. https://doi.org/10.1093/elt/cct045

35

References • Ellis, N. C. (2016). Cognition, Corpora, and Computing: Triangulating Research in Usage-Based Language Learning. Language Learning, 1–26. https://doi.org/10.1111/lang.12215\ • Ellis, R. (2003). Task-based language learning and teaching. Oxford University Press. https://doi.org/10.1017/S0272263104293056 • Evert, S. (2005). The Statistics of Word Cooccurrences Word Pairs and Collocations. Unpublished Doctoral Dissertation Institut Fur Maschinelle Sprachverarbeitung Universitat Stuttgart, 98(August 2004), 353. https://doi.org/10.1073/pnas.141413598 • Harada, T. (2017). Developing a content-based English as a foreign language program: Needs analysis and curriculum design at the university level. In M. A. Snow & D. M. Brinton (Eds.), The content-based classroom. Ann Arbor, MI: University of Michigan Press. • Hellekjaer, G. O. (2010). Lecture comprehension in English-medium higher education. Hermes, 45(45), 11–34. • Kyle, K., & Crossley, S. A. (2015). Automatically Assessing Lexical Sophistication: Indices, Tools, Findings, and Application. TESOL Quarterly, 49(4), 757–786. https://doi.org/10.1002/tesq.194 • Li, H., & Lorenzo-Dus, N. (2014). Investigating how vocabulary is assessed in a narrative task through raters’ verbal protocols. System, 46(1), 1–13. https://doi.org/10.1016/j.system.2014.06.006

36

References • Lu, X. (2012). The Relationship of Lexical Richness to the Quality of ESL Learners’ Oral Narratives. Modern Language Journal, 96(2), 190–208. https://doi.org/10.1111/j.15404781.2011.01232_1.x • Pawley, A., & Syder, F. H. (1983). Two puzzles for linguistic theory: Nativelike selection and nativelike fluency. Language and Communication, 191, 225. • Polat, B., & Kim, Y. (2014). Dynamics of complexity and accuracy: A longitudinal case study of advanced untutored development. Applied Linguistics, 35(2), 184–207. https://doi.org/10.1093/applin/amt013 • Read, J. A. S. (2000). Assessing vocabulary. Cambridge : Cambridge . • Skehan, P. (2009a). Lexical performance by native and non-native speakers on languagelearning tasks. Vocabulary Studies in First and Second Language Acquisition: The Interface between Theory and Application, 107–124. https://doi.org/10.1057/9780230242258 • Skehan, P. (2009b). Modelling second language performance: Integrating complexity, accuracy, fluency, and lexis. Applied Linguistics, 30(4), 510–532. https://doi.org/10.1093/applin/amp047

37

References • Taguchi, N. (2014a). English-medium education in the global society. International Review of Applied Linguistics in Language Teaching, 52(2), 89–98. https://doi.org/10.1515/iral-2014-0004 • Taguchi, N. (2014b). Pragmatic socialization in an English-medium university in Japan. IRAL - International Review of Applied Linguistics in Language Teaching, 52(2), 157–181. https://doi.org/10.1515/iral-2014-0007 • Zheng, Y. (2016). The complex, dynamic development of L2 lexical use: A longitudinal study on Chinese learners of English. System, 56(220), 40–53. https://doi.org/10.1016/j.system.2015.11.007 • Verspoor, M., De Bot, K., & Lowie, W. (2011). A dynamic approach to second language development: Methods and techniques (Vol. 29). John Benjamins Publishing. • Verspoor, M., & van Dijk, M. (2013). Variability in a Dynamic Systems Theory Approach to Second Language acquisition. The Encyclopedia of Applied Linguistics, 1–9. https://doi.org/10.1002/9781405198431 • Wray, A. (2002). Formulaic language and the lexicon. Cambridge, U.K. : Cambridge, U.K. . • Zheng, Y. (2016). The complex, dynamic development of L2 lexical use: A longitudinal study on Chinese learners of English. System, 56(220), 40–53. https://doi.org/10.1016/j.system.2015.11.007

Mahalo! Thank you for your attention!! TBLT 2017 Barcelona