ASTM F2889-2011 Standard Practice for Assessing Language Proficiency《评估语言水平的标准操作规程》.pdf
《ASTM F2889-2011 Standard Practice for Assessing Language Proficiency《评估语言水平的标准操作规程》.pdf》由会员分享,可在线阅读,更多相关《ASTM F2889-2011 Standard Practice for Assessing Language Proficiency《评估语言水平的标准操作规程》.pdf(24页珍藏版)》请在麦多课文档分享上搜索。
1、Designation: F2889 11Standard Practice forAssessing Language Proficiency1This standard is issued under the fixed designation F2889; the number immediately following the designation indicates the year oforiginal adoption or, in the case of revision, the year of last revision. A number in parentheses
2、indicates the year of last reapproval. Asuperscript epsilon () indicates an editorial change since the last revision or reapproval.1. Scope1.1 PurposeThis practice describes best practices for thedevelopment and use of language tests in the modalities ofspeaking, listening, reading, and writing for
3、assessing abilityaccording to the Interagency Language Roundtable (ILR)2scale. This practice focuses on testing language proficiency inuse of language for communicative purposes.1.2 LimitationsThis practice is not intended to addresstesting and test development in the following specialized areas:Tra
4、nslation, Interpretation, Audio Translation, Transcription,other job-specific language performance tests, or DiagnosticAssessment.1.2.1 Tests developed under this practice should not be usedto address any of the above excluded purposes (for example,diagnostics).2. Referenced Documents2.1 ASTM Standa
5、rds:3F1562 Guide for Use-Oriented Foreign Language Instruc-tionF2089 Guide for Language Interpretation ServicesF2575 Guide for Quality Assurance in Translation3. Terminology3.1 Definitions:3.1.1 achievement test, nan instrument designed to mea-sure what a person has learned within or up to a given t
6、imebased on a sampling of what has been covered in the syllabus.3.1.2 adaptive test, nform of individually tailored testingin which test items are selected from an item bank where testitems are stored in rank order with respect to their itemdifficulty and presented to test takers during the test on
7、thebasis of their responses to previous items, until it is determinedthat sufficient information regarding test takers abilities hasbeen collected. The opposite of a fixed-form test.3.1.3 authentic texts, ntexts not created for languagelearning purposes that are taken from newspapers, magazines,etc.
8、, and tapes of natural speech taken from ordinary radio ortelevision programs, etc.3.1.4 calibration, nthe process of determining the scale ofa test or tests.3.1.4.1 DiscussionCalibration may involve anchoringitems from different tests to a common difficulty scale (thetheta scale). When a test is co
9、nstructed from calibrated itemsthen scores on the test indicate the candidates ability, i.e. theirlocation on the theta scale.3.1.5 cognitive lab, na method for eliciting feedback fromexaminees with regard to test items.3.1.5.1 DiscussionSmall numbers of examinees take thetest, or subsets of the ite
10、ms on the test, and provide extensivefeedback on the items by speaking their thought processesaloud as they take the test, answering questionnaires about theitems, being interviewed by researchers, or other methodsintended to obtain in-depth information about items. Theseexaminees should be similar
11、to the examinees for whom thetest is intended. For tests scored by raters, similar techniquesare used with raters to obtain information on rubric function-ing.3.1.6 computer adaptive test, na test administered by acomputer in which the difficulty level of the next item to bepresented to test takers
12、is estimated on the basis of theirresponses to previous items and adapted to match theirabilities.3.1.7 construct, nthe knowledge, skill or ability that isbeing tested.3.1.7.1 DiscussionThe construct provides the basis for agiven test or test task and for interpreting scores derived fromthis task.3.
13、1.8 constructed response, adja type of item or test taskthat requires test takers to respond to a series of open-endedquestions by writing, speaking, or doing something rather thanchoose answers from a ready-made list.1This practice is under the jurisdiction of ASTM Committee F43 on LanguageServices
14、 and Products and is the direct responsibility of Subcommittee F43.04 onLanguage Testing.Current edition approved May 1, 2011. Published June 2011. DOI: 10.1520/F2889-11.2Interagency Language Roundtable, Language Skill Level Descriptors (http:/www.govtilr.org/Skills/ILRscale1.htm).3For referenced AS
15、TM standards, visit the ASTM website, www.astm.org, orcontact ASTM Customer Service at serviceastm.org. For Annual Book of ASTMStandards volume information, refer to the standards Document Summary page onthe ASTM website.1Copyright ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshoh
16、ocken, PA 19428-2959, United States.3.1.8.1 DiscussionThe most commonly used types ofconstructed-response items include fill-in, short-answer, andperformance assessment.3.1.9 content validity, na conceptual or non-statisticalvalidity based on a systematic analysis of the test content todetermine whe
17、ther it includes an adequate sample of the targetdomain to be measured.3.1.9.1 DiscussionIn order to achieve content validity, anadequate sample involves ensuring that all major aspects arecovered and in suitable proportions.3.1.10 criterion-referenced scale, na graduated and sys-tematic description
18、 of the domain of subject matter that a testis designed to assess; (or) a rating scale that provides fortranslating test scores into a statement about the behavior to beexpected of a person with that score and/or their relationship toa specified subject matter.3.1.10.1 DiscussionA criterion-referenc
19、ed test is one thatassesses achievement or performance against a cut score that isdetermined as a reflection of mastery or attainment of specifiedobjectives. Focus is on ability to perform tasks rather thangroup ranking.3.1.11 cut score, na score that represents achievement ofthe criterion, the line
20、 between success and failure, mastery andnon-mastery.3.1.12 dichotomous scoring, nscoring based on two cat-egories, e.g., right/wrong, pass/fail. Compare to polytomousscoring.3.1.13 equated forms, ntwo or more forms of a test whosetest scores have been transformed onto the same scale so thata compar
21、ison across different forms of a test is made possible.3.1.14 expert panel, na group of target-language expertswho take a test under test-like conditions and provide com-ments about any problem areas.3.1.14.1 DiscussionAn expert panel should include atleast 8 members. Panel members receive training
22、before theytake the test in order to ensure that their comments will behelpful.3.1.15 face validity, nthe degree to which a test appears tomeasure the knowledge or abilities it claims to measure, basedon the subjective judgment of an observer.3.1.16 fixed-form test, na test whose content does not va
23、ryin order to better accommodate to the examinees level ofknowledge, skill, ability or proficiency. The opposite of anadaptive test.3.1.17 genre, na type of discourse that occurs in aparticular setting, that has distinctive and recognizable patternsand norms of organization and structure, and that h
24、as particularand distinctive communicative functions.3.1.18 ILR scale, na scale of functional language abilityof 0 to 5 used by the Interagency Language Roundtable.23.1.18.1 DiscussionThe range of the ILR scale is from0no knowledge of a language to 5equivalent to a highlyeducated native speaker.3.1.
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
5000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ASTMF28892011STANDARDPRACTICEFORASSESSINGLANGUAGEPROFICIENCY 评估 语言 水平 标准 操作规程 PDF

链接地址:http://www.mydoc123.com/p-539665.html