المقرر الثالث اختيار العينة.

المقرر الثالث اختيار العينة

أهداف المقرر بنهاية هذا المقرر سوف تكون قادراً على ما يلي:
اتخاذ قرار حول متى وكيف تقوم بتطبيق كل طريقة من طرق اختيار العينات فهم العوامل التي تؤثر على حجم العينة استخدام هذه العوامل لتقرير حجم العينة المناسب لبرنامج ENA Notes for trainers: This module is based on ENA Delta, but it can be easily adapted to ENA or ENA Beta.

سؤال سريع هل ينبغي علينا دائماً استخدام طرق اختيار العينة؟ 5 دقائق
هل ينبغي علينا دائماً استخدام طرق اختيار العينة؟ Notes for trainers: Answer: No. It is sometimes possible to survey the whole population, as long as it is not too numerous and is quite concentrated geographically.

المسح الشامل يتم أخذ القياسات من كامل السكان
Stankovic Camp II, Skopje, Macedonia, 1999 يتم أخذ القياسات من كامل السكان صورة دقيقة عن الوضع التغذوي و/أو معدلات الوفيات في الحالات الاستثنائية Notes for trainers : In situations such as this one (in a camp), it is possible to take measurements of all the children. By definition, sampling is not needed here. Example: An organization would like to know the malnutrition rate in a small camp that it is responsible for. There are between 2,000 and 3,000 people in this camp. The camp is composed of roughly 400 to 600 children less than 5 years old. In this case, ALL the children can be measured. The results will therefore represent the exact situation of the children in the camp. For larger populations, an exhaustive survey will be long, expensive, difficult and unnecessary. يتم تنفيذه فقط للسكان الذين يتركزون في منطقة جغرافية واحدة أكثر من 1000 فرد لا يمكن استقراء النتائج

اختيار العينة ... لأننا لا نستطيع أن نقيس كل شخص في السكان
ما هو الضروري لتحقيق هذا الهدف؟ لماذا عينة؟ ... لأننا لا نستطيع أن نقيس كل شخص في السكان ... لتقدير حالة معينة في أوساط السكان ينبغي أن تكون العينة تمثيلية Notes for trainers : Brainstorm these questions with the participants. Example: An organization would like to know the malnutrition rate of a district. Imagine that the entire population is estimated at individuals and that of them are between 6 and 59 months. If we measure each child aged between 6 and 59 months, we would know the precise prevalence of malnutrition. BUT imagine the enormous costs and time that it would require. It is much easier, less expensive and faster to only measure a sample of the children in order to evaluate the entire population. When you are explaining this concept to the participants, it is helpful to provide numbers so that they can better assimilate the concept. Scenario: Children aged between 6 to 59 months in a district = ; 5 min to measure each child; 8 hours of work per day. Calculate the number of hours needed to measure all the children in this population. x 5 min = min; ( divided by 60) = hours. 2 667 hours are required to measure all the children in this district. Calculate the number of days needed to measure all the children : 2 667/ 5 = 533 days (teams work 8 hours per day, but taking into account their movement and breaks, a team can only give 5 hours a day to actually take the children’s measurements). It would therefore take 533 days to measure all the children in the district! There are only 365 days in a year. Even with many teams, it would take more than a year to measure all the children in the district. A random sample is sufficient to have an overall idea of the nutritional situation. The sample would be taken in such a way that it is representative of the entire population aged between 6 and 59 months. ما هو هدف اختيار العينة؟ ... لتقدير هذه الحالة في أوساط السكان ككل (Adapted from CDC, 2010)

اختيار العينة للمسح القياسات التي تؤخذ من عينة من السكان التي ينبغي أن يكون لها نفس الخصائص مثلها مثل بقية السكان المستهدف يساوي عينة تمثيلية فنياً العينة التمثيلية تعني أن: كل فرد أو وحدة عينة في السكان لديه فرصة معروفة لا تبلغ صفر في أن يتم اختياره (نفس الفرصة) اختيار فرد واحد ينبغي أن يكون مستقلاً عن اختيار الفرد الآخر Notes for trainers : Sampling only gives an estimate of the nutritional status and mortality rate. Information can only be generalized for the rest of the population if the sample is representative. Very commonly used.

اختيار العينة للمسح... تابع
تذكر هذه النقطة بغض النظر عن طريقة اختيار العينة فإنه إذا لم تكن العينة تمثيلية فإنها لا تعطيك أي معلومات حول السكان Notes for trainers :

كيف نحصل على عينة تمثيلية؟
5 دقائق سؤال سريع كيف نحصل على عينة تمثيلية؟ Notes for trainers: Answer: By using probability sampling. Probability sampling uses random selection to guarantee that each individual of the population has a known non-zero chance of being part of the sample.

اختيار العينة الاحتمالية
اختيار العينة الاحتمالية يتطلب وجود "عشوائية" وأن تكون العينة مُمثلة إحصائياً بمعنى أن لا يكون لأي شخص تحكم في مسألة اختيار الأفراد أو الأسر Notes for trainers : As opposed to probability sampling, non-probability sampling does not use random selection to make up a sample and thus cannot be used to obtain a representative sample. However, non-probability sampling is very useful to gather qualitative data. We will go over 3 types of probability samplings: Simple random sampling Systematic random sampling Cluster sampling

بعض التعاريف فضاء العينة إطار العينة وحدة العينة
وحدة العينة الأساسية (أو الابتدائية) فترة العينة تجزيء العينة المجيب موضوع المسح وحدة التحليل Notes for trainers : Note that the term Village is used throughout the training package for convenience. It is used here to denote any area, where people live that has been given a name by the local authorities or population. This may be a traditional village, part of a town or city, sub-district, an enumeration area or even a rural area bounded by geographical features such as a stream or river. When the area is named, the population knows the boundaries of the area, and the authorities either know or can estimate the population of the area. Sampling universe: population from which we are sampling. Sampling frame: the sampling universe’s description, that is the list of our sampling units (households or individuals, for example). Sometimes different from the sampling universe (data not recent, security reasons, etc.) Sampling unit: the unit selected during the process of sampling. If you select households from a list of all households in the population, the sampling unit is the household. If you are selecting districts in the first stage of cluster sampling, the sampling unit (also called primary sampling unit) at the first sampling stage is district. Basic sampling unit or elementary unit: the sampling unit at the last stage of sampling. In a multi-stage cluster survey (see further), if you select first villages, then select households within selected villages, the basic sampling unit would be the household. Sampling interval: the total number of sampling units in our sampling frame/ the number of sampling units to select. Used in systematic random sampling (see further.) Sampling fraction: ratio of sample size / sampling universe. Respondent: person who’s responding to your questions on the field. Survey subject: entity or person on whom you’re collecting data. Unit of analysis: may differ from the survey subject. For example, you can calculate the percentage of HH that have malnourished children. The unit of analysis will be the household in that case.

الاختيار العشوائي لوحدات العينة
أختر رقم من حقيبة أو قبعة أو استخدم برنامج كمبيوتر لحجم السكان الكبير Source: T. Bianco Notes for trainers : See Annex 3.1 for the procedures on how to use a random number table. Randomly picking a number is the easiest concept to understand. Procedure : Determine the range of numbers to pick; e.g. : 1 – 50. Write each number on a separate small piece of paper. Fold the papers and place them in a hat or plastic bag. Pick from the hat or bag as many pieces of paper as you need. Each picked paper will have a number that represents a household that you will have to visit or an individual that you will have to survey. For larger surveys, random selection is done by a computer program using the same concept of picking numbers out of a hat or bag. ENA software can produce a random table number. Show ENA Planning Screen and how to get a random number table. Procedures for obtaining a random number table from ENA: Determine the range of numbers to pick from (e.g. Total number of eligible 6-59 months children in the refugee camp where you’re conducting your survey); e.g. : 1 – Enter 1 in the box named Range from and 1500 in the box named To. Determine the number of units needed (e.g. Number of children in your calculated sample when using the simple random selection method); e.g. : 450 children 6-59 months . Enter 450 in the box named Numbers. Click on the Generate Table button.

طريقة اختيار العينة هناك ثلاث طرق رئيسية لاختيار العينة الاحتمالية:
العينة العشوائية البسيطة العينة العشوائية المنتظمة العينة العنقودية Notes for trainers: Sampling methods are decided according to the distribution of households and the size of the population. The basic principle for selecting households to visit is that each child belonging to the target population must strictly have a known non-zero chance of being selected for the survey, as well as for each person in a mortality survey. If at one point or another, the recommended method needs to be modified for practical reasons, survey managers or field supervisors must then determine whether the chances of participation for each household and for each child are still similar. ENA Planning Screen: how to choose between a random (simple or systematic) and a cluster sampling. In <Sampling>: Check the <random> box if it will be a simple or systematic random survey. Check the <cluster> box if cluster survey.

ما هي طرق اختيار العينات التي استخدمتها بشكل متكرر في الماضي؟
طرق اختيار العينة... تابع 15 دقيقة ما هي طرق اختيار العينات التي استخدمتها بشكل متكرر في الماضي؟ ما هي مزايا هذه الطريقة؟ ما هي سلبيات هذه الطريقة؟ Notes for trainers:

العينة العشوائية البسيطة
يتم اختيار وحدات العينة بشكل عشوائي من إطار العينة للتعداد السكاني الصغير (أكبر من 1000 وحدة عينة أساسية) تتطلب قائمة مكتملة وحديثة بوحدات العينة الأساسية ومن النادر أن تتوفر هذه القائمة في حالات الطوارئ ترقيم كل وحدة عينة ومن ثم اختيار كل أسرة أو كل فرد بشكل عشوائي باستخدام جدول الأرقام العشوائية Notes for trainers: What may be the basic sampling unit used in SMART surveys? Answer: individuals or HH. In the anthropometric survey, the basic sampling unit is the children, if we’re using simple random sampling; for the mortality survey, it’s the household. It is generally used when the target population is composed of more than 1000 sampling units and that the list of all those units (e.g. Children including their names, addresses and shelters) is available. If the population is less than 1000, then we might opt for an exhaustive survey.

العينة العشوائية البسيطة ... تابع
Notes for trainers : Here is a list of all children under 5 years old living in a camp, with their exact location. Simple random sampling consists of picking randomly from this list the desired number of individuals that will be part of your sample.

العينة العشوائية المنتظمة
يتم اختيار وحدات العينة على فترات عددية منتظمة = فترة العينة المسوحات صغيرة الحجم (حوالي 1000 إلى 4500 فرد) يمكن أيضاً تنفيذها في الميدان بدون وجود قائمة بوحدات العينة ينبغي أن يكون هناك سكان متركزين جغرافياً أو مساكن منظمة بترتيب هندسي محدد (مخيمات أو شوارع) Notes for trainers: Such a situation can arise in a camp where the tents are organized in rows; apartment buildings situated on quadrangle roads; or when homes are along a river, hill or other important geographical element. It is also the most frequently used method of selecting households within clusters.

العينة العشوائية المنتظمة ... تابع
هذه الترتيبات مناسبة لاختيار العينة العشوائية المنتظمة Notes for trainers : Source: MSF مخيم ستانكوفتش 2، سكوبجي، مقدونيا – 1999م مثال على منطقة حضرية مرتبة هندسياً

تقرير فترة العينة: فترة العينة يساوي إجمالي عدد وحدات العينة في السكان/ عدد وحدات العينة في العينة اختيار وحدة العينة الأولى: بين 1 و فترة العينة اختيار وحدات العينة التالية: رقم أول وحدة عينة + فترة العينة...الخ Notes for trainers : Example: Total Population : 2400 children between 6-59 months. Sampling unit: children between 6-59 months. Sample size: 300 children between 6-59 months. Sampling interval: ÷ 300 = 8 Random number chosen between 1 and 8 is 2, for example. You will therefore start with child No 2,. The second child chosen will be = 10; so child number 10. Then = 18; so the 3rd child is number 18. Etc. Same reasoning with households.

مثال: تتكون قرية قور من 3400 أسرة مرقمة بشكل تسلسلي من 1 إلى 3400 وأنت بحاجة لعينة تتكون من 250 أسرة. فترة العينة: يساوي 3400/250 يساوي 13.6 تطلب من شخص ما أن يختار عشوائياً رقم بين 1 و 13 ويختار الشخص مثلاً رقم 11. هذا الرقم يكون هو رقم الأسرة الأولى في المسح ويتم اختيار الأسر التالية كما يلي: Notes for trainers :

مرتبة الأسرة الحسابات الإجمالي رقم الأسرة الأسرة الأولى 11 الأسرة الثانية 24.6 25 الأسرة الثالثة 38.2 38 الأسرة الرابعة 51.8 52 الأسرة الخامسة 65.4 65 الأسرة السادسة 79 الأسرة السابعة 92.6 93 الأسرة الثامنة 106.2 106 الأسرة التاسعة 119.8 120 الخ فترة العينة: 13.6 Notes for trainers :

10 دقائق تمارين: المدينة التي تعيش فيها يوجد بها 8400 أسرة. أنت بحاجة لعينة تتكون من 560 أسرة. ما هي فترة العينة؟ أختر الأسرة الأولى والثانية المراد مسحها يتم تنفيذ مسح في مدينة بها 1800 أسرة وعليك أن تختار 680 أسرة للعينة. ما هي فترة العينة؟ أختر الأسرة الأولى والأسر الثلاث التالية التي سيتم مسحها Notes for trainers: Exercise 1: 8400/560 =15 1st household: we randomly choose a number between 1 and 15; say 12. Following household: the random number chosen to start with is 12. The 2nd HH would then be = 27. Exercise 2: 1800/680 = 2.6 1st HH: choose a random start number between 1 and 2; say 1. Following HH would then be = 3.6; so the 2nd HH will be number 4. The 3rd HH will be: = 6.2.  the 3rd HH will be number 6. The 4th HH = = 8.8.  4th HH will be number 9.

استخدم تصميم العينة العنقودية
ماذا لو لم يكن لديك قائمة بوحدات العينة الأساسية (ولا تستطيع أن تقوم بإعداد قائمة) ولم يكن بالإمكان ترتيب الأسر بشكل منتظم؟ استخدم تصميم العينة العنقودية Notes for trainers:

العينة العنقودية ... تابع يتم اختيار العينة العنقودية داخل المجموعات (القرى والمديريات...الخ) وتسمى عناقيد الطريقة الشائعة الأكثر استخداماً عدد سكان كبير (أكبر من 4500) متواجدين في قرى أو مخيمات Notes for trainers:

العينة العنقودية ... تابع الهدف:
اختيار مناطق جغرافية صغيرة يمكن فيها تنفيذ طريقة اختيار العينات العشوائية البسيطة أو المنتظمة Notes for trainers:

العينة العنقودية ... تابع المزايا السلبيات
أقل تكلفة بسبب قرب وحدات العينة من بعضها البعض لا تتطلب قائمة شاملة بجميع وحدات العينة الأساسية (الأسر، الأطفال،...الخ) تقلل من دقة تقديرات انتشار حالة صحية معينة حساب فترات الثقة يكون أكثر تعقيداً Notes for trainers:

العينة العنقودية ... تابع يتم تطبيق هذه الطريقة على مرحلتين:
الاختيار العشوائي للعناقيد من إجمالي عدد الوحدات الجغرافية (القرى على سبيل المثال) الاختيار العشوائي للأسر في إطار العناقيد Notes for trainers: The sampling is done in 2 stages: Stage 1 : The entire population is divided into small distinct geographic zones, such as “villages”. We can either estimate or know the size of the population of each “village”. Clusters are then randomly selected from these villages. Stage 2 : Households are chosen randomly within clusters or villages. For the procedures on household selection within clusters, see module 4.

Notes for trainers: In this example, you want to survey the region of Gao in Mali.

العينة العنقودية: المرحلة الأولى
Notes for trainers: We begin by dividing the entire population into distinct geographical zones, such as villages. The population of each village is known or can be estimated.

العينة العنقودية: المرحلة الأولى ...تابع
اختيار العينة بنسبة احتمالية تتناسب مع حجم السكان: تكون احتمالية اختيار كل قرية في العنقود متناسبة مع حجم السكان في القرية. وهذا يعني أن كل شخص في المنطقة لديه فرصة معروفة لا تساوي صفر لاختياره في العينة Notes for trainers : For the methodology of cluster selection using PPS, see Annex 3.2.

الاحتمالية بالتناسب مع الحجم
العينة العنقودية: المرحلة الأولى ...تابع الاحتمالية بالتناسب مع الحجم Tsaag- annur Nogoon- nuur Ulgii Altant- sogts المديرية Bugat Bayan- غير احتمالية بالتناسب مع الحجم 231 912 3,099 376 484 763 231 912 3,099 376 484 763 احتمالية بالتناسب مع الحجم Tsaag- annur Nogoon- nuur Ulgii Altant- sogts Bugat Bayan- Notes for trainers : In most cluster samples, the first stage of sampling selects groups of people. This first sampling stage may be districts or villages or some other geographic unit. It is usually done with probability proportional to size of population (PPS). What is PPS? This slide illustrates this concept using districts with different population sizes. Each of the districts is represented by a box, and the population of the district is written inside the box. In the top row, each box is of equal size regardless of the population. In the bottom row, the size of the box is proportional to the population of the district. If we throw a dart at the top row to select a district, you can see that each district has the same chance of being hit by the dart even though the populations are quite different. Instead of throwing a dart, we wrote the name of each of the districts on a piece of paper and then selected one piece of paper at random, each district would have the same chance of being selected. On the other hand, if we throw a dart at the bottom row to select a district, those districts, such as Ulgii, which have larger populations are much more likely to be hit by the dart than districts such as Tsaagannur, which have much smaller populations. PPS sampling is like using the bottom row. So if we want to select districts so that their chance of being selected is proportional to their size, we cannot just write the names of the districts on pieces of paper. Somehow, we must make each districts chance of being selected proportional to its size.

+ العينة العنقودية: المرحلة الأولى ...تابع 0.5 x 0.5 = 0.25
احتمالية حدوث حدثين اثنين في نفس الوقت = احتمالية الحدث الأول × احتمالية الحدث الثاني ما هي احتمالية الحصول على الصورة في العملة المعدنية مرتين عند رمي العملة مرتين اثنتين؟ Notes for trainers : + 0.5 x = 0.25

احتمالية اختيار أي مديرية معينة المرحلة الثانية احتمالية اختيار أسرة معينة في إطار مديرية مختارة احتمالية اختيار أسرة معينة x = عدد الأسر في المديرية في البلد 30 x 20 عدد الأسر في المديرية 30 x 20 عدد الأسر في البلد Notes for trainers : In this example, we assume that we need 30 clusters of 20 HH each for our survey. x =

القرى عدد السكان التقديري عدد السكان التراكمي الرقم المخصص N° 1 500 N° 2 400 900 N° 3 160 1060 N° 4 650 1710 N°5 520 2230 N°6 640 2870 N°7 700 3570 N°8 104 3674 N°9 470 4144 N°10 52 4196 … Notes for trainers: First, we must decide what will be the definition of our geographic unit to be selected in the first sampling stage. We therefore define camp sections or villages or districts, then make a list of each of these geographic units with their population sizes. In the next column, list the cumulative population sizes. That is, for each geographic unit, the cumulative population size will be the size of the population for that unit plus all the units which come before it on the list.

القرى عدد السكان التقديري عدد السكان التراكمي الرقم المخصص العناقيد N° 1 500 1 N° 2 400 900 2-3 N° 3 160 1060 N° 4 650 1710 4 - 5 N° 5 520 2230 6-7 N°6 640 2870 8-9 N°7 700 3570 10-11 N°8 104 3674 N°9 470 4144 12 N°10 52 4196 13 … Notes for trainers: Then we calculate the sampling interval. The sampling interval is just the total population of all the geographic units divided by the number of cluster. If our total population is and we need 30 clusters, sampling interval will be /30 = 333 We want to begin sampling at a randomly selected spot on the list, therefore, we choose a random number between 1 and the sampling interval. The geographic unit where this number occurs is the first selected cluster. Example: the number randomly selected is 200. We then add the sampling interval to our randomly selected number to find the second cluster. We continue adding the sampling interval until the end of the list is reached. If the calculations are correct, you should have selected 30 clusters when the end of the list is reached. Mention that later in the module, you will see how ENA completes all those calculations automatically. In the second stage, selected individuals or households are chosen randomly within each cluster or village.

العينة العنقودية: المرحلة الثانية
يمكن اختيار العديد من الطرق: الاختيار العشوائي البسيط للأسر الاختيار العشوائي المنتظم طريقة برنامج التحصين الموسع المعدلة في بعض الحالات ينبغي تجزيء العينة قبل التمكن من تطبيق أحد الطرق المذكورة أعلاه Notes for trainers: Recall : objective of cluster sampling = to choose areas so that eventually, simple or systematic random sampling can be done . 2nd stage sampling is generally dedicated to that. These techniques will be covered in more details in module 4: Field procedures.

حجم العينة الدقة = الاتساق المصداقية = الصلاحية
الحصول على نتائج متطابقة عند تكرار القياسات (قد تكون ذات مصداقية أو لا) الحصول على نتائج قريبة من الحقيقة عند تكرار القياسات Notes for trainers: Two big sampling issues: accuracy and precision. مصداقية منخفضة دقة عالية مصداقية عالية دقة منخفضة مصداقية عالية دقة عالية

المصداقية والدقة عدم وجود المصداقية يعني وجود انحياز
عدم وجود المصداقية يعني وجود انحياز الانحياز = الفرق بين تقديرات العينة والقيم الموجودة في السكان بسبب خطأ في القياس أو اختيار عينة غير تمثيلية أو عوامل أخرى بخلاف حجم العينة: لا نستطيع أن نتحكم في الانحياز إذا كان موجوداً غياب الدقة يتعلق بخطأ في العينة خطأ العينة = الفرق بين تقديرات العينة والقيم الموجودة لدى السكان بسبب الاختيار العشوائي للعينة: نستطيع أن نتعامل مع ذلك من خلال التحكم في حجم العينة Notes for trainers: We cannot control bias but we can try to reduce its occurrence by standardizing survey procedures, providing an adequate training to every member of the survey team, and closely supervising all survey work. Any measure obtained from a sample is subject to sampling error since we’re only covering part of the population.

اختيار العينة – حجم العينة
الدقة مقابل الانحياز كلما ازداد حجم العينة كلما كانت الدقة أفضل: لكن هذا لا يضمن عدم وجود الانحياز مطلقاً يمكن أن ينتج عن الانحياز تقديرات غير صحيحة إذا كان هناك خطأ صغير في العينة فإن ذلك قد يؤدي إلى الثقة في تقديرات خاطئة كلما حجم العينة أكبر كلما كان من الصعب ضبط الجودة ولذلك قد يكون من الأفضل أن يكون حجم العينة صغير بدقة أقل وبمستوى انحياز منخفض كثيراً Notes for trainers:

حجم العينة: فترة الثقة يقوم برنامج ENA باحتساب حجم العينة المطلوبة وسوف يقرر ذلك باعتماد فترة ثقة تبلغ 95%: ما الذي يعنيه ذلك؟ بافتراض أن معدل انتشار سوء التغذية الحاد العام في كامل السكان ما نقوم بتقييمه في المسح عندئذ إذا قمنا بتكرار المسح مائة مرة بعينة مختلفة في كل مرة فإن 95 مسحاً من هذه المسوحات سوف تعطينا نتيجة في حدود فترة الثقة Notes for trainers: ENA will also determine the confidence interval automatically.

فترة الثقة ... تابع نتائج مسح واحد حدود الثقة 95% Source: CDC
Notes for trainers: Of course, if you do only 1 survey, you end up with an estimate (only an estimate) of the prevalence and a calculated 95% confidence limit, as shown by the dart and the blue circle. There is 95% chance that the true population value lies within the circle, but you don't really know what the actual population value is, as shown by the target. But if the sampling is representative and the measurements are not biased, you are pretty sure the actual population value lied within the blue circle. The upper and lower limits of this range determines the confidence interval of the estimate. The size of the CI is related to the size of the sample (the larger the sample size, the smaller the CI and the tighter the circle). Source: CDC

حجم العينة – القياسات الحيوية للجسم البشري
التحكم في أخطاء العينة يتطلب التحكم في حجم العينة. سوف يقوم برنامج ENA بحساب الحد الأدنى من حجم العينة المطلوب ولكن من أجل القيام بذلك فأنت بحاجة عن ما يلي: ما هو معدل انتشار سوء التغذية المُتوقع و/أو معدل الوفيات؟ ما مدى فترة الثقة المسموح بها = ما هو الحد الأدنى من الدقة المطلوبة فيما يتعلق بتقديرات معدلات سوء التغذية أو معدلات الوفاة والذي يمكن أن ينتج عنه نتائج مفيدة؟ Notes for trainers: Control is not cheap. Collecting data is expensive, and the more data you collect, the less control you have on its quality (more risk of bias); you then need to find the minimum basic sampling units (HH or individuals) required to both control sampling error and reduce the risk of bias. Estimated prevalence/death rates can be found in previous surveys and from qualitative assessments. In cluster survey you will also need to determine the likely design effect, which will be covered further on. Show the sample size calculation section on the ENA Planning Screen.

حجم العينة – القياسات الحيوية للجسم البشري...تابع
من أجل تقدير حجم العينة لمسح واحد ينبغي أن تعرف ما يلي: تقديرات مستوى الانتشار للنتيجة الدقة المطلوبة تأثير التصميم متوسط حجم الأسرة نسبة الأطفال دون سن خمس سنوات معدل عدم الاستجابة حجم السكان ككل (ذو أهمية فقط في حالة أن عدد السكان صغير) مستوى الثقة (استخدام دائماً نسبة 95%) Notes for trainers: Last two points are not big issues to deal with. If the target population is more than approximately , you do not need to consider it in the sample size. You do NOT generally need a larger sample size if the population is bigger. However, in case the target population is below this limit (approximately ), you have to check the box called Correction small population size in order for ENA to adjust the sample size. By convention, we tend to choose a 95% level of confidence.

معدلات الانتشار – الوفيات التقديرية
مصادر المعلومات: المسوحات السابقة بيانات الترصد التقديرات النوعية ونتائج التقييم السريع الأخصائيين الصحيين والقيادات الدينية يُفضل اختيار سلسلة من القيم الممكنة وليس اختيار قيمة واحدة إذا لم يكن هناك تأكد فيما يتعلق بهذه القيمة ينبغي اختيار القيمة الأعلى (القصوى) لمستوى الانتشار المُتوقع السقف المحدد الذي يتطلب تدخل طارئ عدم توفر البيانات: التقديرات غير الدقيقة القياسات الحيوية للجسم البشري: معدل الانتشار الأكثر تحفظاً: 50% Notes for trainers: Surveillance data including proportions of children visiting clinics with malnutrition. Prior survey or a survey in an adjacent area of the country to estimate outcomes. Persons who have worked with the population since a previous survey may have a sense of whether malnutrition or mortality has become more or less common. A more general impression can be obtained by asking health workers if they see many thin children. Religious leaders should be asked how many funerals they have conducted recently, the approximate size of their congregations as well as whether the number of funerals has increased or decreased recently. If interested in a particular prevalence or death rate (e.g. the level that would trigger an emergency response), and suspect the actual prevalence is below this threshold, enter the threshold number. Examples: “Given the situation, the malnutrition prevalence is unlikely to be above 20% or below 10%”. You would therefore use 20%. “Given the situation, the death rate is unlikely to be above 2/10 000/d or below 0.5/ /d.” You would therefore use 2 deaths/10 000/d.

معدلات الانتشار ...تابع Notes for trainers:

الدقة بإبقاء كافة العناصر الأخرى متساوية إذا ما قمت بتخفيض نطاق فترة الثقة سوف يرتفع حجم العينة لديك بشكل كبير. حاول تجنب فترة الثقة الضيقة جداً (أقل من 3%) Notes for trainers: Enter the largest confidence interval (or the smallest precision value) acceptable for the estimation of malnutrition prevalence or death.

الدقة ... تابع لا توجد دقة "قياسية". يتم تحديد الدقة مباشرة من أهداف المسح عندما نريد نتائج لمرة واحدة لغرض المناصرة فإننا لسنا بحاجة إلى وجود دقة متناهية (+/-0.10، هذا المعدل جيد بما يكفي) عندما تكون بحاجة لنتائج متعددة تقوم بالمقارنة حيالها في المستقبل فأنت بحاجة لدقة أكبر (+/-0.05 إذا كان للبرنامج أثر كبير) النتائج التي تراقبها بشكل متكرر (على سبيل المثال النتائج التي تُقارن سنوياً) بحاجة إلى دقة أكبر (+/-0.02) Notes for trainers: Most surveys are correct with a ±5% precision for GAM. Example: Estimated prevalence of anaemia is 70%. Do you need a 2% precision? No, because over 40%, WHO treats the situation as an emergency. Also consider surveys done previously. If a survey is done to determine whether one third or more of the population is malnourished, or there is a gross elevation of death rate, for example, then much less precision is needed than for a survey designed to estimate whether there has been a change since a similar survey conducted three months previously. Adapted from CDC, 2010.

الدقة ... تابع Notes for trainers:

معدل انتشار سوء التغذية
الدقة ... تابع معدل انتشار سوء التغذية % الدقة المرغوبة ± % حجم العينة (أطفال) 10 1.5 1537 2.0 864 2.5 553 3.0 384 معدل الوفيات / 10,000/يوم الدقة المرغوبة / 10,000 / يوم حجم العينة (أسرة) 1 0.2 2200 0.3 978 0.4 550 0.5 352 Notes for trainers: Increasing the precision slightly can have a dramatic effect on the sample require for the survey.

معدل انتشار سوء التغذية
الدقة ... تابع معدل انتشار سوء التغذية % الدقة المرغوبة ± % حجم العينة (أطفال) 5 2.0 456 10 2.5 553 15 3.0 544 20 5.0 246 30 7.5 143 40 10.0 92 معدل الوفيات / 10,000/يوم الدقة المرغوبة / 10,000 / يوم حجم العينة (أسرة) 0.5 0.30 489 1.0 0.40 550 1.5 0.50 528 2.0 0.75 313 3.0 1.00 264 Notes for trainers: These table shows the minimum sample size calculated for various prevalences of GAM or death rates and the level of precision that is commonly asked for in conjunction with the anticipated prevalence or death rate. Keep in mind that the values on these tables are suggestions. Depending on the context of the survey, you may want to choose a precision that is more adapted to your survey. In general, the lower the prevalence or death rate the greater the precision needed. If there is a very high prevalence of acute malnutrition – say 40% - , or a high death rate – say 3 deaths/10000/day - then the precision does not need to be high to enable agencies to make appropriate decisions. If there is a low prevalence of acute malnutrition-say 5%-then the precision needs to be high (at least 3%). If have a low precision-say 5%-, it is not sufficiently precise to decide whether or not intervene, as the confidence interval encompasses no malnutrition at all to a substantial proportion of the population. Results would be useless for making program decisions. A CDR of 1/10,000/d is important because it is the level that is often used to declare an emergency. Always choose the widest confidence interval that is acceptable. It is usually not feasible to achieve a precision greater than 2% for GAM, or 0.3 death/10000/day for death rate.

حجم العينة – SRS – قياسات الجسم البشري
يتم احتساب حجم العينة باستخدام المعادلة التالية: n = { t2 x p x q } d2 حيث أن: n = حجم العينة t = خطر الخطأ (خطأ 1.96أو 5%) b = معدل الانتشار المتوقع (كسر عشري من رقم واحد) q = 1 - P (معدل عدم الانتشار المتوقع) d2 = الدقة النسبية المرغوبة Notes for trainers: This slide shows the calculation used to calculate a sample size for SRS. Go through slowly. What could be risk of error? For most surveys, 5% error risk is acceptable. How is the expected prevalence determined? Using secondary sources, informed judgment, experience, findings from previous surveys, etc. If there is absolutely no information, then an estimation of 50% can be used. Assuming a prevalence closer to 50% is safest because 50% requires the largest sample size. What level of precision is acceptable? In general, if the prevalence of a condition is high, lower levels of precision can be tolerated. For example, a 95% confidence interval of +/- 10 percentage points (absolute precision) would seem reasonable for an estimated prevalence of stunting near 50%, but the same absolute precision would be unacceptable for a prevalence level of 10%. But the level of precision you need really depends on what you will use the results for. If you want to compare later surveys to a baseline, you may need good precision for the baseline. If you only want a general idea if malnutrition or a specific micronutrient deficiency is a problem, you may not need so much precision. For cluster surveys, the sample size must be multiplied by the design effect which will be discussed later during the session.

حجم العينة – SRS – الوفيات
من أجل تقدير معدل الوفيات باستخدام تصميم عشوائي بسيط أو منتظم يتم تطبيق المعادلة التالية: n = Z2 x (rate) d2 حيث أن: n = حجم العينة (شخص – زمن) z = الدرجة المعيارية (z-score) المترافقة مع حدود الثقة المرغوبة (على سبيل المثال 1.96 = 95%) المعدل rate= معدل الوفيات التقديري d = الدقة المرغوبة Notes for trainers: Note that the n is expressed in person-time. To find the sample size in number of persons, an additional calculation is done by ENA using the recall period figure. Source: C.Robinson

حجم العينة – العينة العنقودية
العينة العنقودية تتطلب تكبير حجم العينة من خلال مُعامل التصحيح الذي يسمى تأثير التصميم (design effect) لماذا؟ لتصحيح أثر التجانس بين العناقيد على كامل العينة تأثير التصميم من المسوحات السابقة يساعد في تقرير القيمة التي تستخدم إذا لم يكن هناك بيانات متوفرة فإن تأثير التصميم الذي يساوي 1.5 يكون مقبول عموماً لمسوحات قياسات الجسم البشري Notes for trainers: To enter only for a cluster survey. Example: A measles outbreak in one or two clusters will likely affect prevalence of malnutrition in these clusters compared to others with no outbreak. This high cluster prevalence is not representative of the whole area but will increase the general prevalence estimate which will not reflect the reality of the majority of clusters. To compensate this, we will inflate our sample size by using a correction factor called design effect. ENA Planning Screen: In the box called « Sampling », check that the item « Random » is selected. Enter the following values: Estimated prevalence: 15% Desired precision: 5% What is the value in the design effect ? (1) What is your sample size? (196). Now, in the sampling box, click on « cluster ». What is the value noted in the design effect box? (2) This value is there arbitrarily, and it does not mean that design effect is always = 1.5. For example, replace this value by 1.5. What is your sample size with a design effect of 2? (392). With a design effect of 1.5? (294). Other things being equal: the higher the DEFF, the higher the sample size.

حجم العينة – العينة العنقودية – قياسات الجسم البشري
من أجل تقدير معدل الوفيات باستخدام تصميم العنقود يتم تطبيق المعادلة التالية: n = { t2 x p x q } x DEFF d2 حيث أن: n = حجم العينة t = خطر الخطأ (خطأ 1.96أو 5%) b = معدل الانتشار المتوقع (كسر عشري من رقم 1) q = 1 - P (معدل عدم الانتشار المتوقع) d2 = الدقة النسبية المرغوبة DEFF = تأثير التصميم Notes for trainers: People within the same cluster (or village) are generally more similar to each other than to members of a different cluster or village. This is compensated for by multiplying (increasing) the sample size calculated by the design effect. The design effect is a multiplication factor of the sample size calculated by simple random sampling. Design effects can vary from 1 (if the population is homogeneous so that all the clusters are similar to one another) to 4 or higher where some clusters are not affected and others are severely affected (high heterogeneity). If you anticipate that crude death rate or malnutrition prevalence is quite different in different parts of the population, then the design effect may be high. If the design effect is thought to be much greater than 2 then the population is probably sufficiently heterogeneous that two separate surveys should be conducted, each focused upon more homogeneous sections of the population. Two cluster surveys, each with a design effect of 1.5, can be conducted with the same effort as one survey with a design effect of 3. The effects of an error in choosing an inappropriate design effect are not symmetrical: If too large, additional children are measured unnecessarily; If too small, the results may be discarded and a new survey requested.

حجم العينة – العينة العنقودية – الوفيات
من أجل تقدير معدل الوفيات باستخدام التصميم العنقودي للعينة يتم تطبيق المعادلة التالية: n = DEFF x Z2 x (rate) d2 حيث أن: n = حجم العينة (شخص – زمن) DEFF = تأثير التصميم z = الدرجة المعيارية (z-score) المترافقة مع حدود الثقة المرغوبة (على سبيل المثال 1.96 = 95%) المعدل rate= معدل الوفيات التقديري d = الدقة المرغوبة Notes for trainers: If you are unsure about the expected rate, enter the highest rate among the expected values. If mortality due to violence is low in the survey area, a cluster effect of 1.5 can be used, otherwise a higher cluster effect should be entered. Source: C.Robinson

حجم العينة – الوفيات – فترة الاسترداد
حجم العينة – الوفيات – فترة الاسترداد الأهداف هل تتغير معدلات الوفيات بشكل سريع؟ التغيرات الموسمية الهجرة Notes for trainers: The concept of Recall Period will be discussed more details in the Moratlity module. Mortality rates use the concept of person-time to measure the degree of exposure to a mortality risk during a specific period of time (more details in the mortality module). The longer the period studied, the higher the denominator, and then, the lower the sample size. However, recall period length should never be decided in function of desired sample size but of survey objectives. During an acute emergency, usually use approximately a three month recall period (90 days). However, the decision should be made individually for each emergency context. Objectives: How will the mortality information be used? How current does the information have to be? What time period is needed to address the objectives? Are mortality rates changing rapidly? If so, then knowing the mortality rate over the last few months is likely to be more important than knowing the average rate over six months or one year. Seasonality: Does the mortality change markedly with the different seasons of the year? If so then you are more likely to capture the current season if the recall period is not longer than a few months. Migration: Over a short recall period there will be fewer household members leaving the household, and fewer new members joining. This simplifies the interview and calculations. With mass displacements, there may be very few households who have had a stable composition during a prolonged recall period.

فترة التذكر ... تابع المصداقية الدقة الإمدادات Notes for trainers:
Accuracy: The shorter the recall period the more accurate the estimate of mortality. This is because more distant events are more likely to be forgotten by the respondent and there are likely to be more mistakes in the time of death. Recall periods longer than one year should not be used. If the objective of the survey is to document mortality over a longer period of time, recall periods of up to twelve months are however justified. Example: If major violence resulted in population displacement six months ago, you may want to document the death rate for two separate periods: three or six months before, and the six months since, the displacement. Because this increases the complexity of the mortality survey, it should only be included if the added information is useful and if the persons interviewed can reliably place deaths into the time intervals. Precision: The longer the recall period, the more precise the estimate of mortality. With a longer recall period, a smaller number of households need to be interviewed. Logistics: The longer the recall period, the fewer persons (and households) needed in the sample. Although a longer recall period may increase the time needed at each household, the time saved by going to fewer households more than compensates for a longer interview.

البيانات الديموغرافية
متوسط حجم الأسرة نسبة الأطفال دون سن خمس سنوات مصادر البيانات: تعداد سكاني مسوحات الوفيات السابقة في نفس المنطقة أو منطقة مشابهة Notes for trainers: Why average HH size? Answer: Because our basic sampling unit is the HH. In many contexts of developing countries, the proportion of children under 5 don’t reach 20%. Even if our target population is children 6-59 months, on the planning screen, we enter the under-5 proportion. ENA does the calculations automatically assuming that 90% of the children under 5 are aged between 6-59 months. Example: If our sample should contain 500 children 6-59 months, it means that we need: 500 / 90% = 556 child under 5 to obtain that number of 500. In terms of HH, we will need: 556 children under-5 / number of children under-5 per HH 556 children under-5 / (average HH size x % children under 5). 556 / (4 x 20%) = 695 households. الإفراط في تقدير نسبة الأطفال دون سن خمس سنوات قد ينتج عنه الخروج بحجم عينة صغيرة من حيث عدد الأسر ومن ثم يكون الناتج عينة نهائية أصغر من حيث عدد الأطفال

تعريف الأسرة لا يوجد إجماع واضح حول تعريف الأسرة (الأسرة المعيشية) لأن هذا التعريف يرتبط بخصائص ثقافية ينبغي الاتفاق على التعريف الذي سوف يستخدم قبل أن يبدأ المسح من الموصى به استخدام التعريف الذي يُستخدم من قبل الجهات المعنية في البلد التعريف الأكثر استخداماً هو التالي: "الأشخاص الذين رقدوا هنا الليلة الماضية وأكلوا من نفس القدر" Notes for trainers: In country where polygamy exists, a clear definition of a household is even more crucial. By having a specific definition, you’ll be able to determine how many households there are in a polygamous family. For example, if all wives cook together, eat together and live in the same compound, that would be 1 household. However, if each wife has her own kitchen and prepares food for her own children, those would be seperate households.

5 دقائق تمرين تم تنفيذ مسح في عام 2006م على الأطفال في سن 6-59 شهراً وقد كشف المسح عن معدل انتشار سوء التغذية الحاد العام بنسبة 12% (9-15% فترة ثقة 95%). يبلغ متوسط عدد الأشخاص في الأسرة خمسة أشخاص والسكان في المنطقة غير مهاجرين (غير مرتحلين) عموماً وقد كانت الفوارق بين العناقيد في المسح السابق صغيرة جداً. لا يوجد تغييرات كبيرة منذ مسح 2006م. مسح الوفيات الأخير الذي تم تنفيذه في نفس المنطقة كشف عن وجود 17% من السكان دون سن خمس سنوات. ما هو حجم العينة من حيث عدد الأطفال؟ ومن حيث عدد الأسر؟ Notes for trainers: GAM: 12 % (9 – 15% CI 95 %). Cluster effect: 1.5 (since the previous survey didn’t show significant differences between the clusters.) Precision: ±3% (for example). Size of sample obtained: 676 children. If we assume a 3 % non response rate. The number of households is thus: 911 HH. It’s possible to have different sample sizes depending on the estimated prevalence that you enter. If you use 15% as an estimate, you will get a sample size of 816 children. Also, depending on the objective of the survey, you may want to increase or decrease your precision, which will change de sample size.

5 دقائق تمرين كشف المسح الأخير المنفذ في سبتمبر 2008م عن أن معدل الوفيات يبلغ 1.1 ( فترة ثقة 95%) حالة وفاة/ 10,000/ يوم. كما أظهر كذلك أن متوسط حجم الأسر يبلغ 4.3 أشخاص. أنت تريد أن تنفذ مسحاً بفترة استرداد تبلغ 95%. ما هو حجم العينة من حيث عدد الأشخاص؟ ومن حيث عدد الأسر؟ Notes for trainers: CDR: 1.1 ( CI 95%). Average number of people per household: 4.3 people. Design effect: 1.5 Mortality: Estimate of 0.4 for precision and 95 days for the recall period. Obtained size for the number of people: 4170 persons; 1000 HH.

معدل عدم الاستجابة ماذا لو توقعت في الميدان أنك لن تصل إلى كافة الأفراد أو الأسر الذين ينبغي عليك الوصول إليهم من أجل تحقيق هدف العينة؟ نعني بذلك معدل عدم الاستجابة Notes for trainers: تقدير معدل عدم الاستجابة يسمح بحساب حجم عينة أكبر مع التأكد من أن مستوى الدقة لازال مقبولاً

معدل عدم الاستجابة ... تابع
معدل عدم الثقة يشير إلى عدد وحدات العينة الأساسية التي لن تتمكن من الوصول إليها لأسباب متعددة: الرفض عدم إمكانية الوصول أسباب تتعلق بالسلامة التغيب أخرى برنامج ENA يستخدم هذه التقديرات من أجل تكبير حجم العينة باستخدام المعادلة التالية Notes for trainers: NRR = Non-Response Rate N = Sample Size BSU = Basic Sampling Units Example: If according to your estimated prevalence, desired precision, design effect and number of persons per household, you get 695 HH as the sample size, and you expect to have about 5% of non-response rate, the final sample size should be: Final N = 695/(1-0.05) = 732 HH. Final N = N of BSU needed /(1 – NRR)

العينة الكلية – المسح المُدمج
توجد فوارق بسيطة بين حجمي العينتين (عينة قياسات الجسم البشري وعينة معدل الوفيات): يتم تقرير إجمالي عدد الأسر التي يتم مسحها حسب أعلى عدد للأسر التي ينبغي زيارتها (اختيار أعلى عدد من الأسر بين العينتين) العينة الأكبر لأحد العنصرين Notes for trainers:

العينة الكلية – المسح المُدمج ... تابع
وجود فوارق كبيرة بين حجمي العينتين (عينة قياسات الجسم البشري وعينة معدل الوفيات): تقييد عدد الأسر في كل عنصر بالعدد الذي تم احتسابه تنفيذ مسح أصغر في كل ثاني أو ثالث أسرة (أو أكثر) في كل عنقود Notes for trainers:

مثال: حجم العينة – قياسات الجسم البشري 497 أسرة حجم العينة – معدل الوفيات 425 أسرة العينة الكلية: 497 أسرة Notes for trainers:

مثال: حجم العينة – قياسات الجسم البشري: 560 = 35 عنقود كل عنقود مكون من 16 أسرة حجم العينة – معدل الوفيات: 350 أسرة = 35 عنقود كل عنقود مكون من عشر أسر  تنفيذ مسح معدلات الوفيات فقط في العشر الأسر الأولى من كل عنقود Notes for trainers:

حجم العينة – الأطفال أو الأسر؟
طريقة عينة الكوتا: حجم العينة معبر عنه بعدد الأطفال طريقة عدد الأسر الثابت: حجم العينة معبر عنه بعدد الأسرة لماذا يوصي سمارت بطريقة عدد الأسر الثابت؟ باستخدام الاختيار العشوائي مع العينة المنتظمة فإننا نختار الأسر وليس الأطفال قياس المؤشرات الأخرى المرتبطة بالأسر التأكد من أن المساحين لا يغفلون أي أسرة حتى إذا لم تكن تحوي أطفالاً Notes for trainers: The quota sampling method means that, if for example, there is a cluster of 30 children, one must not stop the cluster until that number of children has been reached, regardless of the number of households visited. Whereas the method according to fix households considers a fixed number of households per cluster, and only these households selected in the field will be visited. At the end of the day, if the target number of children was not reached, it should balance itself out since a percentage of non-response was included in the calculation of the sample size.

عدد العناقيد ما لا يقل عن 25 عنقود
Notes for trainers: When the survey includes less than 25 clusters, this seriously threatens the validity of the results and their representativeness. A study was conducted by Binkin and al. where we took 150 clusters and made segments of 10, 15, 25, 30 and 50 clusters. Results: selecting 50 clusters do not yield to very different estimates from those given by 30 clusters. When you go below 25 clusters, the results may be very different from the estimate we have. From Binkin N. Rapid nutrition surveys: how many clusters are enough? Disasters 16(2): الشكل 2: نسبة انتشار انخفاض الوزن مقابل الطول (أقل من 80% من الوسيط) بين الأطفال بالنسبة للعينات الفرعية المنتظمة 10 و 15 و 25 و 30 و 50 عنقود من إجمالي 150 عنقود. تقييم المجاعة في بوركينافاسو وغينيا والنيجر 1984م و 1985م (الخط الأفقي المستقيم يمثل معدل انتشار انخفاض الوزن مقابل العمر (11.6%) لكافة العناقيد الـــــ 150)

عدد العناقيد... تابع احتساب عدد الأسر التي يتم مسحها كل يوم بناءً على ما يلي: زمن الانتقال عدد ساعات العمل يومياً عدد ساعات العمل في الميدان (باستثناء زمن الانتقال) عدد الساعات المطلوبة لمسح الأسر (باستثناء فترات الاستراحة والفترة التي يتم قضاؤها في تقديم الفرق واختيار الأسر) عدد العناقيد = إجمالي عدد الأسر في العينة/ عدد الأسر المراد مسحها يومياً Notes for trainers: It is important to calculate the number of households (per cluster) that a team can survey per day in order not to overload it and to calculate the total number of clusters needed. This slide is a suggestion of a method on how to calculate the number of clusters to be surveyed. In the calculation, we must take into consideration travel time, breaks, walking between households, taking the measurements and administration of questionnaires.

عدد العناقيد... تابع مثال: حجم العينة = 721 أسرة
المغادرة من المكتب الساعة 7:30 صباحاً والعودة الساعة 5:30 مساءً زمن الانتقال للوصول إلى القرية: 45 دقيقة فترة المقدمة الأولية واختيار الأسرة الأولى: ساعة ونصف زمن الانتقال من أسرة إلى الأسرة التي تليها: خمس دقائق متوسط الزمن لدى كل أسرة: 15 دقيقة الاستراحات: استراحتين لمدة 15 دقيقة كل منها واستراحة واحدة لمدة ساعة كم من الأسر يمكن مسحها في اليوم؟ كم من العناقيد من الممكن أن يتضمنها المسح؟ Notes for trainers: Travel time: 45 min. 7:30am – 5:30pm  10h 10h – (2x45min) = 8.5 hours. 8.5 h – 1.5h (introduction and selection of 1st household) = 7h 7h – 1.5h (breaks) = 5.5 hours = 330 minutes. 330 minutes / 20 minutes (time per household + time of moving from one household to the next) = 16.5  16 households per day. 721 households in the survey/16 households per day= 45,06 households  45 clusters.

تمرين 15 دقيقة حجم العينة = 678 أسرة
المغادرة من المكتب الساعة 7:00 صباحاً والعودة الساعة 5:00 مساءً زمن الانتقال للوصول إلى القرية: ساعة ونصف فترة المقدمة الأولية واختيار الأسرة الأولى: ساعة واحدة زمن الانتقال من أسرة إلى الأسرة التي تليها: ثلاث دقائق متوسط الزمن لدى كل أسرة: 10 دقيقة الاستراحات: استراحتين لمدة 15 دقيقة كل منها واستراحة واحدة لمدة ساعة عدد فرق المسح: أربع فرق كم عدد الأيام المطلوبة لجمع البيانات إذا أردت من كل فريق أن يكمل عنقود واحد يومياً؟ Notes for trainers: Travel time: 1.5 h. 7am – 5pm  10h 10h – (2x1.5h) = 7 hours 7 h – 1h (introduction and selection of households) = 6h 6h – 1.5h (pauses) = 4.5 hours= 270 minutes. 270 minutes / 13 minutes (time per household + time of moving from one household to the next) = 20.7  20 households per day. 678 households in the survey/20 households per day= 33,6  34 clusters. 34 clusters = 34 days. 34 days/4 teams = 8.5  9 days per team survey may last 9 days if all goes well.

العناقيد الاحتياطية العناقيد التي يستحيل مسحها:
بسبب عدم وجود الأمن عدم إمكانية الوصول إليها الرفض ...الخ يختار برنامج ENAعناقيد احتياطية تلقائياً في مرحلة التخطيط عندما لا يكون بالإمكان مسح 10% من العناقيد أو أكثر: استخدم كافة العناقيد البديلة مثال: إذا كان مسحك يحتوي على 30 عنقود فإن برنامج ENA سوف يختار أربعة عناقيد احتياطية في مرحلة اختيار العناقيد Notes for trainers: In some cases, once the survey starts, events occur that could prevent surveyors from accessing previously selected clusters. Examples: A conflit could start and a cluster can become inaccessible due to insecurity. A flood might occur and surveyors might not be able to reach a cluster. Due to political or other reasons, a village leader might change his mind and refuse to let surveyors access his village. Replacement clusters (RC) should be used only if 10% or more clusters were impossible to reach during the survey. In that case, ALL RC should be surveyed.

تصدير بيانات السكان إلى ملف أكسل
تحديد العناقيد تصدير بيانات السكان إلى ملف أكسل خيار النسخ/ اللصق طباعة الجدول تحديد العنقود من خلال الضغط على زرار "تحديد العناقيد" (assign clusters) وينبغي تنفيذ ذلك مرة واحدة فقط Notes for trainers: Enter names all the villages, towns, sub-districts or other areas that will potentially be chosen to include in a cluster into the left column. Enter the estimated population size for each area into the right hand column. The order is NOT important. What is important is that all the survey area must be included. If areas are omitted at this step, then they are not part of the surveyed population. Enter the number of clusters to select from these geographic areas. Press on ‘Assign cluster’. The Software will automatically select the area or “villages” that should contain a cluster. This is calculated at random with the chance of any village being chosen being proportional to its population size. One can also choose to use the Copy/Paste option to insert the population data into the table, by: Copying data from Word or Excel. Clicking on the top left empty box of the Table for cluster sampling. Clicking on the clipboard at the top right side of the table. A window will open asking: Data from the clipboard will be integrated! Click Yes. Once the cluster sites have been chosen this is fixed. It will potentially introduce a bias if they are re-selected. Under no circumstances should the program be re-run several times until particular villages are included or excluded. The geographic area to survey should therefore be carefully studied at the beginning of the planning in order to exclude any inaccessible area. You will not be able to change a cluster site once it is selected. If the survey is to be unbiased, the selected site must be visited. Thus, it is important to define your geographical area in the planning stage very realistically, taking travel, security, and any other factor that could influence your ability to get to the cluster site into account before listing the sites in the planning table.

العناقيد الاحتياطية ... تابع
عليك تحديد المنطقة الجغرافية في مرحلة التخطيط بشكل واقعي جداً آخذين في الحسبان ظروف الانتقال والقضايا الأمنية وأي عوامل أخرى يمكن أن تؤثر على قدرتك في الوصول إلى موقع العنقود قبل إدراج المواقع في جدول التخطيط أختر دائماً الوحدة الجغرافية الأصغر حجماً التي يوجد بالنسبة لها بيانات سكان واسم محدد لها إذا لم تكن بيانات السكان متوفرة أرسم خريطة وحدد أحجام نسبية للمناطق على الخريطة ينبغي أن يكون في كل قسم على الأقل العدد المطلوب من الأسر لتشكيل عنقود مكتمل Notes for trainers: If there are no population data: Draw a map of the area and with someone who is familiar with the area and roughly divide the area into sections of about equal size, following as far as possible existing geographic or administrative boundaries. Each area should have a local name, so that the boundaries are clear. The survey team can further assist in verifying or providing population estimates. Use local knowledge to assign a relative size to each village. It is normally better to take one typical village that everyone knows and then describe the other villages in terms of fractions or multiples of the index village. A starting village, A, is used and the other villages described in terms of their relative size. These factors are then used to give the villages weights. Sometimes only vague descriptions are available, very big, big, medium, small, very small, for example. These descriptions can also be used to weight villages for sampling proportional to population size. Example: Village A is used to estimate the other villages relative size; Village B is ½ of the size of village A; Village C is 2 times as large as village A, etc.. Each section should have at least the number of households required to form a complete cluster. If there are insufficient houses in a village then two adjacent villages should be combined at the planning stage.

حالات خاصة: القرى الصغيرة
إذا لم يكن في القرية عدد كاف من الأسر (أقل من عدد الأسر الذي ينبغي أن يحتوي عليه العنقود) فينبغي دمج القرية مع أقرب قرية منها إذا ما اختيار هذه المجموعة من قبل برنامج ENA كعنقود فيتم تحديد احتمالية الأسر التي يتم اختيارها بين القريتين على سبيل المثال القرية أ يوجد بها 15 أسرة والقرية ب يوجد لديها 60 أسرة. حيث أن القرية ب تبلغ أربعة أضعاف القرية أ وسوف نختار أسر من القرية أ أربعة أضعاف القرية ب. إذا كان العنقود يحتوي 20 أسرة القرية أ: أربع أسر القرية ب: 16 أسرة Notes for trainers:

حالات خاصة: القرى الكبيرة
قد تحتوي القرية الواحدة على أكثر من عنقود واحد: إذا كانت تحتوي عنقودين: تقسيم القرية إلى جزأين متساويين (شمال وجنوب، أراضي مرتفعة وأراضي منخفضة، الخ) اختيار عنقود من كل جزء إذا كانت الأجزاء غير متساوية يتم اختيار العناقيد باستخدام طريقة الاحتمالية المتناسبة مع الحجم (PPS) إذا كانت القرية تحتوي على أكثر من عنقودين: تحديد فيما إذا كان هناك تقسيمات فرعية للقرية مثل قرى فرعية أو قطاعات أو أحياء...الخ اختيار عنقود في كل من هذه القطاعات باستخدام طريقة الاحتمالية المتناسبة مع الحجم (PPS) Notes for trainers: If a village is large but still less than HH and was selected to contain 2 or more clusters, you can conduct the clusters one after the other. However, make sure to change the cluster number on the questionnaires after finishing each “set” of households. For example, if each cluster should contain 15 HH, you should change the cluster number on the questionnaires when you visit HH number 16, because this HH will then belong to the next cluster.

تسمية ملفات المسح ينبغي أن تعطى للمسوحات أسماء فريدة بها:
ثلاثة أحرف رمز البلد تاريخ المسح نوع السكان أو الجهة رمز لنوع الملف تسهيل العثور على الملف من قبل مختلف المستخدمين أمثلة: <LIB_0408_rep.doc> <LIB_0409_IDP_Buchanan_AAH_dat.xls> Notes for trainers : Mention that there is a recommended naming convention. The survey must be given a unique name. Important to be consistent in the naming of files so that they can be recognized later by any member of the team. The name of the file should: Start with a three letter code for the country (e.g. SUD for Sudan, ZAM for Zambia, ANG for Angola, etc.) Then the file name should have the date of the survey in YYMM format (year, year, month, month) Type of population (refugee, IDP, resident) or the agency involved can be included in the name of all the files as well. Then, a code for the type of file: REP for report, DAT for the data file, etc. Example: File named <LIB_0408_rep.doc> would be the report of a survey taken in August 2004 in Liberia. <LIB_0409_IDP_Buchanan_AAH_dat.xls> would be the data file for a survey with IDPs in Buchanan, Liberia in September 2004 conducted by Action Against Hunger. This naming convention means that if the files in the directory are sorted they will automatically be listed in a clear order and all the files pertaining to a particular survey will be together. All files related to one survey should be kept in a unique directory. The directory should also be named using the same convention (by country and date), and any other information that makes the directory unique for a particular survey. It is important that everyone in an agency uses the standard convention for naming files.

تمرين تم تنفيذ مسح في أكتوبر 2005م في النيجر في منطقة تاهو (Tahoua)
5 دقائق تمرين تم تنفيذ مسح في أكتوبر 2005م في النيجر في منطقة تاهو (Tahoua) كيف ستسمي هذا التقرير؟ كيف ستسمي ملف البيانات؟ Notes for trainers :

ما هي طريقة أخذ العينة هنا؟
Notes for trainers: This is generally more reflective of the situations where we are required to do a survey. Widely dispersed population with no obvious pattern or organization. Answer: cluster sampling. ما هي طريقة أخذ العينة هنا؟ Source: CDC

ما هي طريقة أخذ العينة هنا؟
Notes for trainers: Photo taken in Macedonia of Cegrane Camp for Kosovar refugees: a systematic sample was used here as the tents were in neat rows and could be counted to choose which tents to include in the sample. Source: CDC

سؤال سريع لماذا ومتى نختار عينة؟
5 دقائق سؤال سريع لماذا ومتى نختار عينة؟ نختار عينة لأن مسح كامل السكان ليس أمراً ممكنا إلا في حالات نادرة نختار عينة عندما يكون حجم السكان أكبر من 1000 وحدة Notes for trainers:

لا يمكن تجنب أخطاء اختيار العينات
صح أم خطأ؟ 5 دقائق لا يمكن تجنب أخطاء اختيار العينات صحيح: وهذا بسبب الاختيار العشوائي. إلا أنه لازال بالإمكان التحكم في ذلك من خلال احتساب حجم العينة بالشكل السليم. يستخدم برنامج ENA تأثير التصميم من أجل تعويض معدل عدم الاستجابة العالمي المتوقع. Notes for trainers: خطأ: تأثير التصميم يؤدي إلى زيادة حجم العينة إلا أنه يُستخدم بشكل أكثر عموماً لتعويض تأثير عدم تجانس العنقود على النتيجة الكلية

15 دقيقة تمارين من إجمالي عدد 10,000 طفل قم باختيار 500 طفل عشوائياً لمسح عينة عشوائية بسيطة باستخدام جدول الأرقام العشوائية في برنامج ENA قم باختيار 35 عنقود عشوائياً من بيانات القرى في الوثيقة: بيانات السكان Notes for trainers: See Annex 3.3 for the Population Data.

تمارين تخطيط مسح قياسات الجسم البشري
ساعة واحدة تمارين تخطيط مسح قياسات الجسم البشري تخطيط مسح قياسات الجسم البشري ومعدل الوفيات Notes for trainers: See Annexes and answers in Annexes

المقرر الثالث اختيار العينة.

عروض تقديميّة مشابهة

عرض تقديمي عن الموضوع: "المقرر الثالث اختيار العينة."— نسخة العرض التّقديمي:

عروض تقديميّة مشابهة

عن المشروع

خدمة التّواصل معنا

تسجيل الدّخول

تسجيل الحساب باستخدام حساب على موقع تواصل إجتماعي:

المقرر الثالث اختيار العينة.

عروض تقديميّة مشابهة

عرض تقديمي عن الموضوع: "المقرر الثالث اختيار العينة."— نسخة العرض التّقديمي:

عروض تقديميّة مشابهة

عن المشروع

خدمة التّواصل معنا