Master

التنبؤ بالسلوك الإستهلاكي للزبون بإستخدام طرق تعلم الآلة لعينة من العلامات التجارية

تحميل PDF

الكلية

كلية العلوم الاقتصادية والتجارية وعلوم التسيير

المشرف

حميدان ربيع

التحميلات

الكلمات المفتاحية

تعلم الآلة، الزبائن، التنبؤ، السلوك الإستهلاكي

الملخص بالعربية

تسعى هذه الدراسة إلى بناء نموذج قادر على التنبؤ بالسلوك الاستهلاكي للزبائن من خلال تصنيفهم بين السلوكيات، ومن أبرزها سلوك \"المغادرة\" الذي يُعد تهديدًا حقيقيًا للشركات. يساعد التنبؤ المسبق بهذا السلوك على اتخاذ قرارات استباقية دقيقة، تُسهم في تقليل خسارة الزبائن، وزيادة ولائهم، والحد من التكاليف. تم في هذه الدراسة استخدام قاعدة بيانات حقيقية لإحدى شركات الاتصالات، تم الحصول عليها من موقع Kaggle، حيث كانت البيانات مستقرة وملائمة للتطبيق. تم اعتماد المنهج الوصفي التحليلي، وتطبيق أربع نماذج من نماذج تعلم الآلة، وهي: 1. الانحدار اللوجستي (Logistic Regression) 2. شجرة القرار (Decision Tree) 3. التدرج المعزز (Gradient Boosting) 4. الغابات العشوائية (Random Forest) وقد تم مقارنة أداء هذه النماذج باستخدام مؤشرات عامة مثل: الدقة (Accuracy)، والاستدعاء (Recall)، والدقة الإيجابية (Precision)، ومعامل F1 (F1-Score)، بهدف اختيار النموذج الأكثر كفاءة في التنبؤ. توصلت النتائج إلى أن نموذج التدرج المعزز هو الأفضل من حيث نسبة الدقة، حيث بلغت 83%، مع انحراف معياري قدره 0.01، مما يشير إلى استقراره وكفاءته العالية في التنبؤ بسلوك الزبائن.

الملخص بالإنجليزية

This study aims to build a model capable of predicting customer consumption behavior by classifying them according to different patterns, with customer churn being one of the most critical behaviors that poses a threat to companies. Predicting this behavior in advance enables businesses to make proactive, data-driven decisions that help reduce customer loss, increase loyalty, and minimize costs. A real-world dataset from a telecommunications company was used in this study, obtained from the Kaggle platform. The data proved to be stable and suitable for analysis. The descriptive-analytical approach was adopted, and four machine learning models were applied: 1. Logistic Regression 2. Decision Tree 3. Gradient Boosting 4. Random Forest The models were compared based on general performance metrics such as Accuracy, Recall, Precision, and F1-Score to determine the most efficient model for prediction. The results showed that the Gradient Boosting model achieved the best performance, with an accuracy rate of 83% and a standard deviation of 0.01, indicating a high level of consistency and predictive efficiency