Questions tagged [python]
Use for data science questions related to the programming language Python. Not intended for general coding questions (which should be asked on Stack Overflow).
6,621 questions
1
vote
1
answer
24
views
Axis error when creating csr_matrix. from merge data
I get an error when creating a csr_matrix data. from two merged pd.read_csv() data.
the code is:
...
1
vote
0
answers
22
views
What would be the best way to analyze the relationship between a chemical reaction network graph and a tuple using a GNN?
So, for an ongoing research project, I've been analyzing the topology of the chemical reaction network (CRN) of a planet's atmosphere. What I'd like to do is see if anything about the CRN can be ...
0
votes
1
answer
37
views
definition and example of coords concept
What is coords at csr_matrix?. i confused when i print() my csr_matrix data. something that appear on my terminal is these:
...
3
votes
0
answers
24
views
CLV Estimation: BTYD vs. Survival Analysis
I'm working on a project using the Elo Merchant Category Recommendation dataset (Kaggle). My goal is to perform Customer Segmentation base on their transactions by combining RFM metrics with Customer ...
0
votes
0
answers
27
views
Neural network (NN) model loss is increasing with epochs
I am trying to make a Convolutional Neural Network (CNN) using only numpy and other basic libraries. As a first step though, I needed to test the regular NN layers first, so I currently have a regular ...
9
votes
1
answer
648
views
Handling Profanity Censorship in BERTopic
I'm currently working in a dataset with censorship on profanity. Basically, fuck would be 4 heart emojis. Considering I'm trying to run a topic modelling w/ BERTopic, what kinda of preprocessing would ...
1
vote
0
answers
21
views
What causes a model to have such an output?
I'm training CSDI model and the output is very suspicious?
Low diffusion steps? too high learning rate? nothing seems to change this behavior? some normalization issue that I'm not accounting for, I'...
5
votes
1
answer
290
views
How to efficiently merge multiple CSV and JSON files into a single DataFrame using Pandas in Python
I am working with multiple data files in a folder where some files are in CSV format and others are in JSON format. I want to combine all of them into a single DataFrame for further analysis.
Here is ...
2
votes
0
answers
18
views
Trying to understand if I'm implementing GluonTS sliding window splitting and validation set correctly
The documentation is a little bit confusing so I thought I would ask here to make sure, I'm using:
...
5
votes
1
answer
56
views
Model's forecasts are not anchored correctly to the history
I'm implementing this paper
and trying to train it on a generated data and return full ground truths and a single forecast but the forecasts my model is producing are not anchored to the past series ...
7
votes
4
answers
348
views
Calculating next row in binary matrix
if I have the binary matrix which looks something like this (this is only 10 rows of binary matrix, I have a dataset of a million rows, so you can see what the binary matrix looks like):
...
2
votes
0
answers
52
views
GAN training (RGB to IR) High-quality results on training set, but blurry/hallucinated outputs on unseen test images
I am a 6th sem student and my mini project is IR pedestrian detection using yolo v8. My job is to train GAN(Pix2Pix-based) for the generation of synthetic IR images. I will give my code below. My 99th ...
5
votes
0
answers
35
views
Unable to predict values for test data
I have build and trained a NMT model using Rnn in Google colab and Now when I am trying to predict for my test data my Google colab session keeps on crashing . The shape of my test data is 47838Ã55
...
6
votes
1
answer
147
views
When attempting to maximize F1 score for a decision tree on test data using cost-complexity pruning why is it yielding the fully grown tree?
I'm learning about classification using decision trees. I'm using DecisionTreeClassifier function in the scikit-learn library in Python to train the model on training data (yields fully grown tree), ...
2
votes
0
answers
35
views
graph analysis for 2-dimensional edges for Natural Language Processing
Given a text resource (Corpus/novel/...) I want to find pair of words that 1) appear statistically significantly together and 2) extract contextual knowledge from these pairs. For simplicity I'm ...