You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
~/opt/miniconda3/envs/pyspark2.4/lib/python3.7/site-packages/fugue/workflow/workflow.py in run(self, *args, **kwargs)
1521 if ctb is None: # pragma: no cover
1522 raise
-> 1523 raise ex.with_traceback(ctb)
1524 self._computed = True
1525 return DataFrames(
~/opt/miniconda3/envs/pyspark2.4/lib/python3.7/site-packages/triad/collections/schema.py in setitem(self, name, value, *args, **kwds)
182 assert_arg_not_none(value, "value")
183 if not validate_column_name(name):
--> 184 raise SchemaError(f"Invalid column name {name}")
185 if name in self: # update existing value is not allowed
186 raise SchemaError(f"{name} already exists in {self}")
SchemaError: Invalid column name 1_col
**Expected behavior**
Expect to support columns starting with numbers.
**Environment (please complete the following information):**
- Backend: pandas
- Backend version:
- Python version: 3.7
- OS: linux/windows: linux
The text was updated successfully, but these errors were encountered:
andyndang
changed the title
[BUG] Fugue doesn't support column name starting with numbers
[FEATURE] Support column names starting with numbers
Sep 30, 2022
Minimal Code To Reproduce
using
_0 _State.RUNNING -> _State.FAILED Invalid column name 1_col
SchemaError Traceback (most recent call last)
/var/folders/c_/jg8n22hx1qd8lgmv955hs05m0000gr/T/ipykernel_43335/3739077820.py in
15 df=tdf,
16 using=no_op,
---> 17 schema="*",
18 )
~/opt/miniconda3/envs/pyspark2.4/lib/python3.7/site-packages/fugue/interfaceless.py in transform(df, using, schema, params, partition, callback, ignore_errors, engine, engine_conf, force_output_fugue_dataframe, persist, as_local, save_path, checkpoint)
165 else:
166 tdf.save(save_path, fmt="parquet")
--> 167 dag.run(engine, conf=engine_conf)
168 if checkpoint:
169 result = dag.yields["result"].result # type:ignore
~/opt/miniconda3/envs/pyspark2.4/lib/python3.7/site-packages/fugue/workflow/workflow.py in run(self, *args, **kwargs)
1521 if ctb is None: # pragma: no cover
1522 raise
-> 1523 raise ex.with_traceback(ctb)
1524 self._computed = True
1525 return DataFrames(
~/opt/miniconda3/envs/pyspark2.4/lib/python3.7/site-packages/triad/collections/schema.py in setitem(self, name, value, *args, **kwds)
182 assert_arg_not_none(value, "value")
183 if not validate_column_name(name):
--> 184 raise SchemaError(f"Invalid column name {name}")
185 if name in self: # update existing value is not allowed
186 raise SchemaError(f"{name} already exists in {self}")
SchemaError: Invalid column name 1_col
The text was updated successfully, but these errors were encountered: