
I want to concatenate with results like concat_columns here:

In the example code below, the results contain quotation marks: Houston-Alice-"salary". I do not want this; I want the results to look like this: Houston-Alice-salary

I've tried variations on "'{}'".format(data_col) to no avail: data_col, "{}".format(data_col), and '{}'.format(data_col) all return Houston-Alice-77321.

The column names are user-supplied, so I need to use methods that prevent SQL injection attacks.

What should I try next?

Here is my example code:

import psycopg
import pandas as pd


def so_question(columns, group_by_columns, table):
    """
    example for SO
    """
    table = psycopg.sql.Composable.as_string(psycopg.sql.Identifier(table))
    columns = [psycopg.sql.Composable.as_string(psycopg.sql.Identifier(col)) for col in columns]
    group_by_columns = [psycopg.sql.Composable.as_string(psycopg.sql.Identifier(col)) for col in group_by_columns]

    for data_col in columns:
        group_by = ''
        # check if there are grouping columns
        if len(group_by_columns) > 0:
            concat_columns = " ,'-',".join(group_by_columns) + " ,'-'," + "'{}'".format(data_col)
            group_by_clause_group_by = " ,".join(group_by_columns) + " ," + data_col
        # CTE generation:
        sql_statement = f"""
            WITH sql_cte as (
                SELECT
                        CONCAT({concat_columns}) as concat_columns
                        ,{data_col} as value

                from {table}, unnest(array[{data_col}]) AS my_col
                group by
                        {group_by_clause_group_by}
                        )
                SELECT * FROM sql_cte
                """
    return sql_statement
    
def execute_sql_get_dataframe(sql):
    """
    Creates (and closes) db connection, gets requested sql data, returns as Pandas DataFrame.
    Args:
        sql (string): sql query to execute.

    Returns:
        Pandas DataFrame of sql query results.
    """
    try:
        # print(sql_statement_to_execute)
        # create db connection, get connection and cursor object
        db_connection_cursor = db_connection_cursor_object()
        # execute query
        db_connection_cursor['cursor'].execute(sql)
        # get results into tuples
        tuples_list = db_connection_cursor['cursor'].fetchall()
        # get column names: https://www.geeksforgeeks.org/get-column-names-from-postgresql-table-using-psycopg2/
        column_names = [desc[0] for desc in db_connection_cursor['cursor'].description]
        db_connection_cursor['connection'].close()
        # create df from results
        df_from_sql_query = pd.DataFrame(tuples_list, columns=column_names)
        return df_from_sql_query
    except Exception as exc:
        log.exception(f'sql_statement_to_execute:\n {sql}', exc_info=True)
        log.exception(msg=f'Exception: {exc}', exc_info=True)
        
    
    
data_columns = ['salary']
group_by_columns_in_order_of_grouping = ['city', 'name']

_sql = so_question(columns=data_columns,
                   group_by_columns=group_by_columns_in_order_of_grouping,
                   table='random_data')
dataframe = execute_sql_get_dataframe(sql=_sql)
print(dataframe)
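
For reference, with the inputs above (data_columns = ['salary'], group-by columns ['city', 'name'], table 'random_data') the function builds roughly the following SQL. This is a sketch obtained by tracing the f-string, so the exact whitespace differs:

WITH sql_cte as (
    SELECT
            CONCAT("city" ,'-',"name" ,'-','"salary"') as concat_columns
            ,"salary" as value

    from "random_data", unnest(array["salary"]) AS my_col
    group by
            "city" ,"name" ,"salary"
            )
    SELECT * FROM sql_cte

The single-quoted '"salary"' literal is where the double quotes in Houston-Alice-"salary" come from.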

Here's the code for data generation:

import psycopg

# Connect to the database
conn = psycopg.connect("dbname=mydatabase user=myuser password=mypassword")
cur = conn.cursor()

# Create the sample table
cur.execute("""
    CREATE TABLE sample_table (
        city VARCHAR(50),
        name VARCHAR(50),
        salary INTEGER
    )
""")

# Insert sample data into the table
cur.execute("""
    INSERT INTO sample_table (city, name, salary) VALUES
    ('New York', 'Alice', 70000),
    ('Los Angeles', 'Bob', 80000),
    ('Chicago', 'Charlie', 75000),
    ('Houston', 'Eve', 71758),
    ('Phoenix', 'Dave', 68000)
""")

# Commit the transaction
conn.commit()

# Close the cursor and connection
cur.close()
conn.close()

print("Sample table created and data inserted successfully.")

  • Off-topic: In this example, there is no need for a CTE or a GROUP BY. – Frank Heikens, Mar 27 at 16:33
  • @FrankHeikens Fair observation; I stripped down my existing function to this example code. It didn't occur to me to start from scratch, which could have been simpler. – Python_Learner, Mar 29 at 9:39

2 Answers


Since Postgres column identifiers are quoted in double quotes, psycopg.sql.Identifier() does return the column name enclosed in them.
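
You can see this with a minimal check that follows the question's own as_string() call pattern (psycopg 3):

import psycopg

# Render an Identifier the same way the question does; a plain column name
# comes back wrapped in double quotes.
quoted = psycopg.sql.Composable.as_string(psycopg.sql.Identifier('salary'))
print(quoted)                  # expected: "salary"
# Wrapping it in "'{}'".format(...) then builds a string literal that still
# contains those double quotes: '"salary"'
print("'{}'".format(quoted))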

What you could theoretically do is strip the quotes after rendering the identifier:

columns = [psycopg.sql.Composable.as_string(psycopg.sql.Identifier(col)).strip('"') for col in columns]

However, as I am not sure how Identifier() ensures safe quoting, stripping the quotes afterwards may cost you the SQL injection protection.

So you may actually want to remove the quotes from the query result in the DataFrame instead.
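
If you go that route, a minimal sketch of that post-processing (assuming the query result has been loaded into dataframe as in the question, and that the concatenated column is named concat_columns):

# Strip the double quotes that Identifier added around the column name;
# regex=False so the quote character is treated literally.
dataframe['concat_columns'] = dataframe['concat_columns'].str.replace('"', '', regex=False)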

For future readers, Postgres has a built-in solution, REPLACE().

Here it is added to my CONCAT() line to accomplish the desired result:

REPLACE(CONCAT({concat_columns}), '"', '')
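
For illustration, this is roughly how it slots into the question's f-string; the rest of the query is unchanged (a sketch only):

sql_statement = f"""
    WITH sql_cte as (
        SELECT
                REPLACE(CONCAT({concat_columns}), '"', '') as concat_columns
                ,{data_col} as value

        from {table}, unnest(array[{data_col}]) AS my_col
        group by
                {group_by_clause_group_by}
                )
        SELECT * FROM sql_cte
        """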
