GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:18:34 Sequence gi568815596f:6777801_6995939 : 218139 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4880 5027 148 1 1 66 115 46 0.298 4.91 1.02 Intr + 7561 7599 39 2 0 100 73 39 0.181 1.80 1.03 Term + 25526 25587 62 0 2 25 49 115 0.194 -0.73 1.04 PlyA + 26280 26285 6 1.05 2.06 PlyA - 26996 26991 6 1.05 2.05 Term - 33760 33683 78 1 0 97 52 74 0.253 2.46 2.04 Intr - 43002 42933 70 2 1 93 50 57 0.163 1.58 2.03 Intr - 43348 43227 122 1 2 120 -23 94 0.311 0.79 2.02 Intr - 43623 43491 133 2 1 92 72 46 0.740 4.05 2.01 Init - 56495 56404 92 2 2 61 94 35 0.284 1.36 2.00 Prom - 58360 58321 40 -4.96 3.08 PlyA - 59500 59495 6 1.05 3.07 Term - 62878 62763 116 2 2 117 44 74 0.615 4.63 3.06 Intr - 71563 71479 85 1 1 62 82 21 0.428 -1.71 3.05 Intr - 72173 72089 85 1 1 97 19 98 0.395 3.62 3.04 Intr - 73883 73650 234 1 0 73 56 146 0.469 6.70 3.03 Intr - 83585 83384 202 0 1 57 96 155 0.988 11.54 3.02 Intr - 85778 85664 115 2 1 46 94 158 0.995 12.22 3.01 Init - 87896 87222 675 2 0 87 94 950 0.995 88.67 3.00 Prom - 96074 96035 40 -6.06 4.00 Prom + 99942 99981 40 -4.66 4.01 Init + 100001 100346 346 1 1 82 105 237 0.565 20.48 4.02 Intr + 105571 105732 162 2 0 43 54 138 0.604 5.85 4.03 Intr + 109135 109364 230 2 2 65 106 240 0.994 21.09 4.04 Intr + 112376 112525 150 1 0 73 1 175 0.322 7.66 4.05 Intr + 115871 115903 33 1 0 105 73 40 0.479 2.62 4.06 Term + 117978 118142 165 2 0 54 43 130 0.528 3.02 4.07 PlyA + 119503 119508 6 1.05 5.02 PlyA - 120788 120783 6 1.05 5.01 Sngl - 126144 125545 600 0 0 79 49 312 0.994 22.60 5.00 Prom - 138144 138105 40 -3.56 6.10 PlyA - 138343 138338 6 1.05 6.09 Term - 143839 143475 365 2 2 91 44 132 0.467 3.83 6.08 Intr - 145879 145725 155 0 2 47 44 53 0.221 -3.48 6.07 Intr - 146637 146430 208 1 1 56 32 162 0.322 5.54 6.06 Intr - 147282 147092 191 2 2 19 70 53 0.235 -4.17 6.05 Intr - 148924 148769 156 0 0 26 69 121 0.431 3.13 6.04 Intr - 151199 151014 186 1 0 28 94 118 0.120 5.20 6.03 Intr - 153367 153259 109 2 1 55 76 39 0.062 -1.26 6.02 Intr - 158262 158193 70 0 1 95 77 75 0.111 5.85 6.01 Init - 178676 178515 162 2 0 85 57 101 0.284 6.43 6.00 Prom - 184331 184292 40 -3.16 7.03 PlyA - 184440 184435 6 1.05 7.02 Term - 191083 190948 136 0 1 79 49 119 0.973 4.59 7.01 Init - 196120 196032 89 1 2 78 87 67 0.928 5.72 7.00 Prom - 201071 201032 40 -4.66 8.02 PlyA - 201504 201499 6 -0.45 8.01 Sngl - 203892 203602 291 0 0 59 48 166 0.582 5.35 8.00 Prom - 211742 211703 40 -3.66 9.02 PlyA - 211914 211909 6 1.05 9.01 Sngl - 217880 217755 126 2 0 71 55 170 0.573 2.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_1|82_aa WHVNKFSFIQKEEQVVLNSVPASYHQISQQPLSKVGAAKATLALRAGCPARPSMDCLMPI RFDHGAEDARNDDHDEEEGEDR >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_1|249_bp tggcacgtgaataaattcagcttcatccaaaaagaggaacaggttgtcctaaattctgtg cccgcgagttaccatcaaatttcacaacaaccacttagcaaagtgggtgcagcaaaggcc actttagccctccgggccggatgcccagccaggccctcaatggattgcctgatgcccatc cgctttgaccatggagctgaagatgccaggaatgatgaccatgatgaggaggagggggag gatcggtga >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_2|164_aa MSCDDRGRDWTDAAESQEHQNMIITTKRWKRLTGAQVKKEPQTYFLPCPFRPLGADISFA AVRPSLPRSLELNAGMRAPTLPPTPAAELRFLSTAVRFQPRSDASDVISSSCLWTCSRSV MVNIDCQLDEIEGCKVLFLPLQSDTSQGKDMGEVAVQPLLSHRG >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_2|495_bp atgtcatgtgatgatagaggtagagattggactgatgcagctgaaagccaggaacaccaa aatatgatcatcactaccaaaagatggaagaggctcacaggagcccaggtcaagaaggaa ccacagacgtacttcctcccctgccccttccgtcccctgggagctgacatttcctttgca gcagtccgtcccagtttacccaggagcctggagctcaacgctgggatgagagcacccacg cttcctccaactcctgccgcagagctgcgtttcctcagtactgctgtccgcttccagccc cgctccgacgcatctgacgtcatttcctcgtcttgtctctggacctgctcccggagtgtg atggttaatattgactgtcagcttgatgaaattgaaggatgcaaagtattgttcctgcct ttgcagagtgacaccagccagggcaaagacatgggagaggtggccgtgcagcccctgctc tcacacagaggctga >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_3|503_aa MAFARRLLRGPLSGPLLGRRGVCAGAMAPPRRFVLELPDCTLAHFALGADAPGDADAPDP RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ DGRRLQVGCAQVVPVPEPPLHPVVPDLPSSVVFPDREAARAVLEECTSFIPEARAVLDLV DQCPKQIQKGKFQVVAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP GCHVVDASPSREKVLQTSQILEPSVPAEAVYVSWLPKDTPAEPFLGPSWEEPYTILLSTC SAVKFVGLESWIHHTRVKAWNDP >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_3|1512_bp atggccttcgcccgccggctcctgcgcgggccactgtcggggccgctgctcgggcggcgc ggggtctgcgctggggccatggctccgccgcgccgcttcgtcctggagcttcccgactgc accctggctcacttcgccctaggcgccgacgcccccggcgacgcagacgcccccgacccc cgcctggcggcgctgctggggcccccggagcgcagctactcgctgtgcgtgcccgtgacc ccggacgccggctgcggggcccgggtccgggcggcgcggctgcaccagcgcctgctgcac cagctgcgccgcggccccttccagcggtgccagctgctcaggctgctctgctactgcccg ggcggccaggccggcggcgcacagcaaggcttcctgctgcgcgaccccctggatgaccct gacacccggcaagcgctgctcgagctgctgggcgcctgtcaggaggcaccacgcccgcac ttgggcgagttcgaggccgacccgcgcggccagctgtggcagcgcctctgggaggtgcaa gacggcaggcggctgcaggtgggctgcgcacaggtcgtgcccgtcccggagcccccgctg cacccggtggtgccagacttgcccagttccgtggtcttcccggaccgggaagccgcccgg gccgttttggaggagtgtacctcctttattcctgaagcccgggcagtgcttgacctggtc gaccagtgcccaaaacagatccagaaaggaaagttccaggttgttgccatcgaaggactg gatgccacgggtaaaaccacggtgacccagtcagtggcagattcacttaaggctgtcctc ttaaagtcaccaccctcttgcattggccagtggaggaagatctttgatgatgaaccaact atcattagaagagctttttactctttgggcaattatattgtggcctccgaaatagctaaa gaatctgccaaatctcctgtgattgtagacaggtactggcacagcacggccacctatgcc atagccactgaggtgagtgggggtctccagcacctgcccccagcccatcaccctgtgtac cagtggccagaggacctgctcaaacctgaccttatcctgctgctcactgtgagtcctgag gagaggttgcagaggctgcagggccggggcatggagaagaccagggaagaagcagaactt gaggccaacagtgtgtttcgtcaaaaggtagaaatgtcctaccagcggatggagaatcct ggctgccatgtggttgatgccagcccctccagagaaaaggtcctgcagacgtcccaaatt ctagagccaagtgttcctgcagaggctgtctatgtgtcctggctgcccaaggacactcct gcagagccatttttgggtcccagctgggaggaaccttataccatccttctctccacctgc tcggcagtgaagtttgtgggactggaatcttggattcatcacactcgagtcaaggcctgg aatgacccctga >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_4|361_aa MWVLTPAAFAGKLLSVFRQPLSSLWRSLVPLFCWLRATFWLLATKRRKQQLVLRGPDETK EEEEDPPLPTTPTSVNYHFTRQCNYKCGFCFHTAKTSFVLPLEEAKRGLLLLKEAGMEKI NFSGGEPFLQDRGEYLGKLVRFCKVELRLPSVSIVSNGSLIRERWFQNYGEYLDILAISC DSFDEEVNVLIGRGQGKKNHVENLQKLRRWCRDYRVAFKINSVINRFNVEEDMTEQIKAL NPVRWKVFQCLLIEGENCGEDALREAERFVIGDEEFERFLERHKEVSCLVPESNQKMKDS YLILDEYMRFLNCRKGRKDPSKSILDVGVEEAIKFSGFDEKMFLKRGGKYIWSKADLKLD W >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_4|1086_bp atgtgggtgcttacacctgctgcttttgctgggaagctcttgagtgtgttcaggcaacct ctgagctctctgtggaggagcctggtcccgctgttctgctggctgagggcaaccttctgg ctgctagctaccaagaggagaaagcagcagctggtcctgagagggccagatgagaccaaa gaggaggaagaggaccctcctctgcccaccaccccaaccagcgtcaactatcacttcact cgccagtgcaactacaaatgcggcttctgtttccacacagccaaaacatcctttgtgctg ccccttgaggaagcaaagagaggattgcttttgcttaaggaagctggtatggagaagatc aacttttcaggtggagagccatttcttcaagaccggggagaatacctgggcaagttggtg aggttctgcaaagtagagttgcggctgcccagcgtgagcatcgtgagcaatggaagcctg atccgggagaggtggttccagaattatggtgagtatttggacattctcgctatctcctgt gacagctttgacgaggaagtcaatgtccttattggccgtggccaaggaaagaagaaccat gtggaaaaccttcaaaagctgaggaggtggtgtagggattatagagtcgctttcaagata aattctgtcattaatcgtttcaacgtggaagaggacatgacggaacagatcaaagcacta aaccctgtccgctggaaagtgttccagtgcctcttaattgagggtgagaattgtggagaa gatgctctaagagaagcagaaagatttgttattggtgatgaagaatttgaaagattcttg gagcgccacaaagaagtgtcctgcttggtgcctgaatctaaccagaagatgaaagactcc taccttattctggatgaatatatgcgctttctgaactgtagaaagggacggaaggaccct tccaagtccatcctggatgttggtgtagaagaagctataaaattcagtggatttgatgaa aagatgtttctgaagcgaggaggaaaatacatatggagtaaggctgatctgaagctggat tggtag >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_5|199_aa MRKNQHKNAENSKNHNASSPPKDHNSPPARKQNWMQNEFDRLTEVDFRRWVTTNSSKLKE HVLTQCKEAKNLDKRLQELLTRITSLEKNINDLIELKNTAQELHEAYTSINSRIDQVEEM ISKIEDQLNEIKREDKIREKRMQRNKQSLQEIWDYVKRPNLRLIGVPERDEDNGTKLENT SGYHPGELPQPSKTGQHSN >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_5|600_bp atgaggaaaaaccagcacaaaaatgctgaaaattccaaaaaccacaatgcctcttctcct ccaaaggatcacaactccccaccagcaaggaaacaaaactggatgcagaatgagtttgac agactgacagaggtagacttcagaaggtgggtaacaacaaactcctccaagctaaaggag catgtcctaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaataaccagtttagagaagaacataaatgacctgatagagctgaaaaacacagca caagaacttcatgaagcatacacaagtatcaatagccgaatcgatcaagtggaagaaatg atatcaaagattgaagatcaacttaatgaaataaagcgtgaagacaagattagagaaaaa agaatgcaaaggaacaaacaaagcctccaagaaatatgggattatgtgaaaagaccaaat ctacgtttaattggtgtacctgaaagagacgaggataatggaaccaagttggaaaacact tcaggatatcatccaggagaacttccccaacctagcaagacaggccaacattcaaattag >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_6|533_aa MGIYATTYRNDIQKDVWCTDEIRELKAASSGGTSDSLVGLLHNHFRYTPWVQTKAEPPKF PKGTVKAVKNRNLTDIEALTTTAALGDASGLCRAHCPSSDMNSTQSDLQVHGIREQMDAD FIPCHPQNAKLLRELTGSEGATQNPQKWQEEDGPVSDHHRRQAATAGDTGNAGKSREKTQ IHFNKYLLSASYEKPVCRVLEGYKDNQDRTGIPSPHCTHTEPIDGTHRLQSSSAYAIKIR RVAIEVARKLTEGPRCFLTFSIRQRASPKVSGSPHPSQSKGSLGAHSLCYAYSQDMRSDG HIQALGSNHSTLCISLHIAGKGHVLCFPCIAQCLGPPSNTATEVSPAVTLDLTYMVSTQP RSAPCGSIFSSVQCQCQRRASPQSPTLAILQLEGKAAFVMSLQESLGSWHMDDYSLLKVM GSILICRPLGSKPGFKIDDVCAMLDNIPALCFTLLLCQLGIATYACAAGLCENCIHDASK ALKIRQIHNEYQPPPPPLVPNTPEMIKYWLTHCMPPRSLLLSGTLHPAYKPAP >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_6|1602_bp atggggatctatgcaacaacctacagaaatgacatccaaaaagatgtgtggtgtacagat gagattagggagctgaaagctgcttcttctggaggcacctctgactctcttgttgggcta cttcacaaccatttcaggtacacgccatgggtacagacgaaggccgagccgcccaagttc cctaaaggaacagtgaaagcagtgaaaaacagaaacctgacagatattgaagcactgacc acgaccgctgcactaggagatgcttctggactctgccgggcacactgtcccagttctgac atgaacagtactcagtcagacctgcaggtgcatggtatcagggagcaaatggatgctgac ttcattccatgtcacccccaaaatgccaagctgctcagggaactgacagggtctgaaggg gccacgcagaacccccagaagtggcaggaagaggatgggccagtttctgaccatcaccga aggcaggctgccacagcaggagacactgggaatgcaggaaaaagcagggaaaagacgcag attcatttcaacaaatatttattgagcgcctcctatgagaagcctgtgtgccgagtgcta gaggggtacaaagacaaccaggacaggactgggattcccagcccacactgtacccacacc gaacctattgatggaacacacaggctacaaagctccagtgcatacgctatcaaaattcgc cgtgtggcaatcgaggtcgccagaaaactcacagaaggacccaggtgcttcctgacattc tccatccgtcaaagagccagccccaaggtctctggaagcccacacccttcccaaagcaaa ggctcccttggagcacacagcctgtgttatgcttactcccaggacatgcgctcggatggg cacattcaggcgcttggttctaaccattccacattatgcatctcccttcacatcgcaggc aaaggccacgtcctatgcttcccgtgcatcgcacagtgcctcgggccccccagcaacacg gccacggaagtctcgccagctgttactctggatctcacttacatggtgtctacacaacca cgttctgctccgtgcggctccatcttctcatctgtgcaatgccagtgtcagaggagagca tctccacagtcacccacattggcaattttacagctggaaggaaaggcagcatttgtgatg agccttcaagaatcattaggaagctggcacatggatgactacagcctcctcaaggtcatg ggatctatccttatctgcaggcctctgggttcaaagccaggcttcaaaattgacgacgtg tgtgcgatgttggacaacatccctgctctctgcttcactctgctcctctgtcagctaggg atagcaacatacgcctgtgctgcagggctgtgtgagaactgcatccacgatgcaagcaaa gccctgaagatcaggcaaatccataatgagtatcaaccacctcctccacctctagtgcca aacacacctgagatgataaaatactggttgacacactgcatgccaccaaggagcttgctt ttgtctggaacactgcatccagcttataagcctgctccttag >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_7|74_aa MENISGFVDHGVSVEKTSLMLEHEAARDGTDSMGQPEHTTPENTVGSDHQSCLRRMMPRD TGNALAKFTVPALP >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_7|225_bp atggaaaatatttcaggctttgtggatcatggggtctctgtagaaaagacctctctgatg ctggagcacgaagcagccagagacggcacggactccatgggacagcctgaacacacaaca cctgagaacaccgtgggctctgaccaccagtcatgcttgaggaggatgatgccaagagac actggcaatgccctcgccaagttcacagtgcctgctttgccctga >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_8|96_aa MWESLELPRDLLNGFDQHADSRIDNEVQAEVVSDGDEEPNGNWSKGHSHYALVKRLVAFS PCPRDLWNFEIERDNLRYLAEEISKQQSIEMRPGFS >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_8|291_bp atgtgggaaagtttggaacttcctagagacttgttgaatggttttgaccaacatgctgac agtcgtatcgacaatgaagttcaggctgaggtggtctcagatggagatgaggaacctaat gggaactggagcaaaggtcactctcactatgctttagtaaagagactggtggcattttcc ccctgccctagagatctgtggaactttgaaattgagagagataatttaaggtatctggca gaggaaatttctaagcagcaaagcatcgagatgcgacctggcttttcctga >gi568815596f:6777801_6995939|GENSCAN_predicted_peptide_9|41_aa MERLDWLNLLASVVLLCWAFCPRTSDSKFFSFGTPGPSTTD >gi568815596f:6777801_6995939|GENSCAN_predicted_CDS_9|126_bp atggaaagactggactggctgaatcttctggcctctgtcgttctcctgtgctgggctttc tgccctcgaacatcggactccaagttcttcagctttgggactcctggcccttcgaccaca gactga