GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:04:44 Sequence gi568815587f:17619783_17821505 : 201723 bp : 48.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9351 9534 184 2 1 50 77 207 0.840 15.16 1.02 Intr + 11920 12140 221 2 2 80 25 193 0.937 10.22 1.03 Intr + 12306 12444 139 2 1 48 78 231 0.997 18.14 1.04 Intr + 13898 14092 195 0 0 69 78 133 0.982 9.79 1.05 Intr + 14287 14499 213 2 0 106 89 172 0.997 17.79 1.06 Intr + 15298 15405 108 2 0 48 64 139 0.937 7.76 1.07 Intr + 15828 15929 102 1 0 99 89 67 0.989 8.05 1.08 Term + 18669 18832 164 1 2 140 55 140 0.612 14.00 1.09 PlyA + 19278 19283 6 1.05 2.02 PlyA - 19771 19766 6 -1.95 2.01 Sngl - 20325 19870 456 0 0 74 38 564 0.999 46.19 2.00 Prom - 20630 20591 40 -8.56 3.00 Prom + 20738 20777 40 -13.43 3.01 Init + 20993 21038 46 1 1 38 89 74 0.840 3.34 3.02 Intr + 21131 21309 179 0 2 60 54 261 0.840 19.54 3.03 Intr + 22065 22169 105 2 0 63 99 122 0.998 11.21 3.04 Intr + 22345 22464 120 0 0 103 86 64 0.901 8.39 3.05 Intr + 23679 23724 46 2 1 105 76 -32 0.611 -4.72 3.06 Intr + 25782 25861 80 1 2 117 38 190 0.653 16.17 3.07 Term + 25962 26162 201 2 0 98 54 266 0.973 21.59 3.08 PlyA + 27479 27484 6 1.05 4.07 PlyA - 28198 28193 6 1.05 4.06 Term - 60630 60443 188 1 2 88 32 92 0.726 1.25 4.05 Intr - 69117 68965 153 1 0 110 38 68 0.123 4.04 4.04 Intr - 80079 79933 147 1 0 79 68 82 0.699 5.51 4.03 Intr - 84220 84144 77 0 2 72 90 112 0.387 8.96 4.02 Intr - 88641 88544 98 0 2 94 11 26 0.143 -5.69 4.01 Init - 88790 88680 111 2 0 109 106 14 0.665 5.52 4.00 Prom - 92318 92279 40 -5.86 5.00 Prom + 94756 94795 40 -6.56 5.01 Init + 100001 100630 630 1 0 56 91 1283 0.347 120.75 5.02 Intr + 101120 101198 79 1 1 117 64 163 0.990 15.92 5.03 Intr + 101473 101717 245 2 2 63 47 215 0.671 12.12 5.04 Term + 102295 102585 291 0 0 53 40 187 0.211 5.74 5.05 PlyA + 104412 104417 6 1.05 6.00 Prom + 107540 107579 40 -5.76 6.01 Init + 116221 116790 570 0 0 104 114 1613 0.988 160.59 6.02 Intr + 151883 152816 934 1 1 125 91 1711 0.996 165.87 6.03 Intr + 159674 159862 189 0 0 44 75 180 0.285 11.86 6.04 Term + 165432 165442 11 1 2 125 49 -4 0.167 -2.24 6.05 PlyA + 166958 166963 6 1.05 7.04 PlyA - 168290 168285 6 1.05 7.03 Term - 168631 168303 329 2 2 100 44 207 0.713 12.37 7.02 Intr - 172667 172583 85 2 1 75 55 30 0.012 -2.21 7.01 Intr - 189350 189263 88 1 1 139 71 23 0.034 5.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_1|441_aa VTVDLQPVWPPVSRYGFRIEDTGHMYMILTPSDIQIQWLHSSGLMIVEASKTSKAQGHGL CGICDGDAANDLTLKDGSVVGGAEDPAPFLDSWQVPSSLTSVGQTRFRPDSCATTDCSPC LRMVSNRTFSACHRFVPPESFCELWIRDTKYVQQPCVALTVYVAMCHKFHVCIEWRRSDY CPFLCSSDSTYQACVTACEPPKTCQDGILGPLDPEHCQVLGEGCVCSEGTILHRRHSALC IPEAKCACTDSMGVPRALGETWNSSLSGCCQHQCQAPDTIVPVDLGCPSPRPESCLRFGE VALLLPTKDPCCLGTVCECDPDLCEAELVPSCRQDQILITGRLGDSCCTSYFCACGDCPD SIPECQEGEALTVHRNTTELCCPLYQCVCENFRCPQVQCGLGTALVEVWSPDRCCPYKSC GESVVRTASPLGDPVALLRRD >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_1|1326_bp gtgactgtggacttgcagcctgtgtggccaccggtgagcaggtatggattcagaattgag gacacaggccacatgtacatgatcctgactccctcagacatccagatccagtggctccac agctcaggactcatgatcgtggaggccagcaaaaccagcaaggcccagggccatggcctg tgcggtatctgtgatggagatgcagccaatgaccttaccctgaaggatggctcagtggtg ggtggggctgaggaccctgctccctttctggacagctggcaggtgcccagctccctgacc tcagtgggccagacccgcttccgcccagacagctgcgccacaactgactgctcgccctgc cttcgcatggtgtccaaccgcaccttcagtgcctgccaccgctttgtgcctccggagtca ttctgtgagctgtggatccgggacaccaagtacgtgcagcagccctgcgtggccctgact gtgtacgtggccatgtgccacaaatttcatgtgtgcatcgagtggcggcgctctgactac tgccccttcctgtgctccagcgactccacataccaggcatgtgtgacagcctgtgagcca cccaagacatgccaggatgggatactagggcctctggacccagagcactgccaggtgctg ggcgagggctgcgtctgctccgagggcaccatcttacaccggcgccactctgcactctgc atcccggaggccaagtgcgcctgcactgacagcatgggggtgccgagggccctgggggag acctggaacagctccctcagcggctgctgccagcaccagtgccaagccccagacaccatt gtcccggtggatctgggctgccccagtccccgccctgagagctgcctgcgattcggggag gtggccttgctcctacccaccaaggacccctgctgcctggggactgtctgtgagtgtgac ccagatctctgtgaggcagagctggtccccagctgccgacaggaccagatcctgatcacg ggccgcctgggggactcctgctgcacctcctacttctgcgcctgtggtgactgtccagac tccatccccgaatgtcaagaaggggaggcgctcactgtgcacaggaataccacggaactc tgctgccctctgtaccagtgtgtgtgtgagaacttccgctgtccccaagtgcagtgtggc ctgggcactgccctggtggaggtgtggagccccgaccgctgctgcccctacaaatcctgt ggtgagtccgtggtcaggacagcctccccgctgggagatccagtggccctgctgaggagg gattga >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_2|151_aa MIVINIIIIIAIIIFTTTIILTFPTTTIVIALSPPSPSSLHYHHHPHCPHHHHHCIITIT TTTILTITTTIILTITTTITIALSPSPPPSSSPSPPPPSLHYYHNHCTITIVTTTILLTV PTIITTISIIIIIFMIIFIFIVLNPYHNSKA >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_2|456_bp atgattgtcatcaacatcatcatcatcattgccattatcatcttcaccaccaccatcatc ctcaccttccccactaccaccattgtcattgcattatcaccaccatcaccatcgtcactg cattatcaccatcaccctcactgtccccaccaccaccatcattgcattatcaccatcacc accaccaccatcctcaccatcaccaccaccatcatcctcaccatcaccaccaccatcacc attgcattatcaccatcaccaccaccatcatcctcaccatccccaccaccaccatcattg cattattaccataatcactgcactatcaccatcgtcaccaccaccatcctcctcactgtc cccaccatcatcactaccatcagcatcatcattatcatcttcatgataattttcatcttc attgtacttaatccttaccacaactctaaagcatga >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_3|258_aa MHSVENVCGCAKYECVKAPVCLSRELGVMQPGQTVVELSADGVCHTSRCTTVLDPLTNFY QINTTSVLCDIHCEANQEYEHPRDLAACCGSCRNVSCLFTFPNGTTSLFLPGASWIADCA RHHCSSTPLGAVLVRSPISCPPLNETECAKVGGSVVPSLEGCCRTCKEDGRSCKKVTIRM TIRKNECRSSTPVNLVSCDGRCPSASIYNYNINTYARFCKCCREVGLQRRSVQLFCATNA TWVPYTVQEPTDCACQWS >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_3|777_bp atgcacagcgtggagaatgtgtgtggctgcgccaagtacgagtgtgtgaaggccccggtg tgtctgagccgcgagctgggtgtgatgcagcccggccagacagtggtggagctctcagca gatggcgtgtgccacacctcccgctgcaccaccgtgctcgaccctctcaccaacttctac cagatcaacaccacctccgtgctctgtgacatccactgtgaggcgaaccaggagtacgag cacccgcgggacctcgctgcctgctgcggctcctgcaggaacgtgtcctgtctcttcacc ttccccaatggcaccacctccctgttcttgcccggggcatcctggatcgcagactgcgcc cgccaccactgcagcagcacgcccctgggtgccgtgctggtccgctctcccataagctgc ccaccgctcaatgagactgagtgtgccaaggttgggggttccgtggtaccttccttggaa ggatgctgcaggacctgtaaggaggatgggcgctcctgcaagaaggtgaccatccgcatg accatccgcaagaatgaatgcaggagcagcacccctgtgaacctagtgtcctgcgatggg aggtgcccatccgccagcatctacaactacaacatcaacacctatgcccgattctgcaag tgctgccgtgaggtgggcctgcagcggcgctctgtgcagctcttctgtgccaccaatgcc acctgggtgccctatacagtgcaggagcccaccgactgtgcctgccagtggtcctga >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_4|257_aa METGPPVLGLCSSCVSTGLRSLIHLDSNAVSARYLPWLSNVTGSSTYGGCKGSVVKVYTV LASCLSQGAQTCENITLQGKKDFADVDGKIILDSRGVSEKQDSRKEDKAPNLRTQLEKPE KAPEQLTLGATYWRQPPPPPHWLCCDVSEKQGSIPTLPFFLGHTCGCYASVIWPSFYLFL HFQFVDSSESDRKALGEQKPYGLDRRSSSMPLCYGAPPFLVSASIAFKGKKEEGVSGEYL EELKYSPKEHAFTPKQF >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_4|774_bp atggagactggaccaccagtgctggggctatgttcctcatgtgtctccactggacttagg agtctaatacacctggattcaaatgctgtgtctgctcgttacttaccttggctgtcaaat gtgactggttctagcacctacggaggatgtaagggttcagttgtcaaggtatacacagtg cttgcatcatgcctgtctcagggtgcacaaacctgtgaaaacattacgttacaaggcaaa aaggactttgcagatgtggatgggaagattatcctggattcccgaggggtcagtgagaag caggactccaggaaggaagacaaggccccaaacctcaggactcagttggagaaaccagag aaggccccagaacagctgactctgggcgccacgtattggaggcagccaccccctccccca cactggctgtgctgtgacgtctctgaaaaacagggcagcatccccacattgcccttcttt ctgggtcatacctgtggatgctatgccagtgttatttggccatctttctacctcttcctg cattttcagtttgttgatagttctgagtcagatcggaaagctcttggtgagcagaagcca tatggcttagaccgcaggtcctcctccatgccattgtgctatggagcacccccttttctt gtcagtgcctctatagcattcaaggggaagaaagaggaaggagtgagtggggagtacctt gaagaactgaagtactcaccaaaagagcatgcatttactccaaagcaattttaa >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_5|414_aa MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPE EHSHFPAAVHPAPGAREDEHVRAPSGHHQAGRCLLWACKACKRKTTNADRRKAATMRERR RLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFYA PGPLPPGRGGEHYSGDSDASSPRSNCSDGMMDYSGPPSGARRRNCYEGAYYNEAPSEPRP GKSAAVSSLDCLSSIVERISTESPAAPALLLADVPSESPPRRQEAAAPSEGESSGDPTQS PDAAPQCPAGANPNPIYQAGEPRALAQVIKIKALIYTAVAPAFPGHGCGIRRKIRKLGQL SLSDACRRQADCKEEACCLGKEGGVQISPVREEVPLTLTTLLHTSVALMEGYTG >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_5|1245_bp atggagctactgtcgccaccgctccgcgacgtagacctgacggcccccgacggctctctc tgctcctttgccacaacggacgacttctatgacgacccgtgtttcgactccccggacctg cgcttcttcgaagacctggacccgcgcctgatgcacgtgggcgcgctcctgaaacccgaa gagcactcgcacttccccgcggcggtgcacccggccccgggcgcacgtgaggacgagcat gtgcgcgcgcccagcgggcaccaccaggcgggccgctgcctactgtgggcctgcaaggcg tgcaagcgcaagaccaccaacgccgaccgccgcaaggccgccaccatgcgcgagcggcgc cgcctgagcaaagtaaatgaggcctttgagacactcaagcgctgcacgtcgagcaatcca aaccagcggttgcccaaggtggagatcctgcgcaacgccatccgctatatcgagggcctg caggctctgctgcgcgaccaggacgccgcgccccctggcgccgcagccgccttctatgcg ccgggcccgctgcccccgggccgcggcggcgagcactacagcggcgactccgacgcgtcc agcccgcgctccaactgctccgacggcatgatggactacagcggccccccgagcggcgcc cggcggcggaactgctacgaaggcgcctactacaacgaggcgcccagcgaacccaggccc gggaagagtgcggcggtgtcgagcctagactgcctgtccagcatcgtggagcgcatctcc accgagagccctgcggcgcccgccctcctgctggcggacgtgccttctgagtcgcctccg cgcaggcaagaggctgccgcccccagcgagggagagagcagcggcgaccccacccagtca ccggacgccgccccgcagtgccctgcgggtgcgaaccccaacccgatataccaggcgggc gagccgcgggcgctcgctcaggtgatcaaaataaaggcgctaatttataccgccgtggct ccggctttccctggacatgggtgtgggatccggaggaaaatccgcaaactgggccagctg tccctcagcgacgcctgtaggcggcaggcggattgcaaggaggaagcctgctgcctgggg aaggaaggaggggtgcaaatttctccagtacgtgaggaagttcctctgaccttgactaca ttactacacacgtccgtggctcttatggaagggtacacaggttga >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_6|567_aa MGQGDESERIVINVGGTRHQTYRSTLRTLPGTRLAWLAEPDAHSHFDYDPRADEFFFDRH PGVFAHILNYYRTGKLHCPADVCGPLYEEELAFWGIDETDVEPCCWMTYRQHRDAEEALD SFGGAPLDNSADDADADGPGDSGDGEDELEMTKRLALSDSPDGRPGGFWRRWQPRIWALF EDPYSSRYARYVAFASLFFILVSITTFCLETHERFNPIVNKTEIENVRNGTQVRYYREAE TEAFLTYIEGVCVVWFTFEFLMRVIFCPNKVEFIKNSLNIIDFVAILPFYLEVGLSGLSS KAAKDVLGFLRVVRFVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALGVLIFA TMIYYAERIGAQPNDPSASEHTHFKNIPIGFWWAVVTMTTLGYGDMYPQTWSGMLVGALC ALAGVLTIAMPVPVIVNNFGMYYSLAMAKQKLPKKKKKHIPRPPQLGSPNYCKSVVNSPH HSTQSDTCPLAQEEILEINRADSKLNGEVAKAALANEDCPHIDQALTPDEGLPFTRSGTR ERYGPCFLLSTGEYACPPGGGMRKGGI >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_6|1704_bp atgggccaaggggacgagagcgagcgcatcgtgatcaacgtgggcggcacgcgccaccag acgtaccgctcgaccctgcgcacgctgcccggcacgcggctcgcctggctggcggagccc gacgcccacagccacttcgactatgacccgcgtgctgacgagttcttcttcgaccgccac cccggcgtcttcgcgcacatcctgaactactaccgcacgggcaagctgcactgcccagcc gacgtgtgcgggccgctctacgaggaggagctggccttctggggcatcgacgagaccgac gtggagccctgctgctggatgacgtaccgccagcaccgcgacgccgaggaggctctggac agcttcggcggcgctcctctggacaacagcgccgacgacgcggacgccgacggccctggc gactcgggcgacggcgaggacgagctggagatgaccaagcgcctggcgctcagtgactcc ccggatggccggcctggcggcttttggcgccgctggcagccgcgcatctgggcgctcttc gaggacccgtactcgtcccgctacgcgcggtatgtggccttcgcttccctcttcttcatc ctggtctccatcaccaccttctgcctggagacccacgagcgcttcaaccccatcgtgaac aagacggagatcgagaacgttcgcaatggcacgcaagtgcgctactaccgggaggccgag acggaggccttccttacctacatcgagggcgtctgtgtggtctggttcaccttcgagttc ctcatgcgtgtcatcttctgccccaacaaggtagagttcatcaagaactcgctcaacatc attgactttgtggccatcctgcccttctacctggaggtggggctgagcggcctgtcctcc aaggcagccaaggacgtgctgggcttcctgcgcgtcgtccgcttcgtgcgcatcttgcgc atctttaagctgacccgccactttgtgggcctgcgggtcctgggccacacgctccgagcc agcaccaacgagttcctgctgctcatcatcttcctggccttgggcgtgctgatcttcgcc accatgatctactacgccgagaggataggggcacagcccaatgaccccagcgccagtgag cacacgcactttaagaacatccccatcggcttctggtgggccgtggtcaccatgacgacc ctgggctatggagacatgtacccgcagacgtggtccggcatgctggtgggggctctgtgt gcgctggcgggcgtgctcaccatcgccatgcccgtgcccgtcatcgtgaacaatttcggg atgtattactccttagccatggctaagcagaaactaccaaagaaaaaaaagaagcatatt ccgcggccaccgcagctgggatctcccaattattgtaaatctgtcgtaaactctccacac cacagtactcagagtgacacatgtccgctggcccaggaagaaattttagaaattaacaga gcagattccaaactgaatggggaggtggcgaaggccgcgctggcgaacgaagactgcccc cacatagaccaggccctcactcccgatgagggcctgccctttacgcgctcgggcacccgc gagagatacggaccctgcttcctcttatcaaccggggagtacgcgtgcccacctggtgga ggaatgagaaagggaggaatatga >gi568815587f:17619783_17821505|GENSCAN_predicted_peptide_7|167_aa XKSLLIPQIIYSLLAGAMNGSFYIPYDIYQVSSAFLPPPSLLTVSWCHSPGPGTKPIPGG VCYSWGWNEHGMCGDGTEANVWAPKPVQALLSSSGLLVGCGAGHSLALCQLPAHPALVQD PKVTYLSPDAIEDTESQKAMDKERNWKERQSETSTQSQSDWSRNGGL >gi568815587f:17619783_17821505|GENSCAN_predicted_CDS_7|504_bp nngaagtctcttctgatccctcagattatttattccttgctagcaggtgccatgaatggt tcattttatatcccttatgatatctaccaagtcagctctgccttcctcccacccccatcc ctgctgacggtcagctggtgccactcaccagggccgggcacaaagcccataccaggtgga gtgtgttactcttggggctggaatgagcatggcatgtgcggagatggcactgaagccaac gtctgggccccaaagccggtgcaggctctgctgtcatcgtcaggactccttgtgggctgt ggggctggccactccttggccctctgccagctgccagctcaccctgcattggtccaggac cccaaggtcacctacctttccccagatgccatcgaggacactgaatctcagaaagccatg gacaaagagagaaactggaaggaaagacaatcagaaacttcaacccaaagccaatctgac tggtccagaaatgggggactgtga