GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:06:38 Sequence gi568815576f:35556426_35759532 : 203107 bp : 46.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7552 7591 40 -0.66 1.01 Init + 32429 32519 91 1 1 92 113 63 0.816 9.85 1.02 Term + 37534 37610 77 2 2 46 44 108 0.537 0.20 1.03 PlyA + 38883 38888 6 1.05 2.10 PlyA - 40774 40769 6 1.05 2.09 Term - 51018 50872 147 0 0 108 47 275 0.998 23.30 2.08 Intr - 54681 54459 223 2 1 110 75 322 0.644 31.23 2.07 Intr - 60840 60738 103 1 1 100 98 115 0.402 12.93 2.06 Intr - 66108 66010 99 1 0 94 68 23 0.143 0.98 2.05 Intr - 71234 71151 84 0 0 60 28 118 0.074 2.79 2.04 Intr - 78656 78530 127 2 1 92 68 27 0.322 1.35 2.03 Intr - 86882 86777 106 1 1 110 63 16 0.013 1.52 2.02 Intr - 91832 91784 49 0 1 87 77 28 0.043 -0.66 2.01 Init - 95333 95201 133 2 1 78 47 69 0.359 2.10 2.00 Prom - 95457 95418 40 -0.26 3.02 PlyA - 95741 95736 6 -0.45 3.01 Sngl - 96906 95968 939 0 0 58 43 327 0.504 21.71 3.00 Prom - 98124 98085 40 -5.16 4.00 Prom + 99037 99076 40 -3.66 4.01 Init + 99364 99515 152 0 2 69 106 16 0.892 1.02 4.02 Intr + 102190 103024 835 1 1 76 59 749 0.961 62.20 4.03 Term + 135605 135775 171 1 0 86 34 107 0.182 2.93 4.04 PlyA + 135789 135794 6 1.05 5.00 Prom + 137341 137380 40 -4.76 5.01 Init + 138827 138887 61 1 1 71 85 32 0.875 2.55 5.02 Intr + 140022 140189 168 1 0 92 97 77 0.926 8.92 5.03 Intr + 141291 141464 174 1 0 84 93 76 0.865 7.61 5.04 Term + 141535 141746 212 2 2 -14 49 191 0.577 2.46 5.05 PlyA + 142231 142236 6 1.05 6.05 PlyA - 142381 142376 6 -1.75 6.04 Term - 143166 143092 75 0 0 56 48 85 0.631 -0.86 6.03 Intr - 145835 145710 126 2 0 44 84 159 0.991 11.98 6.02 Intr - 147551 147384 168 2 0 83 88 134 0.957 13.04 6.01 Init - 150886 150884 3 1 0 61 115 0 0.413 -0.00 6.00 Prom - 151131 151092 40 -4.26 7.00 Prom + 154961 155000 40 -3.56 7.01 Init + 161447 161501 55 1 1 51 119 49 0.481 5.75 7.02 Intr + 169786 170769 984 2 0 8 97 737 0.154 57.47 7.03 Intr + 178306 178828 523 2 1 135 -14 294 0.809 15.91 7.04 Intr + 180274 180462 189 1 0 36 44 180 0.559 7.10 7.05 Term + 186440 186545 106 2 1 67 41 116 0.441 2.68 7.06 PlyA + 186914 186919 6 1.05 8.04 PlyA - 187058 187053 6 1.05 8.03 Term - 187824 187770 55 2 1 71 53 62 0.217 -1.97 8.02 Intr - 189570 189498 73 1 1 95 103 -1 0.579 0.56 8.01 Init - 190114 190030 85 1 1 55 77 133 0.323 7.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 111492 111664 173 1 2 74 44 126 0.815 4.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_1|55_aa MGKVRVNPEDKGGHSVGVQSQGAGEGCQKQASSLGKTNLSYVTTKGSRNSGIQTL >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_1|168_bp atgggcaaggtgagggtgaatcctgaagacaaaggtggccactcggtgggtgttcagagc cagggagcaggtgaaggttgtcagaaacaagcttcctcacttggaaaaacaaatctgagc tacgtgaccaccaagggctcccggaactctggcattcagaccctgtga >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_2|356_aa MEYYAAIKKDEFVSFVGTWMKLETIILSKLSQGQKTKHRMFSLLGWTYMGYENSSCGLEL WWAFSITERREFQVYYNSCPSSSWIGDARAFSPPDPTGLYPIYSQDTQTFRLWLNYNTDF PGSPACKWQAMELLSPQSPFRVDRGEHLPFLVKGARYTLVPAGQEGALAAWLEALRGQLG RRGAVVSMMDAEGLERSSPDCAMGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHP ETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKH KIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_2|1071_bp atggaatactatgcagccataaaaaaggatgagttcgtgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcctaggatggacctatatggggtatgaaaatagctcctgtggcttagagcta tggtgggctttttccataacagaaagaagagaatttcaagtctattacaattcatgtcct tctagttcctggattggagatgcaagagcattttccccaccagatccaactgggctttac cccatctactcccaggatactcagaccttcagactctggctgaattataacactgacttt cctgggtctccagcttgcaaatggcaggctatggaacttctcagcccacaatcacccttc agggtagaccgaggagagcacctccccttcctggtgaagggagcccgatacacgctggtg ccggctggccaagaaggagccctggccgcttggctggaggctctgcgaggacagctgggg agaaggggagctgtggtcagtatgatggatgctgaggggctggagaggagcagccctgac tgcgccatggggctcagcgacggggaatggcagttggtgctgaacgtctgggggaaggtg gaggctgacatcccaggccatgggcaggaagtcctcatcaggctctttaagggtcaccca gagactctggagaagtttgacaagttcaagcacctgaagtcagaggacgagatgaaggcg tctgaggacttaaagaagcatggtgccaccgtgctcaccgccctgggtggcatccttaag aagaaggggcatcatgaggcagagattaagcccctggcacagtcgcatgccaccaagcac aagatccccgtgaagtacctggagttcatctcggaatgcatcatccaggttctgcagagc aagcatcccggggactttggtgctgatgcccagggggccatgaacaaggccctggagctg ttccggaaggacatggcctccaactacaaggagctgggcttccagggctag >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_3|312_aa MIVYLENPIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLHTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDFFKENYKPLLNETKEDTNKWKNIPCSWVGRINIMKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGDITLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGNDSLFNKWCWENWLAT RRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNIIQDIGMGKDFMSKTPKATA TKAKIDKGISLN >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_3|939_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataggc aacttcagcaaagtctcgggatacaaaatcaatgtgcaaaaatcacaagcattcttacac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacttcttcaaggag aactacaaaccactgctcaatgaaacaaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagccaa aagaacaaagctggagacatcacgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagag ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgacaaaaac aagaaatggggaaatgattccctgtttaataaatggtgctgggaaaactggctagccaca cgcagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg attaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaat atcattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaacggca acaaaagccaaaattgacaaagggatctcattaaactaa >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_4|385_aa MGEIQGLNNSQRGSFEFQLQYISNNEQYIFREVARQWKRVLSLQQEQLVGKDEDDAPLCE DVELQDGDLSPEEKIFLREFPRLKEDLKGNIDKLRALADDIDKTHKKFTKANMVATSTAV ISGVMSLLGLALAPATGGGSLLLSTAGQGLATAAGVTSIVSGTLERSKNKEAQARAEDIL PTYDQEDREDEEEKADYVTAAGKIIYNLRNTLKYAKKNVRAFWKLRANPRLANATKRLLT TGQVSSRSRVQVQKAFAGTTLAMTKNARVLGGVMSAFSLGYDLATLSKEWKHLKEGARTK FAEELRAKALELERKLTELTQLYKSLQQKDIRNKMTEGMYTHCDIFNNVILYPLAIRKNI IERCTLSVISGVISSPDIRKSYINY >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_4|1158_bp atgggagagatccagggattaaataattctcaaagaggtagctttgaattccagcttcaa tacatctccaacaatgaacagtacatttttagagaagtagcaagacaatggaaaagggtt ttgagtctccagcaggagcaacttgtgggaaaggatgaggatgacgctcctctgtgtgaa gacgtggagctacaagacggagatctgtcccccgaagaaaaaatatttttgagagaattt cccagattgaaagaagatctgaaagggaacattgacaagctccgtgccctcgcagacgat attgacaaaacccacaagaaattcaccaaggctaacatggtggccacctctactgctgtc atctctggagtgatgagcctcctgggtttagcccttgccccagcaacaggaggaggaagc ctgctgctctccaccgctggtcaaggtttggcaacagcagctggggtcaccagcatcgtg agtggtacgttggaacgctccaaaaataaagaagcccaagcacgggcggaagacatactg cccacctacgaccaagaggacagggaggatgaggaagagaaggcagactatgtcacagct gctggaaagattatctataatcttagaaacaccttgaagtatgccaagaaaaacgtccgt gcattttggaaactcagagccaacccacgcttggccaatgctaccaagcgtcttctgacc actggccaagtctcctcccggagccgcgtgcaggtgcaaaaggcctttgcgggaacaaca ctggcgatgaccaaaaatgctcgcgtgctgggaggtgtgatgtccgccttctcccttggc tatgacttggccactctctcaaaggaatggaagcacctgaaggaaggagcaaggacaaag tttgcggaagagttgagagccaaggccttggagctggagaggaaactcacagaactcacc cagctctacaagagcttgcagcagaaagatattaggaacaagatgaccgaagggatgtac acccactgcgatattttcaataatgtcatcctctaccccctggctattaggaagaacatc atagagcggtgtacactttctgtgatatcgggagtaatatcctctccagatatcaggaaa agttatattaattattaa >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_5|204_aa MGVLRARGGEGLAVTPRIAGGGVRLPVILFLISRDGEHDISFHIAVGVHSPGDTDPNIQQ VEYDMTANIAMNVQPPDIRNYVTGDCTLLAILGVISSSPIMDIKKNITREVYTPCDMESN SMLSLWDIRNNITEGGCTPPAILGVISFSPPRDIRNNITVGVYTPCDIATSIIVSLPAYK EQYHKGVYTPCDIGDNIFLSPAGY >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_5|615_bp atgggggtcctaagagcaagggggggagaggggctggctgttactccccgcatcgcagga gggggtgtacgcctacctgtgatattgttcctaatatccagggacggagagcatgatatt agttttcatatcgcagtaggtgtacactcacccggtgacaccgatcctaatatccagcag gtagagtatgacatgactgccaacatagcaatgaatgtacagccacccgatattaggaac tatgtcacaggagactgtacacttcttgcgatattgggagtaatatcatcctctcccatc atggatattaagaagaatattacaagggaggtgtacaccccctgcgatatggagagtaat agtatgctctccctttgggatattaggaacaatatcacagaaggaggatgtacaccccct gcgatattgggagtaatatcattttctccccctcgggatattcggaacaatatcacagtg ggtgtgtacaccccctgcgatattgccactagtatcatcgtctccctgccagcatataag gaacagtatcacaagggggtgtacaccccctgcgatattggggataatatcttcctctcc cccgctggctattag >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_6|123_aa MVEGNELLLMTVGQWTIGAQLLYEMAVPGTPVSRHSAVEGPVGAATQGSEDGEGESTWVY TAPNPGVVFLISKWEEDDIADNIEGGVHLFCDMVPDIQGEQYHTGLYTFCDIGSDINLSA FGY >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_6|372_bp atggttgaggggaacgagttgctcttaatgactgttgggcagtggaccataggagcccaa ctgctctatgaaatggctgttcctgggacacctgtgagcagacacagtgccgtggaaggg ccagttggtgcagccacccagggctcagaggatggtgaaggtgaatccacgtgggtgtac accgcccccaaccccggggtagtgttcctaatatccaagtgggaagaggatgacattgct gacaatatcgaagggggtgtacacctcttctgtgatatggttcctgatatccagggggaa cagtatcacacggggctgtacactttctgcgatattgggagtgatatcaacctctcggcc tttggatattaa >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_7|618_aa MPCGKQGNLQVPGSKVLPASLVNLCQSWKINNLMSTVHSDEAGMLSYFLFEELMRCDKDS MPDGNLSEEEKLFLSYFPLHKFELEQNIKELNTLADQVDTTHELLTKTSLVASSSGAVSG VMNILGLALAPVTAGGSLMLSATGTGLGAAAAITNIVTNVLENRSNSAARDKASRLGPLT TSHEAFGGINWSEIEAAGFCVNKCVKAIQGIKDLHAYQMAKSNSGFMAMVKNFVAKRHIP FWTARGVQRAFEGTTLAMTNGAWVMGAAGAGFLLMKDMSSFLQSWKHLEDGARTETAEEL RALAKKLEQELDRLTQHHRHLPQKASQTCSSSRGRAVRGSRVVKPEDQFSLHYSSAYPWI QAQKTFHLCETVQDPRIQAQKTCHLCETVQDPWIQAQKTFHLCETVQDPRIQAQKTCHLC ETVQDPWIQAQKTFHLCETVQDPRIQAQKAFHLCETVRDPRIQAQKAFHLSETVRDPRIQ AQKAFHLSETVRDPRIQAQKAFHLCETVRDPRIQAQKAFHLTERQTPNKLPGEHIVQGLV DQNKDLELATGWEAKEVCTKARHNPIEQAVERTNFGDGELSEKSVNVYKVVRVLSGVFLE DSEKLSGIATGSRLWIEK >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_7|1857_bp atgccatgtggcaaacaaggaaatttgcaagttcccggttccaaggtgttacctgcctca ctcgtgaacctgtgccagagttggaaaattaacaatttgatgtcaactgtccacagtgat gaggctggtatgctgtcctactttctgtttgaagagctgatgcgatgtgacaaagattcc atgccagatggaaatctgtcagaggaggaaaaattgtttctctcatattttcctttgcac aagtttgagctagaacagaacatcaaagaacttaacacccttgcggaccaagttgacacc actcacgagttgcttaccaagaccagcctggtggccagctcttccggggctgtttctggg gtcatgaacatcctgggtttggccctagcacctgtgacagcaggaggcagtctcatgctc tcagcaactgggacagggttgggggcagcagctgccatcaccaacatagtaacaaatgtc ttagaaaatagaagcaattcagcagcaagagacaaagccagccgactggggcctctgaca acatcacatgaggctttcggaggaataaattggtctgaaatcgaggctgctggcttttgt gttaataagtgtgtaaaagctatccagggcatcaaggatcttcatgcctaccagatggcc aaatccaactctggcttcatggctatggtcaagaattttgtggccaagagacacatccct ttctggacggctagaggggtgcagagagcctttgagggcacaactctggccatgaccaat ggtgcctgggtgatgggtgctgctggggctggcttcttacttatgaaagacatgagcagc ttcctgcagagctggaagcacctggaggatggggcaaggacggagacagcagaggaactg agagcacttgctaagaagctggagcaggagctggaccggctcacccagcaccaccggcac ctgccgcagaaggcgagccagacctgttccagctcccggggcagggctgttcgaggatcc cgtgtggttaaaccagaagaccagttctctctgcactactcatcggcttacccgtggatc caggctcagaaaacattccacctgtgtgaaactgtgcaagacccacggatccaggctcag aaaacttgccacctgtgtgaaactgtgcaagacccgtggatccaggctcagaaaacattc cacctgtgtgaaactgtgcaagacccacggatccaggctcagaaaacttgccacctgtgt gaaactgtgcaagacccgtggatccaggctcagaaaacattccacctgtgtgaaactgtg caagacccacggatccaggctcagaaagcattccacctgtgtgaaactgtgcgagaccca cggatccaggctcagaaagcattccacctgtctgaaactgtgcgagacccacggatccag gctcagaaagcattccacctgtctgaaactgtgcgagacccgcggatccaggctcagaaa gcattccacctgtgtgaaactgtgcgagacccgcggatccaggctcagaaagcattccac cttacagaaagacagacaccaaataaactaccaggggagcacattgtacagggccttgta gaccaaaataaggacttggaactggctacgggatgggaagccaaagaagtttgcacaaaa gcacgtcacaatccgattgaacaggctgtcgagagaacaaacttcggcgatggagagctg tcggagaaaagtgttaatgtgtacaaagtggtccgggtgctcagcggggtgtttttggag gacagtgagaagctcagcgggatagccacaggctcaagactgtggatagagaagtga >gi568815576f:35556426_35759532|GENSCAN_predicted_peptide_8|70_aa MHSLLLQPQPPLLQPLQPLTVTGMYRDSVMAGCTQPTPTMPLPLPLAMELALWRVYTEVA TADLPPTEVT >gi568815576f:35556426_35759532|GENSCAN_predicted_CDS_8|213_bp atgcacagcctgctactgcaaccgcagccaccgctgctgcagccgctgcagccgcttaca gtgacgggtatgtacagagacagtgttatggcagggtgtacacagccgacccctaccatg cccttgcccctgccgctagctatggagttggcgctgtggcgagtttataccgaggtggct acagccgatttgccccctactgaagtgacgtga