GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:58:52 Sequence gi568815586f:4944148_5145986 : 201839 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12116 12184 69 1 0 63 87 46 0.394 3.07 1.02 Term + 20365 20448 84 0 0 97 37 101 0.600 3.55 1.03 PlyA + 22011 22016 6 1.05 2.00 Prom + 28338 28377 40 -3.26 2.01 Init + 32955 33265 311 2 2 61 66 151 0.407 7.25 2.02 Intr + 34997 35111 115 2 1 98 69 41 0.520 3.65 2.03 Term + 38549 38602 54 1 0 78 43 90 0.527 1.06 2.04 PlyA + 39289 39294 6 1.05 3.09 PlyA - 40132 40127 6 1.05 3.08 Term - 40888 40770 119 2 2 132 55 77 0.938 7.50 3.07 Intr - 44592 44442 151 0 1 75 80 81 0.481 5.74 3.06 Intr - 55245 55082 164 1 2 78 77 15 0.130 -0.91 3.05 Intr - 56772 56539 234 1 0 53 60 106 0.122 1.96 3.04 Intr - 58819 58715 105 2 0 87 94 1 0.044 0.79 3.03 Intr - 75353 75180 174 0 0 109 44 37 0.260 1.31 3.02 Intr - 77593 77486 108 2 0 101 89 53 0.674 6.96 3.01 Init - 78157 78073 85 1 1 52 85 47 0.230 1.78 3.00 Prom - 87715 87676 40 -3.76 4.02 PlyA - 88282 88277 6 1.05 4.01 Sngl - 88974 88402 573 0 0 21 42 598 0.502 44.67 4.00 Prom - 93769 93730 40 -4.06 5.00 Prom + 96373 96412 40 -3.26 5.01 Sngl + 100001 101842 1842 1 0 105 48 3122 0.995 304.04 5.02 PlyA + 102616 102621 6 1.05 6.00 Prom + 114322 114361 40 -4.06 6.01 Init + 119368 119430 63 0 0 27 89 69 0.044 2.05 6.02 Term + 137065 137172 108 0 0 85 55 101 0.540 5.01 6.03 PlyA + 137980 137985 6 1.05 7.04 PlyA - 138876 138871 6 1.05 7.03 Term - 144869 144301 569 0 2 29 42 182 0.061 2.28 7.02 Intr - 155247 155109 139 0 1 46 28 197 0.242 9.54 7.01 Init - 158819 158745 75 2 0 69 66 16 0.306 -1.35 7.00 Prom - 165952 165913 40 -1.36 8.05 PlyA - 167147 167142 6 1.05 8.04 Term - 171453 171333 121 2 1 83 46 52 0.420 -1.55 8.03 Intr - 173808 172938 871 1 1 83 53 271 0.126 13.50 8.02 Intr - 175849 174449 1401 2 0 42 40 437 0.073 23.51 8.01 Init - 177029 176678 352 2 1 71 58 162 0.088 8.82 8.00 Prom - 183498 183459 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_1|50_aa MYDSLFHRELSRGEHGGCLLTRKDLCRIIVSKSPLNCCLFRDTYSDHSLK >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_1|153_bp atgtatgactcactcttccaccgggaattaagcagaggagagcatggtggctgcctgctg acaagaaaggatctttgccgcatcattgtatccaagtcaccgctcaactgctgcctcttc agagacacctactctgaccactctctaaagtag >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_2|159_aa MPFACMPAGVVVPSCSPVLFLSMVLSTGGGPIIQDCGGVEENTREPSEWWHRHLTYLENC LETGDSLRALLPGPIIAGSQAACRKGEYTTALHGLGEDDFSTSCTGVREEEVEAAVLGSG FIPCDSSGLPQVLLSTYTTGHQELAYSAIDLTINTLMET >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_2|480_bp atgccatttgcctgcatgcctgctggtgtggtggtgccgtcctgttctcccgtgctgttc ctcagcatggttctctccacgggtggaggacccatcattcaggattgtggtggcgtagaa gaaaacacacgggagccttctgagtggtggcaccggcatcttacataccttgaaaactgc ctggaaacaggtgattcattacgggctctgcttccaggcccgatcatagcaggatcccag gcagcctgcagaaagggagagtacactactgctcttcatggtctgggggaggatgacttc agtacctcatgcacaggggtgcgagaggaggaagtggaggctgcagtgctgggctctggg ttcattccctgtgattcctctgggctgccccaagtcctgctcagcacatacacaacagga caccaggaacttgcctattctgccattgacctcaccatcaacactttgatggagacctaa >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_3|379_aa MLLAQGSYFESQDMSPSPTSSDSTSISAVHWLAEVAAVSQQRGDHDLRGKGKRKRPSHKA CCHHDCEPLEGRGALSNRDSAGQTQPGTKHIFDIHLLNDCLQGLKGIPDLGIRKPGRDEQ SPVGTPKLRAPSALRVLTQATSPFGKTGLHPAPQNEEGEPEDSLTLKEAATAPAEGKREE HVNAKRTAEPAAAVATRSEWEVQATRGQQLVLLCPQRLWSLNESGSSPWVGGRNPGCSDC TMDMRVNACQAISWCFPHIACVHRYIPVRHMAREATCASSPERDLQTTLHLSGGLAELMS GINQQGGKSVSSAAPLPPRNHLLKLLSSIHLPEAPVMQAAGALLPAIICHCKQPVQQCAA VSTSSERRHLLPEDTFRYA >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_3|1140_bp atgctgctggcccaaggatcatactttgagtcacaggacatgagcccatcaccaacatcc agtgactcaacatccatatcagctgtccactggctggcagaagtagctgcagttagccag cagcggggtgaccatgacctgagaggaaaaggaaagaggaaaagaccaagtcataaagcc tgctgccaccatgactgtgagcctctggagggcaggggggccttatccaacagggattct gcaggccaaacacagcctggcacgaagcatatatttgatatacacctgctgaatgactgt ttacaaggtctgaaaggaataccagatttggggatcagaaaaccaggcagagatgaacag agcccagtggggacgcccaaactaagggctcccagcgctctgcgtgttctgacccaggca acgtcaccttttggaaagacaggcttgcaccctgctccgcaaaatgaggaaggggagcca gaggactctctgaccctgaaagaggcggccacagcccctgctgaggggaagagggaagag catgtgaatgcaaagaggactgcagagcctgctgcagctgtagccactcgctcggagtgg gaagtccaggccacccgagggcagcagcttgtcctcctctgccctcagaggctgtggtca ttgaatgagtctggatcttctccctgggttgggggaaggaatccaggctgcagtgactgc accatggatatgagagtgaatgcttgtcaggccatcagttggtgttttccccacatcgca tgtgtgcacaggtacatacctgtgaggcacatggccagggaggccacctgtgcttcttcc ccagagagagaccttcagaccaccctccatctgagcggaggactggctgagctgatgagt ggtataaatcaacaaggtggcaaatcggtgagttctgctgctcctttgcccccaaggaac cacttgttaaagctcctgagcagcatccacctccctgaagcacctgtcatgcaggctgcg ggggccttactgcctgccatcatctgccactgcaaacaacctgttcagcagtgtgcagct gtgagcacatcctctgagagaaggcatctgctacccgaagataccttcaggtatgcctga >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_4|190_aa MLELQGPRECIKKGKWEPEPQPVLALVLELRLGPLPGLGLAWLEEHLLVPPVWPSCGERK PGRTMAVMFTPLTVKYVYCDTERIGVDLIVKTCFSPNRVIGLSGDLQQVGGASARIRDAL SRVLQYAEDVLSGKVSVGRFLVNLVNQVPKIVPDDFEIMLNGNINDLLMVTYLANLTQSQ IALSEKLVNL >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_4|573_bp atgctggaactccagggccccagggaatgcatcaagaagggaaagtgggaaccagagccg cagccggtgctggcattggtgctggaactgaggctgggaccgctgccggggctggggctg gcgtggctggaggagcacttgctggtaccgccggtgtggccatcttgtggagaaagaaag cctgggaggaccatggcagtgatgttcacacctctgacagtgaaatatgtatactgtgac actgaacgcattggagttgacctgatcgtgaagacctgctttagccccaacagagtgatt ggactctcaggtgacttgcagcaagtaggaggggcatccgctcgcatccgggatgccttg agcagagtgttgcaatatgcagaggatgtactctctggaaaggtgtcggttggccgcttc ctggtgaacctggttaaccaagtacccaaaatagttcccgatgacttcgagatcatgctc aacggtaacatcaacgacctgttgatggtgacctacctggccaacctcacacagtcacag attgccctcagtgaaaaacttgtaaacctgtga >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_5|613_aa MEIALVPLENGGAMTVRGGDEARAGCGQATGGELQCPPTAGLSDGPKEPAPKGRGAQRDA DSGVRPLPPLPDPGVRPLPPLPEELPRPRRPPPEDEEEEGDPGLGTVEDQALGTASLHHQ RVHINISGLRFETQLGTLAQFPNTLLGDPAKRLRYFDPLRNEYFFDRNRPSFDGILYYYQ SGGRLRRPVNVSLDVFADEIRFYQLGDEAMERFREDEGFIKEEEKPLPRNEFQRQVWLIF EYPESSGSARAIAIVSVLVILISIITFCLETLPEFRDERELLRHPPAPHQPPAPAPGANG SGVMAPPSGPTVAPLLPRTLADPFFIVETTCVIWFTFELLVRFFACPSKAGFSRNIMNII DVVAIFPYFITLGTELAEQQPGGGGGGQNGQQAMSLAILRVIRLVRVFRIFKLSRHSKGL QILGKTLQASMRELGLLIFFLFIGVILFSSAVYFAEADNQGTHFSSIPDAFWWAVVTMTT VGYGDMRPITVGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETDHEEPAVLKEEQG TQSQGPGLDRGVQRKVSGSRGSFCKAGGTLENADSARRGSCPLEKCNVKAKSNVDLRRSL YALCLDTSRETDL >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_5|1842_bp atggagatcgccctggtgcccctggagaacggcggtgccatgaccgtcagaggaggcgat gaggcccgggcaggctgcggccaggccacagggggagagctccagtgtcccccgacggct gggctcagcgatgggcccaaggagccggcgccaaaggggcgcggcgcgcagagagacgcg gactcgggagtgcggcccttgcctccgctgccggacccgggagtgcggcccttgcctccg ctgccagaggagctgccacggcctcgacggccgcctcccgaggacgaggaggaagaaggc gatcccggcctgggcacggtggaggaccaggctctgggcacggcgtccctgcaccaccag cgcgtccacatcaacatctccgggctgcgctttgagacgcagctgggcaccctggcgcag ttccccaacacactcctgggggaccccgccaagcgcctgcgctacttcgaccccctgagg aacgagtacttcttcgaccgcaaccggcccagcttcgacggtatcctctactactaccag tccgggggccgcctgcggaggccggtcaacgtctccctggacgtgttcgcggacgagata cgcttctaccagctgggggacgaggccatggagcgcttccgcgaggatgagggcttcatt aaagaagaggagaagcccctgccccgcaacgagttccagcgccaggtgtggcttatcttc gagtatccggagagctctgggtccgcgcgggccatcgccatcgtctcggtcttggttatc ctcatctccatcatcaccttctgcttggagaccctgcctgagttcagggatgaacgtgag ctgctccgccaccctccggcgccccaccagcctcccgcgcccgcccctggggccaacggc agcggggtcatggccccgccctctggccctacggtggcaccgctcctgcccaggaccctg gccgaccccttcttcatcgtggagaccacgtgcgtcatctggttcaccttcgagctgctc gtgcgcttcttcgcctgccccagcaaggcagggttctcccggaacatcatgaacatcatc gatgtggtggccatcttcccctacttcatcaccctgggcaccgaactggcagagcagcag ccagggggtggaggaggcggccagaatgggcagcaggccatgtccctggccatcctccga gtcatccgcctggtccgggtgttccgcatcttcaagctctcccgccactccaaggggctg cagatcctgggcaagaccttgcaggcctccatgagggagctggggctgctcatcttcttc ctcttcatcggggtcatcctcttctccagtgccgtctacttcgcagaggctgacaaccag ggaacccatttctctagcatccctgacgccttctggtgggcagtggtcaccatgaccact gtgggctacggggacatgaggcccatcactgttgggggcaagatcgtgggctcgctgtgt gccatcgccggggtcctcaccattgccctgcctgtgcccgtcatcgtctccaacttcaac tacttctaccaccgggaaacggatcacgaggagccggcagtccttaaggaagagcagggc actcagagccaggggccggggctggacagaggagtccagcggaaggtcagcgggagcagg ggatccttctgcaaggctggggggaccctggagaatgcagacagtgcccgaaggggcagc tgccccctagagaagtgtaacgtcaaggccaagagcaacgtggacttgcggaggtccctt tatgccctctgcctggacaccagccgggaaacagatttgtga >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_6|56_aa MCEKALKGMSMVPESDGLPSQFPIGWRSARMDSDRLVTQQGGGKLWMRILGFPEDA >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_6|171_bp atgtgtgaaaaagcactgaaggggatgtctatggtgcctgaaagtgatggtttaccttct cagttccccattggctggcggagtgctcggatggacagtgaccgtctggtgacccagcag ggcggagggaagctttggatgagaatcctcggcttccctgaggacgcctga >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_7|260_aa MGHLILFSIGPALGTGTQTFTSLCVMCIADALRSPPLSPPCAEQYCEVQPGICRADPTSV VDSGQLDAIWRFTTLVQKRGFETLRPTLLAASTTSVHGAGRWGTNIPAGPWRMVGKGPVD FLPQSTFTPQRGRQESRLKENVHLLPPCFVALNMRLESMTRPAFAAAAGSQGKADWVLCP ATSKGGIWMVLEDGRVTLCGSLRPGPVDQEARHCQPQDGDAWSPGDGHWLSLPRQATSQI QPQVCGGGTHLNEGTTLLCK >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_7|783_bp atggggcatctcatcctcttctccatcggaccagccttaggaacaggaactcagactttc acatctttgtgtgtgatgtgcatcgccgatgcacttcgcagccctccgctgtcgccccct tgtgcggagcaatactgcgaggtgcagccaggaatctgcagagcagaccctaccagtgta gtggacagtggacagctggatgccatctggaggttcaccaccctggtgcagaagcgcggc tttgagacactgcgccccaccctgctagcagccagtaccacgtcagtccacggggctggg aggtggggcacaaacatccctgcgggcccctggaggatggttggaaaaggccctgtggac ttcctgcctcagagcacgttcactccgcagagaggcagacaagaaagccgtttaaaagaa aatgtgcatctcctgcctccttgtttcgtggccttgaacatgcgtttggaaagcatgacg aggccggcgtttgcagccgctgcgggaagtcaaggaaaggctgactgggtgctctgccca gccacttccaagggtggaatctggatggttttggaggatgggagagttaccctgtgtggc tcactcaggcctggtcctgtggaccaggaggccaggcactgccagccgcaggatggggac gcttggagccccggcgacggccactggctctccctgccacgtcaggccacatcacagatc cagccacaggtgtgtggaggagggactcacctgaacgagggaacaactctcctgtgcaag tga >gi568815586f:4944148_5145986|GENSCAN_predicted_peptide_8|914_aa MEGEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKDRS TRQKVNKDTQELNSALHQADLIDIYRTLHLKSTEYTFFSAPHHTYSKIDHILGSKALLSK CKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNKMKAEIKMFFET NENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASR RQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNLIDAIKND KGDITTDPTEIQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNEEEVESLNRPITGA EIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSCYEASIIL IPKPGRDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNI RKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLELEETTVKFIWNQKRA HIAKSILSQKNKAGGITLRDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITLHIYNY LIFDKPEKNKQWGTDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTI KTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPT KWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSEEDIYAAKKHMKK CSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRHAPFSIHTHIMFGSLYLQIQKDLSI LGFWYPRGILEPIY >gi568815586f:4944148_5145986|GENSCAN_predicted_CDS_8|2745_bp atggaaggtgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagacagatca acgagacagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcggac ctaatagacatctacagaactctccacctcaaatcaacagaatatacatttttttcagca ccacaccacacttattccaaaattgaccacatacttggaagtaaagctctcctcagcaaa tgtaaaagaacagaaattataacaaactatctctcagaccacagtgcaatcaaactggaa ctcaggattaagaatctcactcaaaaccgctcaactacatggaaactgaacaacctgctc ctgaatgactactgggtacataacaaaatgaaggcagaaataaaaatgttctttgaaacc aacgagaacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtaga gggaaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacacc ctaacatcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcaga aggcaagaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaaccctt caaaaaattaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccg ctagcaagactaataaagaaaaaaagagagaagaatctaatagacgcaataaaaaatgat aaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaa cacctctacacaaataaactagaaaatctagaagaaatggataaattcctcgacacatac actctcccaagactaaacgaggaagaagttgaatctctgaatagaccaataacaggagct gaaattgtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattc acagctgaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaactcatgttatgaggccagcatcattctg ataccaaagccaggcagagacacaacaaaaaaagagaattttagaccaatatccttgatg aacattgatgcaaaaatcctcaataaaatactggcaaaacgaatccagcagcacatcaaa aagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatata cgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgatt atctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaact ctcaataaattagaattggaagaaactactgtaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacgtgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgctgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaacggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag gacatgaacagacacttctcagaagaagacatttatgcagccaaaaaacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacaggcatgcacccttttcc atacacacccacattatgttcggctctctatatctacagatacagaaggacttgagcatc cttggattttggtatccacggggcatcctggaaccaatctactga