GENSCAN 1.0    Date run: 6-Nov-116    Time: 18:16:03

Sequence gi568815596f:130629152_130830192 : 201041 bp : 45.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1252   1263   12  0  0   85   83    28 0.544   1.24
 1.02 Intr +   3339   3505  167  2  2   56  115   169 0.870  15.16
 1.03 Intr +   9468   9538   71  0  2   61  110    69 0.978   5.23
 1.04 Intr +  11512  11603   92  2  2   73   94   -21 0.381  -3.29
 1.05 Intr +  16978  17159  182  0  2   85  115   169 0.751  17.97
 1.06 Intr +  25267  25312   46  1  1   59   81    41 0.614  -1.09
 1.07 Intr +  25729  25890  162  0  0   78   53   102 0.912   5.87
 1.08 Term +  27398  28726 1329  1  0   51   43  2053 0.920 188.66
 1.09 PlyA +  29377  29382    6                               1.05

 2.00 Prom +  30058  30097   40                              -3.46
 2.01 Init +  37319  37395   77  1  2   81   49    59 0.150   1.86
 2.02 Intr +  43665  43794  130  0  1   81   92   115 0.987  11.90
 2.03 Intr +  44054  44164  111  1  0   33   92    84 0.229   3.88
 2.04 Term +  52183  52404  222  0  0    2   31   224 0.034   5.02
 2.05 PlyA +  54714  54719    6                               1.05

 3.00 Prom +  55198  55237   40                              -3.06
 3.01 Init +  61085  61172   88  1  1   81  107    56 0.845   7.62
 3.02 Intr +  73103  73277  175  0  1  128   94   137 0.845  17.30
 3.03 Term +  95433  95733  301  0  1   39   38   334 0.821  18.19
 3.04 PlyA +  96010  96015    6                               1.05

 4.00 Prom +  99621  99660   40                              -7.06
 4.01 Sngl + 100001 101044 1044  1  0   94   40  1001 0.993  91.05
 4.02 PlyA + 101568 101573    6                               1.05

 5.00 Prom + 113121 113160   40                              -2.66
 5.01 Init + 119653 119729   77  0  2   43   91    58 0.018   2.16
 5.02 Intr + 132903 135503 2601  0  0  119   -3  2079 0.008 189.26
 5.03 Intr + 137899 138046  148  1  1   88   61    51 0.006   2.64
 5.04 Intr + 147642 147755  114  2  0   52   70   114 0.089   6.64
 5.05 Intr + 168689 168870  182  1  2   13   53   107 0.063  -1.63
 5.06 Intr + 171305 171412  108  2  0   84   98    19 0.336   1.90
 5.07 Term + 178849 178960  112  1  1   56   48   116 0.214   2.43
 5.08 PlyA + 180081 180086    6                               1.05

 6.05 PlyA - 180258 180253    6                               1.05
 6.04 Term - 181477 181407   71  2  2   82   42    73 0.445   0.20
 6.03 Intr - 186383 186311   73  2  1   64   80    41 0.097  -0.12
 6.02 Intr - 193996 193814  183  1  0  113   96    65 0.730   9.78
 6.01 Init - 194860 194810   51  1  0   77   93    64 0.756   4.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  95047  95120   74  0  2   96   48    58 0.893   3.14
S.002 Intr - 127340 127148  193  0  1   78   61   125 0.867   7.35
S.003 Intr - 129552 129456   97  0  1  112   89    30 0.935   5.08
S.004 Sngl + 132922 135507 2586  0  0  104   48  2024 0.960 192.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_1|686_aa
MMKKVEEEMKKHESNNVGLLENLSNGVTAGNGDDGLIPQRKSRTPENQQFPDNESEEYHR
ICELVSDYKEKQMPKYSSENSNPGRKLRSMQGASCNISLAGSYHMLIPKPGTGKLENFMA
IEEMKKHGSTHVGFPENLTNGATAGNGDDGLIPPRKSRTPESQQFPDTENEEYHRSAVAL
DCPECDLNPVGQLQYFMKIKISSSDEQNDTQKQFCEEQNTGILHDEILIHEEKQIEVVEK
MNSELSLSCKKERDFLHENSMLREEIAMLRLELDTMKHQSQLRKKKYLEDIESVKKKNDN
LLKALQLNELTMDDDTAVLVIDNGSGMCKAGFAGDDAPRAVFPSIVGCPRQQGMMGGMHQ
KESYVGKEAQSKRGILTLKYPMEHGIITNWDDMEKIWHHTFYNELRVAPEEHPILLTEAP
LNPKANREKMTQIMFETFNTPAMYVAIQAMLSLYTSGRTTGIVMDSGDGVTHTVPIYDGN
ALPHATLRLDLAGRELTDYLMKILTERGYRFTTMAEREIVRDIKEKLCYVALDFEQEMAM
VASSSSLEKSYELPDGQVITISNEWFRCPEALFQPCFLGMESCGIHETTFNSIMKSDVDI
RKDLYTNTVLSGGTTMYPGMAHRMQKEIAALAPSMMKIRIIAPPKRKYSVWVGGSILASL
STFQQMWISKQEYDESGPSIVHRKCF

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_1|2061_bp
atgatgaagaaggttgaagaagaaatgaagaagcatgaaagtaataatgtgggattacta
gaaaacctgagtaatggtgtcactgctggcaatggtgatgatggattaattcctcaaagg
aagagcagaacacctgaaaatcagcaatttcctgacaacgaaagtgaagagtatcacaga
atttgcgaattagtttctgactacaaagaaaaacagatgccaaaatactcttctgaaaac
agcaacccagggaggaaacttagaagcatgcaaggggcttcctgtaacatttcattggct
gggtcataccacatgctcattcccaaaccaggcactgggaagctagaaaattttatggct
atcgaagaaatgaagaagcacggaagtactcacgtcggattcccagaaaacctgactaat
ggtgccactgctggcaatggtgatgatggattaattcctccaaggaagagcagaacacct
gaaagccagcaatttcctgacactgagaatgaagagtatcacagatcagcggtggcatta
gattgtcctgagtgtgacctgaaccctgttggacagctccagtatttcatgaaaattaaa
atttcttctagtgacgaacaaaatgatactcagaagcaattttgtgaagaacagaacact
ggaatattacacgatgagattctgattcatgaagaaaagcagatagaagtggttgaaaaa
atgaattctgagctttctcttagttgtaagaaagaaagagacttcttgcatgaaaatagt
atgttgcgggaagaaattgccatgctaagactggagctagacacaatgaaacatcagagc
cagctaagaaaaaagaaatatttggaggatattgaaagtgtgaaaaaaaagaatgataat
cttttaaaggctctacaattgaatgagctcaccatggatgatgataccgctgtgctcgtc
attgacaacggctctggcatgtgcaaggccggctttgcgggcgacgatgccccccgggct
gtcttcccttccatcgtggggtgccccaggcagcagggcatgatggggggcatgcatcag
aaagagtcctatgtgggcaaggaggcccagagcaagagaggcatcctgaccctgaagtac
cccatggaacacggcatcatcaccaactgggatgacatggagaagatctggcaccacacc
ttctacaacgagctgcgtgtggcccccgaggagcaccccatcctgctgaccgaggccccc
ctgaaccccaaggccaaccgcgagaagatgacccagatcatgtttgagaccttcaacacc
ccagccatgtacgtggccatccaggccatgctgtccctgtacacctctggccgtactact
ggcatcgtgatggactctggtgacggggtcacccacactgtgcccatctatgatgggaat
gccctcccccatgccaccctgcgcctagacctggctgggcgggaactgactgactacctc
atgaagatcctcaccgagcgtggctataggttcaccaccatggccgagcgggaaatcgtg
cgtgacatcaaagagaagctgtgctatgttgccctggacttcgagcaggagatggccatg
gtggcctccagctcctccctagagaagagctacgagctgcccgatggccaggtcatcacc
atcagcaacgagtggttccgctgccccgaggcgctcttccagccttgcttcctgggcatg
gaatcctgtggcatccatgaaactaccttcaactccatcatgaagtctgatgtggacatc
cgcaaagacctgtacaccaacacagtgctgtctggcggcaccaccatgtaccctggcatg
gcccacagaatgcagaaggagatcgctgccctggcgcctagcatgatgaagatcaggatc
attgctcctcccaagcgcaagtactccgtgtgggtcggtggctccatcctggcctcgctg
tccaccttccagcagatgtggatcagcaagcaggagtatgatgagtcaggcccctccatt
gtccaccgcaaatgcttctag

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_2|179_aa
MHKPQCNNSATNGAIERKRAAGSGAGPWGLAEKNGFLGQLHGPAAVCILRTLLPASLQPQ
LQPWLNDAQIHRPTNILHPQCGKATGIQNSPVHERQLWVLNAAKPQICLCWDFLYYPFLR
GNASAGPVTWCTTSDTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSKVASPGS

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_2|540_bp
atgcacaagcctcagtgcaacaacagtgctacaaatggagccatagagaggaaacgagca
gcaggctcaggagcagggccctggggcctagcagagaagaatggcttcctgggccagctc
catggccctgctgctgtgtgcatcctcaggacactgctgcctgcatccctgcagccccag
ctccagccatggctgaatgatgcacagattcatagacccaccaacatcttgcaccctcag
tgtggaaaagctacaggcattcaaaacagccctgtccatgagaggcagctgtgggtgctg
aacgctgcgaagccacagatctgcctgtgctgggacttcctgtactacccattcctgagg
ggcaatgcttctgcagggcctgtgacttggtgcacaacttcagacaccatcatcttgcag
cagcaccgcaccctcactagccagggtgttgatgacttcctcaaggccaaggccacattc
aaggcttcggacttcattgatgcgcttgtgctgagcaaggtggcttctccgggatcttaa

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_3|187_aa
MVVQQESRNLTIAILRPCIVRSNVAPAFPGFQVLATFEIPIPFARALRRPYADFTTSNFT
TQYWNAISQQAPAIICDFYLWLIGRKPSDDGVDDSDGDDDNDGGDGADDSDDGVDDNDGN
EVLMIVMMMMMLTVMMVLMIVMMIIVMDGADEDSDDNDGGNNKSGGDDMAATVWTCFLKF
MYWKLNP

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_3|564_bp
atggtggtgcagcaggagagcaggaacctaaccattgccatcctaaggccctgcattgtg
cggagcaacgtggcaccagcttttcctggattccaggtcttggcaacctttgaaattcca
attccatttgcaagagctttgaggaggccatatgctgatttcaccacaagcaacttcaca
acccagtactggaatgccatcagccagcaggcccctgccattatctgtgacttctatctg
tggctcattggaaggaaacccagtgatgatggagttgatgatagtgatggtgatgatgat
aatgatggtggtgatggtgctgatgatagtgatgatggtgttgatgataatgatggtaat
gaagtgctaatgatagtgatgatgatgatgatgttgacagtgatgatggtgctgatgata
gtgatgatgataatagtgatggatggtgctgatgaagatagtgatgataatgatggtggt
aataacaaaagtggtggtgatgatatggctgctacggtttggacgtgtttcttaaagttc
atgtattggaaacttaatccctaa

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_4|347_aa
MGDELAPCPVGTTAWPALIQLISKTPCMPQAASNTSLGLGDLRVPSSMLYWLFLPSSLLA
AATLAVSPLLLVTILRNQRLRQEPHYLLPANILLSDLAYILLHMLISSSSLGGWELGRMA
CGILTDAVFAACTSTILSFTAIVLHTYLAVIHPLRYLSFMSHGAAWKAVALIWLVACCFP
TFLIWLSKWQDAQLEEQGASYILPPSMGTQPGCGLLVIVTYTSILCVLFLCTALIANCFW
RIYAEAKTSGIWGQGYSRARGTLLIHSVLITLYVSTGVVFSLDMVLTRYHHIDSGTHTWL
LAANSEVLMMLPRAMLTYLYLLRYRQLLGMVRGHLPSRRHQAIFTIS

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_4|1044_bp
atgggggatgagctggcaccttgccctgtgggcactacagcttggccggccctgatccag
ctcatcagcaagacaccctgcatgccccaagcagccagcaacacttccttgggcctgggg
gacctcagggtgcccagctccatgctgtactggcttttccttccctcaagcctgctggct
gcagccacactggctgtcagccccctgctgctggtgaccatcctgcggaaccaacggctg
cgacaggagccccactacctgctcccggctaacatcctgctctcagacctggcctacatt
ctcctccacatgctcatctcctccagcagcctgggtggctgggagctgggccgcatggcc
tgtggcattctcactgatgctgtcttcgccgcctgcaccagcaccatcctgtccttcacc
gccattgtgctgcacacctacctggcagtcatccatccactgcgctacctctccttcatg
tcccatggggctgcctggaaggcagtggccctcatctggctggtggcctgctgcttcccc
acattccttatttggctcagcaagtggcaggatgcccagctggaggagcaaggagcttca
tacatcctaccaccaagcatgggcacccagccgggatgtggcctcctggtcattgttacc
tacacctccattctgtgcgttctgttcctctgcacagctctcattgccaactgtttctgg
aggatctatgcagaggccaagacttcaggcatctgggggcagggctattcccgggccagg
ggcaccctgctgatccactcagtgctgatcacattgtacgtgagcacaggggtggtgttc
tccctggacatggtgctgaccaggtaccaccacattgactctgggactcacacatggctc
ctggcagctaacagtgaggtactcatgatgcttccccgtgccatgctcacatacctgtac
ctgctccgctaccggcagctgttgggcatggtccggggccacctcccatccaggaggcac
caggccatctttaccatttcctag

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_5|1113_aa
MLPGKLEATERKDQPFTKDQQITDDSSWGTGSMELKRGKTFIKSSLQVSHEKPPDPAAVA
AAREGTGPWSVLPGGQQRPHSEKGPQASPSAQEYDRCPNKGAQLDPKGGPAALCGATFKP
VRKCKTHDSMSGAGRATAATGQLVGSASFPGSPGSRRMIDYRHFVPQMPFVPAVAKSIPR
KRISLKRPKKCFRNLFHIRRNKTEDLASLAAEGKSLPSPGDPSDPGGRRSKAFLPPGEGP
GLDGLCQDLLDSELLADASFGLCRALCEDVASLQSFDSLTGCGEVFADESSVPSLELNEG
PESPTQAAQGLESKVPRGPLQGSVEQLASPAQNEASDFTRFWDSVNRSVRQQQRALLGPW
LSGPQGTDRDQSRLDTAGLAELPLCPCRDPRSGSKASSIDTGTPKSEQPESVSTSDEGYY
DSFSPGLEEDKKEAESPGTPAATFPRDSYSGDALYELFHDPSEGPLGPSPDDDLCVSESL
SGPALGTPLSICSFRVGAEENLAPAPGPDLLSQGFLQSSWKGKECLLKLCDTELAITMGI
VSWLRRGPTPRAPPTPGQPAAPPGSQGAPRAPTEKLGGREGLASDAGGATVCSAPSRQEL
WAHPGTTGLLAGESKALGGATQGTGTLSRDASREEETRGHSEGLFSSMESAATSTTDTSG
KNKAPVPSTWPCSQKEPGPPGVLGCFRGPWRPGHGGDTLDAEPMLAGCVARVAALKISSN
EQPPAAWPPRQDMGSGLFGQRWARGPDMLEQKQSSSSPSMTTIHGLPYSASTQDQRCRDR
VQDLSWLRVEPTGLGVQAWASVEDQPLQLSTEAVEQVAHGSQLDSEPRSAPAARWSSQGH
HPESLGLTLNSQQEGGVSASAPECRCSLLAREGLLCGQPEVGASGPAMAEPHLKTSNGAV
DLTPQNVTTHVQHPDIQNHEGRRWLVSVGGTGVRPGPVSGQQTPTEHSITWGEETPVEYL
KNPENYILGTKTVFTGINKKSEIRFPRGWEDQARAQDWLRTTGPAVGSPGTRHGSLLPAK
ATQGAFIPQAEAAERLGSGRQPYSGNHSHSIGKLNAMTATDLRVGGAKRGTQPPKTSAYA
ATVLRMDRELHLKLNSNYCKKSKALVETECDGA

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_5|3342_bp
atgttaccagggaagttggaagcaacagaaagaaaagatcaaccatttacaaaagatcag
caaataaccgatgacagcagctggggcactggcagcatggagctgaagagaggaaagacc
ttcatcaagtccagcctgcaggtttcccacgagaaacccccagacccagcagccgtggct
gcagccagggaggggacaggcccctggtcagtccttccaggagggcaacagaggccccac
agtgagaagggcccccaagccagccccagtgcccaagaatacgacagatgccccaacaaa
ggggcgcagctggaccccaaagggggacccgcagccctctgtggagccaccttcaaaccg
gtgcgaaagtgcaagactcacgacagcatgtctggggcaggcagggccacggctgccaca
gggcagctggtgggcagtgcaagcttcccgggctccccgggcagccggcgcatgatcgac
taccgccactttgtgccccagatgccctttgtgccagctgtggccaagagcatcccgagg
aagaggatttccctgaagaggcccaagaagtgctttcggaacctattccacattcggaga
aacaagactgaggacttggcctcgctggcggccgaggggaaaagcctgccctccccaggg
gacccgtcagaccctggggggcggcgaagcaaagccttcctccccccgggtgaggggccg
gggctggacggcctgtgccaggacctgttggacagcgagctcctggccgatgcatccttt
ggtctctgcagggccctgtgtgaggacgtggcctcactccagagcttcgactcgctcacg
ggttgtggggaggtgttcgcagatgagagctcggtgccatctctggagctgaacgagggc
ccggagagcccaacccaggctgctcagggcctggagagcaaggttcccaggggccctctc
cagggcagtgtggagcagctggcctcgcccgcccagaatgaagcctctgacttcaccagg
ttctgggacagtgtgaatcgctcagtgcgtcagcagcagcgtgccctcctaggcccgtgg
ctttcaggcccccaggggacagacagggaccaatcccggctggacacagctgggctcgct
gagctgcccctctgcccctgcagggaccctcgcagcggctccaaagccagctccatcgac
acaggcacccccaagagcgagcagcccgaatccgtgtccacaagtgacgagggctactat
gattccttctcgccaggacttgaggaggacaagaaggaagctgagagcccaggcactcct
gccgccaccttcccacgggacagctacagtggggacgccctctacgagctcttccacgac
cccagcgagggtcctcttggccccagcccagatgatgacctgtgcgtgtctgagagtctg
tcagggccggccctggggacgccactgtccatatgcagcttccgagtgggggccgaggag
aacttggccccagcaccaggccctgacctgctcagccagggcttcctacagagctcctgg
aagggcaaggagtgcctgctgaagctgtgtgacactgagctcgccatcaccatgggcatc
gtcagctggctgcgccgaggccccacgccccgtgccccacccacccctgggcagcctgca
gctccacctggttcccagggagcccctagggcacccacagagaagctggggggcagggag
ggcctggcctcagatgcagggggggcgacagtttgctcagcacccagcaggcaggagctg
tgggcacacccgggcaccacaggcctgctcgccggagagagcaaggccctcggaggggcc
acacaagggactggcacactgtccagggatgcctctcgagaggaagagacacgaggtcac
tctgaaggcttgttctcctctatggagtctgcagccacttcgacaacagatacttccggt
aaaaataaggccccagttccttctacctggccctgctcccagaaggagcctgggccacca
ggggtcctggggtgtttccgaggcccctggaggccaggtcacggaggtgacactctggat
gcagagcccatgctggcaggctgtgtggcccgtgtggcagccctgaagatcagctcaaac
gaacagcccccggccgcatggcctccaaggcaagacatgggcagtgggctctttgggcag
cgctgggccaggggccctgacatgctggagcagaaacagtccagcagctcccccagcatg
accaccatccatggcctaccctactcagccagcacacaggaccagaggtgtcgagatcgt
gtccaggacctgagctggctcagggtggagcccaccgggctaggtgtccaggcctgggcc
tctgtggaggaccagcccttgcagctcagcacagaggctgtggagcaggtggcacacggc
agccagctggactctgagccccgctcagcccctgctgcccggtggagttcccagggccac
catccagaaagcctgggcctcactttgaacagccagcaggaagggggggtctctgcaagt
gccccagaatgccgctgcagcctcctggcccgtgagggcctcctctgtggccagccagaa
gtgggggcctctgggccagccatggctgagccccatctgaaaacttcgaatggggctgtg
gacctcactccccaaaatgtcacaacacatgtgcagcaccccgacattcagaaccatgaa
gggagaaggtggctggtgtctgtaggagggacaggtgtccgtcctgggccagtgtcaggc
cagcagacgccaacagaacacagcatcacctggggagaggagacaccggtggagtatttg
aagaatccagagaattacatccttggaacaaaaacagtcttcactggcattaacaagaag
agcgagatccgctttcctagaggctgggaggaccaggcccgggcacaggactggcttcgc
accactggccccgcagtcgggtccccggggacgcgccacgggtcactgctgcctgctaaa
gcgacacaaggcgccttcattcctcaggccgaggcggccgaacgccttggctctggccga
caaccttacagtgggaaccacagtcactcaattggaaaattaaatgcaatgacagcaact
gatctcagggtgggaggagccaagaggggaactcaaccacctaagacaagcgcatatgcg
gccactgtcttacgcatggaccgagagctgcatttaaaactcaactcaaactattgcaag
aaaagtaaggctttggtggaaactgaatgtgatggtgcctga

>gi568815596f:130629152_130830192|GENSCAN_predicted_peptide_6|125_aa
MLGRRGLQGQGQPVVVTVQQCLTGQTPKNTVQPGGHIFISLDRQVGQAHPELRPEGAGGR
RKLPWGGAEAAPQSNTAQQAQPISTHQALGIGSPARWTPPNSEHCLGGSRVATAGDKGGE
LLAAQ

>gi568815596f:130629152_130830192|GENSCAN_predicted_CDS_6|378_bp
atgctgggcaggcggggactgcagggccagggccagcctgtggttgtgactgtccagcag
tgcctcacggggcagacacccaagaacaccgtccagccgggaggacacatcttcatctcc
ttggataggcaggtgggccaggcccaccctgagctgaggcctgaaggtgccggaggaaga
agaaagctgccctggggtggagcggaggctgctcctcagagcaacacagcccagcaggct
cagcccatctccacacaccaggccctgggcataggctccccagcccgctggacaccaccc
aactcagagcactgcctcgggggctcccgtgtggcaacagcgggggacaaaggtggggag
ctcctggcggcgcagtga