GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:10:40

Sequence gi568815597r:75818955_76032291 : 213337 bp : 37.36% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    177    172    6                               1.05
 1.03 Term -   1819   1363  457  0  1   62   47   208 0.463   7.71
 1.02 Intr -   7336   7105  232  2  1   45   72   133 0.002   3.31
 1.01 Init -  18867  18837   31  0  1   94  116    17 0.038   5.05
 1.00 Prom -  22671  22632   40                              -2.35

 2.03 PlyA -  22997  22992    6                               1.05
 2.02 Term -  23409  23297  113  1  2  127   41    37 0.839   0.64
 2.01 Init -  24316  24205  112  1  1   78   82    91 0.766   7.95
 2.00 Prom -  36061  36022   40                              -5.15

 3.05 PlyA -  36453  36448    6                               1.05
 3.04 Term -  39905  39288  618  2  0  -40   42   280 0.204   4.05
 3.03 Intr -  42370  42212  159  1  0   68   72   136 0.055   9.26
 3.02 Intr -  51131  51003  129  2  0   86   57   110 0.098   7.67
 3.01 Init -  52270  52268    3  1  0   75  101     0 0.296   0.05
 3.00 Prom -  53468  53429   40                              -6.85

 4.00 Prom +  54826  54865   40                              -2.55
 4.01 Init +  56310  56392   83  2  2   75   53   117 0.748   7.39
 4.02 Intr +  59195  59364  170  2  2   54   91    81 0.923   3.67
 4.03 Intr +  64667  64867  201  0  0   55  111    89 0.808   6.24
 4.04 Intr +  70297  70415  119  2  2   97   65    75 0.806   5.36
 4.05 Term +  93742  93933  192  0  0   53   37   160 0.504   3.84
 4.06 PlyA +  94259  94264    6                               1.05

 5.12 PlyA -  94354  94349    6                               1.05
 5.11 Term - 102347 102309   39  2  0   99   47     6 0.266  -6.19
 5.10 Intr - 103405 103126  280  0  1   75  116   108 0.042   8.86
 5.09 Intr - 113132 112937  196  0  1   16   75    80 0.008  -2.95
 5.08 Intr - 124146 123852  295  0  1   12   86   178 0.015   5.66
 5.07 Intr - 130434 130204  231  0  0   74   93   149 0.065  11.05
 5.06 Intr - 144996 144872  125  1  2   88   38    53 0.022  -0.32
 5.05 Intr - 155102 155024   79  2  1   72   68    84 0.063   3.01
 5.04 Intr - 175181 175111   71  0  2  125   61    45 0.254   3.48
 5.03 Intr - 175723 175654   70  1  1   64  100    16 0.394  -1.76
 5.02 Intr - 190661 190566   96  2  0   79  110    58 0.127   6.39
 5.01 Init - 202288 202229   60  1  0   83  115    56 0.822   9.10

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 103224 103126   99  0  0   54  116    99 0.889   9.61
S.002 Term - 130434 130162  273  0  0   74   43   207 0.921   9.29
S.003 Init - 135448 135386   63  1  0   50   91    53 0.847   3.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:75818955_76032291|GENSCAN_predicted_peptide_1|239_aa
MADLMTDWKEEIQTNIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNQEEVESVTRPITS
SEIEGVINSPPTKKSSGPDGFTAEFYQRSWFFEKINKIDRPLARLIKKKREKNQIDAIKN
DKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLETYTLPRLNQEEVESLNRPITG
SEIEAIINSLPTKKCPGPDGFTAKFYQRYKEELVPFLLKLFQSTEKEGILPNSFYEDIF

>gi568815597r:75818955_76032291|GENSCAN_predicted_CDS_1|720_bp
atggcagatttgatgactgattggaaggaagaaatacaaactaacatcagagaatactat
aaacacctctacacaaataaactagaaaatctagaagaaatggataaattcctggacaca
tacaccctcccaagactaaaccaggaagaagttgaatctgtgactagaccaataacaagt
tctgaaattgagggagtaattaatagcccaccaaccaaaaaaagctcaggaccagatgga
ttcacagccgaattctaccagaggagctggttttttgaaaagataaacaaaattgataga
ccactagcaagactaataaagaagaaaagagagaagaatcaaatagatgcaataaaaaat
gataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaatactat
aaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctggagaca
tacaccctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggc
tctgaaattgaggcaataattaatagcttaccaaccaaaaaatgtccaggaccagatgga
ttcacagccaaattctaccagagatacaaggaggagctggtaccattccttctgaaacta
ttccaatcaacagaaaaagagggaatcctccctaactcattttatgaggatatattttga

>gi568815597r:75818955_76032291|GENSCAN_predicted_peptide_2|74_aa
MTNARRPVVAPNVWLRCYLLDTKQKGQGKECESSPMIDPEDNQLRARHPFRSSEKHFTTC
YLSEVCYLRDSSTQ

>gi568815597r:75818955_76032291|GENSCAN_predicted_CDS_2|225_bp
atgaccaatgcacggagaccggtagtggccccgaatgtctggctgcgctgttatttattg
gatacaaagcaaaaggggcagggtaaagagtgtgagtcatctccaatgatagatccagaa
gataatcaactaagagccaggcatcctttcaggtccagtgagaaacattttacaacctgc
tatctctctgaagtctgctatctgagagattcctctacacaataa

>gi568815597r:75818955_76032291|GENSCAN_predicted_peptide_3|302_aa
MVTLMQEVNSHSLEQLHPCAFAGYSPLNPAAAFKAGVECLWLFQEHNSSPAREQNWMQNE
FDDLTEVGFRRSVITNFSKLKEQVLTHQKEAKNLEKKVQRGAGTIPSEIFRSIEKEGILF
NSFYEASIILIPNPGRDTRKKENFRLTSLMNINVRIRNKILANRIQQHIKKLIHHNQVGF
IPGMRVWFNIHKINKHNPSQKQNQRQKPHDYLNRCRKGLQKIQQHFMLKTLNKVGINGMY
LNIIRAINDKSTANIILNGQKLEAFPLKTGTRQGCPFSPLLFNIVLEVLARAIRQEKEIK
GI

>gi568815597r:75818955_76032291|GENSCAN_predicted_CDS_3|909_bp
atggtcacactgatgcaagaagtgaactcccatagccttgagcagctccacccttgtgcc
tttgcagggtacagccccctgaaccctgcagctgcttttaaggctggtgttgagtgtctg
tggcttttccaggaacacaactcctcaccagcaagggaacaaaactggatgcagaatgag
tttgacgacttgacagaagtaggcttcagaaggtcggtaataacaaacttctccaagcta
aaggagcaggttctaacccatcaaaaggaagctaaaaaccttgaaaaaaaggtacaaaga
ggagctgggaccattccttctgaaatattccgatcaatagaaaaagagggaattctcttt
aactcattttatgaggccagcatcatcctgataccaaatcctggcagagacacaagaaaa
aaagagaattttaggctaacatccctgatgaacatcaatgtgagaatccgcaataaaata
ctggcaaaccgaatccagcagcacatcaaaaagcttatccaccacaatcaagtcggcttc
atccctgggatgcgagtctggttcaacatacacaaaatcaataaacataatccatcacaa
aaacagaaccaacgacaaaaaccacatgattatctgaatagatgcagaaaaggccttcag
aaaattcaacagcatttcatgctaaaaactctcaataaagtgggtatcaatggaatgtat
ctcaacataataagagctattaatgacaaatctacagccaatatcatactgaatgggcaa
aaactggaagcattccctttgaaaactggcacaagacaaggatgccctttctcaccactc
ctattcaacatagtattggaagttctggccagggcaatcaggcaagagaaagaaataaag
ggtatttga

>gi568815597r:75818955_76032291|GENSCAN_predicted_peptide_4|254_aa
MVLIQGDHDSNDDEDDVVTLQKKRLQIWFGIILEKIKTVINDDARYMKGCLNMRTQKCYA
VRSNINEFLDIARRTYTEIVDDIAVRPEFTDTLAIKQGWHPILEKISAEKPIANNTYVTE
GSNFLIITGPNMSGKSTYLKQIALCQIMAQIGSYVPAEYSSFRIAKQIFTRISTDDDIET
NSSTFMKEMKEQNQRSTPEMERQRAVYHLATRLVQTARNSQLDPDSLRIYLSNLKKKYKE
DFPRTEQVPEKTEE

>gi568815597r:75818955_76032291|GENSCAN_predicted_CDS_4|765_bp
atggttctgattcaaggtgatcatgacagtaatgacgatgaagatgatgttgtaacactg
cagaaaaaacgcctacagatatggtttggaatcatacttgaaaagattaaaacagtaatt
aatgatgatgcaagatacatgaaaggatgcctaaacatgaggactcagaagtgctatgca
gtgaggtctaacataaatgaatttcttgacatagcaagaagaacatacacagagattgta
gatgacatagcagttcgaccagaatttactgatactttagcaatcaaacagggatggcat
cctattcttgaaaaaatatctgcggaaaaacctattgccaacaatacctatgttacagaa
gggagtaattttttgatcataactggaccaaacatgagtggaaaatccacatatttaaaa
cagattgctctttgtcagattatggcccagattggatcatatgttccagcagaatattct
tcctttagaattgctaaacagatttttacaagaattagtactgatgatgatatcgaaaca
aattcatcaacatttatgaaagaaatgaaagagcaaaaccaaaggagtacccctgagatg
gaaagacagagagctgtgtaccatctagccactaggcttgttcaaactgctcgaaactct
caattggatccagacagtttacgaatatatttaagtaacctcaagaagaagtacaaagaa
gattttcccaggactgaacaagttccagaaaagactgaagaataa

>gi568815597r:75818955_76032291|GENSCAN_predicted_peptide_5|513_aa
MNWKDKREVADGVNLWSYVMLYTRYQNMEMNKTQPKAYTVNQNEISLRREVQMWLWGKSS
RDPCLHVPKTEISETKMQSELDRINLSSILGKQYQTQISQSPKTHIEMAASLKAHFMAKD
DNREESNHMSNTACWYGCEDFTSVSTFAHQGVGSWDAYEPAASLLLSKGYTRQQFCHEST
SNKGDRVIYKLTHPPYCCVRFPLAGKRPHILYLSRLASNLELFKRGKGRGEQRKEEVTCG
MLKKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELQFTIASKRIKYLGIQLTKD
VKDLFKENYKPLLNEIKENTMEEHSMLMDRKNQYHENGHTAQDYIAFVEKSGYRFEVSFN
LDFTEICVNTILYWVFARKGNPDFVELLLKKTKDYVQDRSCNLALIWRTFTPVYCPSPLS
GITPLFYVAQTRQSNIFKILLQYGILEREKNPINIVLTIVLYPSRVRVMVDRELADIHED
AKTCLVLCSRVLSVISVKEIKCLSSEYECLSQL

>gi568815597r:75818955_76032291|GENSCAN_predicted_CDS_5|1542_bp
atgaactggaaagataagcgggaagtggctgatggggttaacttatggagctatgtcatg
ctctatactaggtaccagaatatggagatgaataaaacacagcccaaagcctatacagtc
aaccagaatgaaatcagtcttagaagagaggtacagatgtggctctgggggaaatcttct
agagatccctgcctccatgtccccaaaacagaaatttcagagaccaaaatgcagtctgaa
ttagacagaattaacttgtccagcatccttggaaagcagtatcagacacagatttcacaa
agcccgaagacccatattgagatggcagcatcactcaaagcacacttcatggccaaggat
gataatagagaagaaagcaaccacatgtcaaacactgcttgttggtatggctgtgaggac
ttcacttcagtgagcacctttgcacaccagggagtgggaagctgggatgcatatgaacca
gcagcctctttgttattaagcaaagggtatactcgccagcagttttgccatgagagtaca
tcaaacaaaggagacagggtcatttataaactgacgcatccaccctactgctgtgtccgg
tttccattggctggaaaaagacctcacattctgtatttgtcccgactggctagcaacttg
gagctttttaaaagaggcaaaggcagaggagaacaaaggaaggaggaagtaacttgtgga
atgctgaaaaagctgataagcaacttcagcaaggtctcaggatacaaaatcaatgtgcaa
aaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaa
ctccaattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaaaggat
gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaaaacaca
atggaagaacattccatgctcatggataggaagaatcagtatcatgaaaatggccacact
gcccaagattacattgcatttgtggaaaaatcaggataccgttttgaagtaagttttaac
ctcgacttcactgaaatatgtgtgaatacaattctgtactgggtttttgccagaaaaggt
aatcctgactttgtggaattgcttctcaagaagacaaaagactatgttcaagacagaagt
tgtaacctggcactgatatggagaactttcacaccagtatactgtccaagcccattaagt
ggcatcacacctctcttttatgtagctcagacaagacagtctaatatcttcaaaatacta
ctgcaatatggaatcttagaaagagaaaaaaaccctatcaacattgtcttaacaatagta
ctctacccttcgagagtaagagtaatggttgatcgtgaattggctgacatccatgaagat
gccaaaacatgtttggtactatgttccagagtgctttctgtcatttcagtcaaggaaata
aagtgcctcagcagtgaatatgaatgtctgagccaactataa