GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:41:28 Sequence gi568815582f:23086272_23315466 : 229195 bp : 44.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7127 7166 40 -1.86 1.01 Init + 11781 12131 351 2 0 58 53 141 0.386 4.76 1.02 Term + 12187 12894 708 0 0 75 47 179 0.988 6.01 1.03 PlyA + 13420 13425 6 1.05 2.05 PlyA - 14159 14154 6 1.05 2.04 Term - 15393 15284 110 1 2 86 43 50 0.454 -1.03 2.03 Intr - 16192 16048 145 1 1 91 94 91 0.990 9.86 2.02 Intr - 21912 21775 138 0 0 40 116 113 0.917 9.96 2.01 Init - 62999 62367 633 2 0 99 105 864 0.966 82.95 2.00 Prom - 75875 75836 40 -4.56 3.00 Prom + 95138 95177 40 -3.76 3.01 Init + 95986 96187 202 0 1 48 98 151 0.088 9.57 3.02 Intr + 99957 100317 361 1 1 123 71 517 0.111 47.78 3.03 Intr + 103100 103400 301 2 1 131 99 296 0.970 31.74 3.04 Intr + 106081 106271 191 0 2 89 71 290 0.999 25.78 3.05 Intr + 107900 108003 104 2 2 126 91 75 0.999 11.42 3.06 Intr + 110993 111156 164 0 2 78 99 157 0.983 15.49 3.07 Intr + 123479 123577 99 1 0 73 92 135 0.999 12.71 3.08 Intr + 125763 125880 118 2 1 86 64 90 0.574 6.44 3.09 Intr + 126407 126503 97 0 1 112 64 121 0.816 11.17 3.10 Intr + 126566 126623 58 2 1 110 107 -35 0.712 -0.51 3.11 Intr + 126831 126892 62 2 2 101 108 9 0.652 1.73 3.12 Intr + 128441 128516 76 2 1 74 65 60 0.907 1.82 3.13 Term + 128818 129198 381 0 0 79 54 343 0.948 24.84 3.14 PlyA + 129696 129701 6 1.05 4.00 Prom + 139398 139437 40 -2.16 4.01 Init + 140175 140312 138 2 0 65 84 95 0.767 6.94 4.02 Term + 140559 140693 135 2 0 72 44 87 0.758 0.72 4.03 PlyA + 142758 142763 6 1.05 5.00 Prom + 158175 158214 40 -2.86 5.01 Sngl + 158875 159165 291 0 0 76 42 141 0.752 3.95 5.02 PlyA + 160997 161002 6 1.05 6.00 Prom + 168063 168102 40 -2.86 6.01 Init + 168499 168534 36 0 0 70 91 14 0.245 0.11 6.02 Term + 177362 177427 66 1 0 102 38 94 0.765 3.74 6.03 PlyA + 178067 178072 6 1.05 7.05 PlyA - 178447 178442 6 1.05 7.04 Term - 186579 186365 215 1 2 45 40 134 0.289 1.69 7.03 Intr - 200333 200129 205 2 1 67 100 38 0.187 1.67 7.02 Intr - 201363 201329 35 1 2 113 96 -23 0.021 -1.06 7.01 Init - 226544 226502 43 2 1 105 78 18 0.228 3.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 71076 71120 45 2 0 69 64 59 0.857 2.28 S.002 Term + 74339 74470 132 1 0 119 42 56 0.872 2.19 S.003 Init + 100001 100317 317 1 2 91 71 511 0.886 46.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_1|352_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDAKDVFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPRNWK KLLQSSYGTKKARITKTILSKKNKAGGITLPNFKLYYKATVTKTAWYWYQHRDIDQWNRT EASEVTPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDLFLTPYTKINPR WIKDLNLRPRTIKTLEENLGSIFRDIGMSKGFMTKTPKAKATKAKTDKWDLIKLKSFCTA KETTIRVNRQPTEWEKIFAIYPSDKGLISRIYKELKRIYKKKTTPSKSGQRV >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_1|1059_bp atgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaaactgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcctatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgcaaaggacgtcttcaaggag aactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaggaattggaaa aaactacttcaaagttcatatggaaccaaaaaagcccgcatcaccaagacaatcctaagc aaaaagaacaaagctggaggcatcacgctacctaacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaacacagagatatagaccaatggaacagaaca gaggcctcagaagtaacaccacacatctacaaccatctgatctttgacaaacctgacaaa aacaagaaatggggaaaggattccctgtttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatctcttccttacaccttatacaaaaattaatccaagg tggattaaagacttaaatcttagacctagaaccataaaaaccctagaagaaaacctaggc agtatctttagggacataggcatgagcaagggcttcatgactaaaacaccaaaagcaaag gcaacaaaagccaaaacagacaaatgggatctaattaaactaaagagcttctgcacggca aaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgcaatc tatccatctgacaaagggctaatatccagaatctataaagaactcaaacgaatttacaag aaaaaaacaaccccatcaaaaagtgggcaaagggtatga >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_2|341_aa MSKVTAPGSGPPAAASGKEKRSFSKRLFRSGRAGGGGAGGPGASGPAAPSSPSSPSSARS VGSFMSRVLKTLSTLSHLSSEGAAPDRGGLRSCFPPGPAAAPTPPPCPPPPASPAPPACA AEPVPGVAGLRNHGNTCFMNATLQCLSNTELFAEYLALGQYRAGRPEPSPDPEQPAGRGA QGQGEVTEQLAHLVRALWTLEYTPQHSRDFKTIVSKNALQYRGNSQHDAQEFLLWLLDRV HEDLNHSVKQSGQPPLKIVLTEMYYDGFHRSFCDTDDLETVHESDCIFAFETPEIFRPEG ILSQRVVINASVLPGNWLEMQILGPYSTPSKGETLVNKPFR >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_2|1026_bp atgtccaaggtaacggcgcctgggtccgggccgccggcggcggcgagcgggaaggagaag cgctccttcagcaagcggctgtttcggagcggccgcgctggcggcggcggcgcggggggc cccggggcgtccgggccggccgcgccttcctcgccctcctcgccctcctctgcacgctcg gtgggcagcttcatgagccgcgttctcaagacgctgtccacgctctcgcaccttagctct gagggcgccgccccagaccgcggcggcctccgcagctgcttcccgcccgggccggccgcc gcgcccacgccgccgccgtgcccgccgccgcccgcctctcccgcgccgcccgcttgcgcc gccgagccggtgcccggcgtggcggggctccgcaaccacggcaacacgtgcttcatgaac gccacgctgcagtgcctcagcaacaccgagctcttcgccgagtacctggcgctgggccag taccgggcggggcggcccgagccctcgcctgacccggagcagcctgcgggccgcggcgcg cagggccagggcgaggtcactgagcagctggcgcacctggtgcgggccctctggaccctg gagtacaccccgcagcacagccgcgacttcaagactattgtgtcaaagaatgcactgcag taccggggaaattcccaacatgatgcccaggagtttctgctgtggcttttggaccgagtt catgaagacctcaaccattcagtgaagcagagtggccagcctcctctgaagattgtgtta acagaaatgtactatgatgggttccatcgttccttttgtgatacagacgacctggaaaca gtccatgaaagcgactgcatttttgcctttgagactcccgaaatatttaggcctgaagga attctcagtcaaagagtggttattaatgcatcagtattacctgggaactggttagaaatg caaattcttggtccctactccacaccgagtaaaggagaaactctggtaaacaagcccttt aggtag >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_3|737_aa MSGLAEQPLLPTAGRWWALKLQVWRESEERGCVGLLLVLGQAPAIWTGVEGSRGAVIGGS RRFEAPWARPSSESRPQSPILAMAPGEKIKAKIKKNLPVTGPQAPTIKELMRWYCLNTNT HGCRRIVVSRGRLRRLLWIGFTLTAVALILWQCALLVFSFYTVSVSIKVHFRKLDFPAVT ICNINPYKYSTVRHLLADLEQETREALKSLYGFPESRKRREAESWNSVSEGKQPRFSHRI PLLIFDQDEKGKARDFFTGRKRKVGGSIIHKASNVMHIESKQVVGFQLCSNDTSDCATYT FSSGINAIQEWYKLHYMNIMAQVPLEKKINMSYSAEELLVTCFFDGVSCDARNFTLFHHP MHGNCYTFNNRENETILSTSMGGSEYGLQVILYINEEEYNPFLVSSTGAKVIIHRQDEYP FVEDVGTEIETAMVTSIGMHLTESFKLSEPYSQCTEDGSDVPIRNIYNAAYSLQICLHSC FQTKMVEKCGCAQYSQPLPPAANYCNYQQHPNWMYCYYQLHRAFVQEELGCQSVCKEACR YVDPKGFKEWTLTTSLAQWPSVVSEKWLLPVLTWDQGRQVNKKLNKTDLAKLLIFYKDLN QRSIMESPANSIEMLLSNFGGQLGLWMSCSVVCVIEIIEVFFIDFFSIIARRQWQKAKEW WAWKQAPPCPEAPRSPQGQDNPALDIDDDLPTFNSALHLPPALGTQVPGTPPPKYNTLRL ERAFSNQLTDTQMLDEL >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_3|2214_bp atgagtggcctggctgaacagcccctgctgcctacagccggacgctggtgggcactgaag ctgcaggtctggagggagtccgaggagcgaggctgcgtgggattactgctggtgctaggg caggcacctgccatctggacgggagtggaaggaagtcgcggggcagtaatagggggcagc aggcgctttgaggcgccttgggcacgcccgtcctcagagtcccgtcctcaaagtcccatc ctcgccatggcacccggagagaagatcaaagccaaaatcaagaagaatctgcccgtgacg ggccctcaggcgccgaccattaaagagctgatgcggtggtactgcctcaacaccaacacc catggctgtcgccgcatcgtggtgtcccgcggccgtctgcgccgcctcctctggatcggg ttcacactgactgccgtggccctcatcctctggcagtgcgccctcctcgtcttctccttc tatactgtctcagtttccatcaaagtccacttccggaagctggattttcctgcagtcacc atctgcaacatcaacccctacaagtacagcaccgttcgccaccttctagctgacttggaa caggagaccagagaggccctgaagtccctgtatggctttccagagtcccggaagcgccga gaggcggagtcctggaactccgtctcagagggaaagcagcctagattctcccaccggatt ccgctgctgatctttgatcaggatgagaagggcaaggccagggacttcttcacagggagg aagcggaaagtcggcggtagcatcattcacaaggcttcaaatgtcatgcacatcgagtcc aagcaagtggtgggattccaactgtgctcaaatgacacctccgactgtgccacctacacc ttcagctcgggaatcaatgccattcaggagtggtataagctacactacatgaacatcatg gcacaggtgcctctggagaagaaaatcaacatgagctattctgctgaggagctgctggtg acctgcttctttgatggagtgtcctgtgatgccaggaatttcacgcttttccaccacccg atgcatgggaattgctatactttcaacaacagagaaaatgagaccattctcagcacctcc atggggggcagcgaatatgggctgcaagtcattttgtacataaacgaagaggaatacaac ccattcctcgtgtcctccactggagctaaggtgatcatccatcggcaggatgagtatccc ttcgtcgaagatgtgggaacagagattgagacagcaatggtcacctctataggaatgcac ctgacagagtccttcaagctgagtgagccctacagtcagtgcacggaggacgggagtgac gtgccaatcaggaacatctacaacgctgcctactcgctccagatctgccttcattcatgc ttccagacaaagatggtggagaaatgtgggtgtgcccagtacagccagcctctacctcct gcagccaactactgcaactaccagcagcaccccaactggatgtattgttactaccaactg catcgagcctttgtccaggaagagctgggctgccagtctgtgtgcaaggaagcctgcagg tatgtggaccccaagggctttaaagagtggacactaaccacaagcctggcacaatggcca tctgtggtttcggagaagtggttgctgcctgttctcacttgggaccaaggccggcaagta aacaaaaagctcaacaagacagacttggccaaactcttgatattctacaaagacctgaac cagagatccatcatggagagcccagccaacagtattgagatgcttctgtccaacttcggt ggccagctgggcctgtggatgagctgctctgttgtctgcgtcatcgagatcatcgaggtc ttcttcattgacttcttctctatcattgcccgccgccagtggcagaaagccaaggagtgg tgggcctggaaacaggctcccccatgtccagaagctccccgtagcccacagggccaggac aatccagccctggatatagacgatgacctacccactttcaactctgctttgcacctgcct ccagccctaggaacccaagtgcccggcacaccgccccccaaatacaataccttgcgcttg gagagggccttttccaaccagctcacagatacccagatgctggatgagctctga >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_4|90_aa MDTAHEIRRDYDGHGPASMHQGLYTEYERRERKRTGSELGMGPESRPSSFEAPSTNIPLL SPLPTPPSIIQDRLHPPKTSAPAALFAPYI >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_4|273_bp atggatacagctcatgaaataaggagagattatgatggccatgggccagccagcatgcac cagggtctttacactgagtatgaaaggagggagagaaaaaggacaggatcagagcttggc atggggcctgagtcgaggccttcctcctttgaagcccccagcaccaacataccacttctt tcacccttgccaacaccacccagcatcatccaggaccgtttacatcccccgaagacttca gcccctgctgccctgtttgctccttacatttag >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_5|96_aa MDAKLIGYSRGLHLFISQGISHDPPSLGQIGFLVTGLMEYPETPQLANHFQKECGLRTAV WTHPVVPGWRPSSTLSLALRKCGSKRQGAAASRLGP >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_5|291_bp atggatgcaaagctcatcggatactccaggggtcttcatctcttcatctcccaaggaatc tcccatgatcctccatcccttgggcaaattgggttccttgtaactggcctgatggaatac ccagaaacgcctcagttggccaaccattttcagaaagaatgtgggctccggaccgcagtc tggactcacccagttgttccaggatggaggcccagctccactctctcccttgcccttagg aagtgtggcagcaaacgacaaggggcagctgccagccggctgggaccttag >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_6|33_aa MQGRRWLTQFLQPSQCEDDKDEDLCDDPLPLNE >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_6|102_bp atgcagggtagacgttggctgacacagttcctgcagccttctcaatgtgaagatgacaaa gatgaagacctttgtgatgatccacttccacttaatgaataa >gi568815582f:23086272_23315466|GENSCAN_predicted_peptide_7|165_aa MAPLSIIWCQGIGGGGGSVGHPLGLQKLNLWFDQILATRYRQEFFLGGVISMLLHHTRKL KMKGCLISEAKIDSWVQVVTTRSFHCEVVSSLRTKRKASGPKDMSTRKLLQHRLWQNAET VTAHQYRVGERILVHPLCTIIRTTECYAATCYATTKCYAATTLLD >gi568815582f:23086272_23315466|GENSCAN_predicted_CDS_7|498_bp atggcgccactctctatcatctggtgccaagggatcgggggaggaggtggatctgtaggc caccccctgggactacagaagctaaatctatggtttgatcagattttggctacacgttat cggcaagaattcttcctgggtggtgttataagtatgctcttgcatcacaccagaaagctc aaaatgaaaggctgtctcattagcgaagctaagattgattcatgggttcaggtggtgaca accagatccttccattgtgaagtcgtttcttctctcagaaccaaaagaaaagcatcagga cccaaagacatgagtaccaggaagttactgcagcatcgtttgtggcaaaatgcggaaaca gtgactgcccatcaatacagagtgggagaacgcatactggtgcatccattatgcaccatt attcgtaccacggaatgttatgctgccacgtgttatgctaccaccaaatgttatgctgcc actacactgttggattag