GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:16:36 Sequence gi568815575r:19436420_19936283 : 499864 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 15624 16031 408 2 0 7 42 398 0.782 22.32 1.02 PlyA + 16296 16301 6 1.05 2.23 PlyA - 19394 19389 6 1.05 2.22 Term - 20600 20490 111 2 0 77 48 132 0.436 6.66 2.21 Intr - 23734 23566 169 0 1 71 80 219 0.996 19.35 2.20 Intr - 27987 27794 194 0 2 85 46 172 0.895 10.99 2.19 Intr - 30279 30247 33 0 0 76 89 38 0.663 1.12 2.18 Intr - 50086 50063 24 1 0 125 98 -24 0.266 0.52 2.17 Intr - 52548 52409 140 1 2 120 87 140 0.987 17.28 2.16 Intr - 59026 58979 48 2 0 129 87 -2 0.241 2.45 2.15 Intr - 72705 72568 138 1 0 88 20 86 0.014 2.24 2.14 Intr - 78894 78482 413 2 2 56 94 494 0.044 40.63 2.13 Intr - 90676 90554 123 0 0 36 81 63 0.079 0.10 2.12 Intr - 92878 92781 98 1 2 32 97 88 0.159 2.91 2.11 Intr - 94119 94032 88 2 1 55 70 85 0.075 3.37 2.10 Intr - 100595 100507 89 2 2 93 69 16 0.093 -1.03 2.09 Intr - 101361 101298 64 2 1 99 96 102 0.941 10.82 2.08 Intr - 105774 105506 269 0 2 141 81 235 0.517 24.43 2.07 Intr - 109631 109503 129 2 0 46 89 83 0.954 5.09 2.06 Intr - 113664 113555 110 1 2 84 84 31 0.480 2.30 2.05 Intr - 132769 132684 86 0 2 52 97 126 0.543 9.36 2.04 Intr - 140015 139991 25 0 1 92 66 -2 0.009 -4.92 2.03 Intr - 150400 150332 69 2 0 16 96 102 0.721 2.95 2.02 Intr - 150909 150850 60 1 0 82 89 47 0.780 2.91 2.01 Init - 152690 152636 55 2 1 96 86 52 0.955 7.26 2.00 Prom - 154563 154524 40 -0.36 3.15 PlyA - 155222 155217 6 1.05 3.14 Term - 165231 165113 119 1 2 80 43 81 0.797 1.50 3.13 Intr - 167026 166804 223 1 1 17 69 135 0.897 2.20 3.12 Intr - 171626 171519 108 2 0 148 101 139 0.999 21.68 3.11 Intr - 195539 195445 95 0 2 93 92 86 0.926 9.18 3.10 Intr - 209056 208981 76 1 1 105 70 59 0.022 4.89 3.09 Intr - 247609 247404 206 2 2 95 95 197 0.988 20.02 3.08 Intr - 251272 251198 75 2 0 81 79 48 0.832 2.69 3.07 Intr - 259322 259193 130 2 1 128 80 148 0.999 18.27 3.06 Intr - 270565 270462 104 2 2 84 99 124 0.949 12.99 3.05 Intr - 282350 282323 28 2 1 102 41 24 0.025 -3.21 3.04 Intr - 288271 288223 49 0 1 68 78 71 0.111 2.78 3.03 Intr - 304946 304877 70 0 1 65 78 75 0.534 2.44 3.02 Intr - 310022 309899 124 2 1 107 64 96 0.794 9.26 3.01 Init - 327043 327005 39 1 0 64 115 50 0.595 5.48 3.00 Prom - 343493 343454 40 -2.46 4.00 Prom + 356575 356614 40 -1.86 4.01 Init + 375328 375456 129 0 0 86 81 139 0.930 13.15 4.02 Term + 375702 375812 111 2 0 35 48 91 0.506 -1.64 4.03 PlyA + 379527 379532 6 1.05 5.05 PlyA - 379583 379578 6 1.05 5.04 Term - 383873 383632 242 0 2 33 32 119 0.066 -3.01 5.03 Intr - 399863 399732 132 0 0 120 68 196 0.796 21.42 5.02 Intr - 407119 406966 154 1 1 55 95 65 0.230 3.55 5.01 Init - 425661 425554 108 0 0 49 77 111 0.036 6.32 5.00 Prom - 436867 436828 40 -3.76 6.00 Prom + 440406 440445 40 -5.56 6.01 Init + 443027 443080 54 1 0 67 73 55 0.219 3.18 6.02 Intr + 448512 448576 65 2 2 76 110 20 0.364 0.52 6.03 Intr + 449911 450125 215 1 2 63 96 93 0.524 5.96 6.04 Intr + 450536 450740 205 0 1 73 45 84 0.310 0.86 6.05 Term + 456863 456896 34 2 1 116 44 23 0.178 -2.24 6.06 PlyA + 457882 457887 6 1.05 7.04 PlyA - 463680 463675 6 1.05 7.03 Term - 487817 487719 99 2 0 74 42 126 0.775 4.73 7.02 Intr - 493521 493366 156 0 0 106 96 37 0.924 6.51 7.01 Intr - 499479 499390 90 0 0 110 67 7 0.482 0.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 78842 78482 361 2 1 76 94 518 0.955 46.44 S.002 Term + 142453 142931 479 2 2 89 44 158 0.815 6.50 S.003 Intr - 198368 198305 64 2 1 107 45 45 0.804 0.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_1|135_aa SRILSEKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRRE EKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRREEKRKRREEKRREKRKEKRKE KKRKEKKKEKEKEKN >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_1|408_bp agcaggattctgtcagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaaagagaagagaagagaagagaagagagaaaagaaaagagaaaagaaaagaa aagaaaagaaaagaaaagaagaaagaaaaagaaaaagaaaaaaattaa >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_2|844_aa MAVSRFSQASKEELEYAEAPGSRHSTFCLYGFDYSRYLSSNVEASRNVGWLPSRYRAVVI SVLILVPGTWGDSPKIDLAGSSLSGILDKDLSDRSNDIDLEGFDSVVSSTEKLSHPTTSR PKATGRRPPSQSLTSSSLSSPDIFDSPSPEEDKEEHISLAHRGVDASKKTSKTVTISQVS DNKASLPPKPGTMAAGGGGPAPLSSAAPSPLSSSLGTAGHRANSPSLFGTEGKPKMEPAA SSQAAVEELRTQVRELRSIIETMKDQQKREIKQLLSELDEEKKIRLRLQGVRALSPVASD STLLLPNGRECVGVLIDRRAVFLEEVNLERGEGHELKEELSEEASGLKGAYGDVPDAMRG AECDRVPALKELGVKWAEVGRLPESEILESNPTICVLTSPLSNSDARISLKTTIMGGSTH SHGSQPLATRPGEGRAGGHMESGGGNAPAGALGAASESPQCPPPPGVEGAAGPAEPDGAA EGAAGGSGEGESGGGPRRALRAVYVRSESSQGGAAGGPEAGARQCLLRACEAEGAHLTSV PFGELDFGETAVLDAFYDAVFHGVLCGSKSWGCVTSTYSGLDELQENGRQALDVSKSKWM LLEKEIKTRSPRQRVPLTMTLDVAVVDMSDVSRQPSLFYHLGVRESFDMANNVILYHDTD ADTALSLKDMVTQKNTEVFDHNINQFQASSGNYYFIPYIVTPCADYFCCESDAQRRASEY MQPNWDNILGPLCMPLVDRFISLLKDIHVTSCVYYKETLLNDIRKAREKYQGEELAKELA RIKLRMDNTEVLTSDIIINLLLSYRDIQDYDAMVKLVETLEMLPTCDLADQHNIKFHYAF ALNR >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_2|2535_bp atggcagtgtcacgtttcagccaggcttcgaaggaagagctggagtatgctgaagcccct ggcagccgccattctactttctgtctctatggatttgactactccaggtacctcagctca aatgtagaggcaagtcgcaacgtgggctggctgcccagtcgatacagagctgtggtcatc tcagtgcttattctagtgcctggcacatggggtgacagtccaaagattgacttggccggc agttcgctatctggcatcctggacaaagatctctcggaccgcagcaatgacattgactta gaaggttttgactccgtggtatcatctactgagaaactcagtcatccgaccacaagcaga ccaaaagctacagggaggcggcctccgtcccagtccctcacatcttcatccctttcaagc cctgatatcttcgactccccaagtcccgaagaggataaggaggaacacatttcacttgcg cacagaggagtggacgcgtcaaagaaaacttccaagactgttaccatatcccaagtgtct gacaacaaagcatccctgccgcccaagccggggaccatggcagcaggtggcggtgggcca gcccctctgtcctcagcggcgccctcccccctgtcatcctctttgggaacagctggacac agagccaactccccgtctctgttcggcacggaaggaaaaccaaagatggagcctgcggcc agcagccaggcggccgtggaggagctaaggacacaggtccgcgagctgaggagcatcatc gagaccatgaaggaccagcagaaacgagagattaaacagttattgtctgagttggatgaa gagaagaaaatccggcttcggttgcagggcgttagagctctgtcccctgtagcctctgat tcaacactgctgttaccaaatggcagagaatgcgtgggagttctgattgaccgaagggca gtcttcttggaggaagtgaatctggaaagaggagaaggacatgaattgaaagaggaatta tcagaggaagcttcaggcctgaagggtgcttatggtgatgtgccagatgccatgagaggc gctgaatgcgacagagtccctgccctgaaggagctcggtgtcaaatgggcagaagtaggc agacttcctgaatcagaaattctggaaagcaacccaacaatctgtgttttaacaagtcct ctcagtaattctgatgcacgcataagtttgaagaccaccatcatggggggatcaactcat tcccacggatcgcagccgctggccactcgcccaggggaagggcgcgcaggcgggcacatg gagagcggcggtgggaatgctccggccggggccctcggggcggcgagcgagtcccctcag tgcccgccgccgccgggggtggagggcgcggccgggccggcggagcccgacggggcggcg gagggcgcggcaggcggcagcggcgagggcgagagtgggggcgggccgcggcgggctctg cgggcagtatacgtgcgcagtgagagctcccagggcggcgcggccggcggcccggaggct ggggcgcggcagtgcctgctgcgggcctgcgaggccgagggcgctcacctcacctccgtg cccttcggggagctggacttcggggagacggccgtgctcgacgccttctacgacgcagtg tttcatggggtcctgtgtgggtctaaatcttggggctgtgtcactagcacatatagtgga ttggatgagctgcaggaaaatggcaggcaggccttggacgtaagcaagagcaagtggatg ctgttggaaaaggaaattaaaactcgatcccccaggcaacgagtaccactcactatgaca cttgatgttgctgtggtagacatgagcgatgtctccagacagccttccctcttctaccat cttggagtccgagaaagctttgacatggccaataatgtgatcttgtaccatgacaccgat gccgacactgctctctctttgaaggacatggtaactcaaaaaaacacagaagtctttgac cataacatcaaccaatttcaggcatccagtggaaattattatttcatcccatacatcgtg acaccgtgcgctgattatttttgctgcgagagtgatgcccagagacgagcctccgagtac atgcagcccaactgggacaacatcctgggcccgctgtgcatgcctttggtggacaggttc attagcctccttaaggacatccacgtgacctcatgtgtttattacaaagaaaccttgtta aatgacatccggaaagccagagagaaataccaaggtgaggaactggcgaaggagctagct cggatcaagctccgcatggataatactgaggttctgacctcagacatcatcattaactta ctcctgtcctaccgtgatatccaggactatgatgcgatggtgaagctggtggaaacactg gagatgctgcctacgtgtgatttggccgatcagcataacattaaattccactatgcgttt gcactgaataggtaa >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_3|481_aa MSKLITEKDKWITEIKKEMKKDPLTNKAPEKPLHEVPSGNSLLSSETILRTNKREINAER SDGLKDVICNRSSSDATRSQQTMAHGPNLALAVFTSHNQLSLGGERRRRRCQVAFSYLPQ NDDELELKVGDIIEVVGEVEEGWWEGVLNGKTGMFPSNFIKELSGESDELGISQDEQLSK SRKTTFEGTILYRAAPGKTEGHRRYYSLRETTGSESDGGDSSSTKSEGANGTVATAAIQP KKVKGVGFGDIFKDKPIKLRPRSIEVENDFLPVEKTIGKKLPATTATPDSSKTEMDSRTK SKDYCKVIFPYEAQNDDELTIKEGDIVTLINKDCIDVGWWEGELNGRRGVFPDNFVKLLP PDFEKEGNRLNQVAGSQLTHHNKNSDVSQSGTQRSACPGHFVPYGSDSLEAGFHQKPRLS GAFREGKESKQEEFVVLTKGLAASPTDTARELSGMCGGGLPKSRIAENQAETYFQNLTIF L >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_3|1446_bp atgagtaagctcatcacagaaaaagacaagtggataacggaaataaagaaagagatgaag aaagaccctctcaccaacaaagctccagaaaagcccctgcacgaagtgcccagtggaaac tctttgctgtcttctgaaacgattttaagaaccaataagagagagattaatgctgaaaga tctgacggtttaaaagatgttatttgtaacaggtctagctctgatgctacgagaagtcag caaacgatggcccatgggccaaacctggcactggctgtttttaccagccacaaccagttg tcattgggaggcgagcgacggaggcgccggtgccaggtggcattcagctacctgccccag aatgacgatgaacttgagctgaaagttggcgacatcatagaggtggtaggagaggtagag gaaggatggtgggaaggtgttctcaacgggaagactggaatgtttccttccaacttcatc aaggagctgtcaggggagtcggatgagcttggcatttcccaggatgagcagctatccaag tcaagaaagactacctttgaagggacaattctgtaccgagctgcaccgggaaagacggag ggccacagacgctattacagtttaagggaaaccacaggctccgagagtgatgggggtgac tcaagcagcaccaagtctgaaggtgccaacgggacagtggcaactgcagcaatccagccc aagaaagttaagggagtgggctttggagacattttcaaagacaagccaatcaaactaaga ccaaggtcaattgaagtagaaaatgactttctgccggtagaaaagactattgggaagaag ttacctgcaactacagcaactccagactcatcaaaaacagaaatggacagcaggacaaag agcaaggattactgcaaagtaatatttccatatgaggcacagaatgatgatgaattgaca atcaaagaaggagatatagtcactctcatcaataaggactgcatcgacgtaggctggtgg gaaggagagctgaacggcagacgaggcgtgttccccgataacttcgtgaagttacttcca ccggactttgaaaaggaagggaatagacttaatcaggtagcaggcagccagcttactcat cacaacaagaactcagatgtcagccagtctggaacgcagaggtctgcctgccccggtcat tttgtcccctatgggtctgattccttagaagccgggttccaccagaagccacgtttatcc ggtgctttccgggagggcaaggaatcaaaacaagaggagtttgttgttcttaccaagggt ctggcagcctcacccacagacaccgccagagaacttagtgggatgtgtggaggaggtctc ccaaagtccagaattgctgaaaatcaagctgagacttacttccagaacctcaccatattc ctctaa >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_4|79_aa MASFDSQEKYNPKSFSMATSGTAGSQHSHNWNWNQLHNLYWQPTHDIISLLVFMRKSWWS QRKYDNNEKLSFMTSKIDS >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_4|240_bp atggcttcgtttgacagccaggaaaagtacaaccccaagagcttcagcatggccaccagt ggcacagctggttcacagcacagccacaactggaactggaaccaacttcacaacctttac tggcaaccaacacatgatatcatctctttgttagtcttcatgagaaaatcctggtggtcc cagaggaagtatgacaataatgaaaagctctccttcatgacgagcaaaatagacagctaa >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_5|211_aa MKHGFPSAILKTKHNQSSGYQEAEVVQSKQKRSRAKWCVCEDTKRKTWLVCRSALLYAEI PTVSVALTWEIQTPKPFSPDTNGRALDVEAIVEFDYQAQHDDELTISVGEIITNIRKEDG GWWEGQINGRRALDRITKQKISKGIEGLNNIINQEVVIDVYRISYTETAQNTFFSSTHEI SAKLNHILGHKTNLNIFKGIEIIKKVFFDHK >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_5|636_bp atgaaacacggctttcccagtgccatcctgaagacaaagcacaatcagagcagtggctac caagaggcggaagtggtccagtcaaagcaaaagcggtcaagagcaaagtggtgtgtttgt gaagacaccaaaaggaaaacgtggctggtatgtaggtcagctttattgtacgcggagatc cccacagtatctgtggccttgacctgggaaattcagacccccaaacccttcagccctgac actaatgggagggcactggatgtggaggccatagtggagtttgactaccaggcccagcac gatgatgagctgacgatcagcgtgggtgaaatcatcaccaacatcaggaaggaggatgga ggctggtgggagggacagatcaacggcaggagagcacttgacagaattactaaacagaaa atcagcaagggtatagaaggcctgaacaacataatcaaccaagaagttgtaattgatgtt tacagaatatcctacacagaaacagcacaaaacacatttttttcaagtacccatgaaata tctgccaagttaaaccatatcctgggtcataaaactaacctcaatatatttaagggaatt gagataataaaaaaggtgttctttgaccataagtaa >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_6|190_aa MGHDCQGSFQHFNARDQYKTKYKIQFYYVIHQEKTHYQASLPMPGLGTSKNNLPEAEKQK LAVSKNITWVDQTFNSPSNHKCLANKVAASLGSKVVPMLRGLNQMGPRGFKAQRRARSGA KMAKTKLSPPTRKTNHTPCIPQPEPPGASPRDRLLRPPRVPVGAARGLGGHTGASGSEWG QGLTLSPKLE >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_6|573_bp atgggccatgactgccaaggctccttccagcacttcaatgccagggaccaatataaaacg aaatataagatacagttttactatgttatccatcaagagaaaactcattatcaggccagc cttcctatgcctggcctaggaacatccaaaaataatctgccagaggctgaaaaacagaag ctggctgtcagcaagaacattacctgggtagaccaaacatttaatagcccaagtaatcat aagtgccttgccaataaagtggcagcttctctgggcagcaaggtggttccaatgctccgg ggtctaaatcaaatgggacccagggggttcaaagcgcagaggagggcgcgctcgggtgca aaaatggccaagacaaaactctccccgcccacccgaaaaaccaaccacaccccctgcatc cctcagcccgagccgcccggggccagcccccgtgatcgtctgctcaggcccccgcgggtt cccgtcggcgccgcgcgcgggctcggcgggcacacgggcgcaagtgggagtgaatgggga cagggtctgactctgtcgcccaagctggagtga >gi568815575r:19436420_19936283|GENSCAN_predicted_peptide_7|114_aa NYTTQRKDIITHKPFEVEGNHRNTRVRPFKSNFRGGRCQPNYKSGLVQKSLYIQAKYQRL RFTGPRGFITHKFRERLMRKKKISAITSVDVFNPDYSPDLQIVSCLPDALTWMS >gi568815575r:19436420_19936283|GENSCAN_predicted_CDS_7|345_bp aattatactacgcagagaaaagacataattactcacaaaccatttgaggttgagggaaac caccgaaacacaagagtaagaccttttaagagcaactttagaggtggcagatgccagccc aattataaatcaggcctggtacagaagagcttgtacattcaggctaagtatcagcgttta cggttcactggcccaaggggatttatcactcataagttcagagaaagattaatgagaaaa aagaagatttcagccatcacctctgttgatgtcttcaaccctgactactctccagacctc cagattgtcagctgcctcccagatgccctcacatggatgtcctaa