GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:09:20 Sequence gi568815584r:67287332_67512059 : 224728 bp : 43.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5180 5388 209 0 2 85 87 96 0.619 8.00 1.02 Intr + 14641 14787 147 0 0 47 111 131 0.790 11.73 1.03 Intr + 15079 15240 162 0 0 50 111 134 0.999 12.07 1.04 Intr + 25196 25379 184 1 1 99 110 124 0.996 15.06 1.05 Intr + 29501 29572 72 0 0 73 66 63 0.795 1.98 1.06 Intr + 30077 30148 72 0 0 52 115 45 0.885 2.98 1.07 Intr + 32899 33066 168 2 0 50 99 175 0.999 14.72 1.08 Intr + 33726 33928 203 1 2 67 103 89 0.978 7.30 1.09 Intr + 36371 36481 111 1 0 58 98 80 0.925 6.58 1.10 Term + 37512 37541 30 2 0 79 43 9 0.293 -6.65 1.11 PlyA + 39536 39541 6 -0.45 2.02 PlyA - 40153 40148 6 1.05 2.01 Sngl - 41748 41104 645 0 0 86 48 273 0.945 19.29 2.00 Prom - 46608 46569 40 -2.76 3.11 PlyA - 47068 47063 6 1.05 3.10 Term - 51431 51290 142 1 1 96 33 158 0.983 8.50 3.09 Intr - 53187 53109 79 1 1 74 58 100 0.931 4.21 3.08 Intr - 56107 56041 67 1 1 69 101 18 0.932 -0.32 3.07 Intr - 58540 58437 104 2 2 102 111 62 0.998 9.79 3.06 Intr - 60122 60078 45 0 0 96 98 31 0.906 3.28 3.05 Intr - 61773 61706 68 2 2 109 97 65 0.996 8.05 3.04 Intr - 63359 63280 80 2 2 84 110 23 0.877 2.45 3.03 Intr - 65709 65592 118 2 1 60 60 162 0.722 11.07 3.02 Intr - 72494 72327 168 1 0 25 101 62 0.331 0.26 3.01 Init - 73329 73220 110 0 2 81 52 85 0.451 4.14 3.00 Prom - 73882 73843 40 -4.76 4.00 Prom + 76302 76341 40 -3.76 4.01 Init + 77497 77677 181 0 1 74 115 202 0.497 20.84 4.02 Intr + 87137 87216 80 0 2 68 93 34 0.794 1.17 4.03 Intr + 89108 89259 152 1 2 88 62 199 0.991 16.26 4.04 Intr + 93328 93434 107 1 2 51 99 74 0.964 4.66 4.05 Intr + 94262 94359 98 0 2 57 113 61 0.997 5.23 4.06 Intr + 95116 95259 144 0 0 113 78 84 0.998 10.38 4.07 Term + 95984 96109 126 1 0 92 32 179 0.996 10.98 4.08 PlyA + 96508 96513 6 1.05 5.11 PlyA - 96602 96597 6 1.05 5.10 Term - 100125 99998 128 1 2 107 48 89 0.995 5.24 5.09 Intr - 100971 100893 79 0 1 115 95 -11 0.982 1.42 5.08 Intr - 103415 103332 84 2 0 72 87 59 0.897 4.22 5.07 Intr - 105096 104995 102 0 0 101 80 91 0.998 9.97 5.06 Intr - 105518 105331 188 0 2 51 78 311 0.999 25.81 5.05 Intr - 105910 105819 92 0 2 72 97 135 0.990 12.44 5.04 Intr - 108252 108071 182 0 2 125 73 260 0.996 26.87 5.03 Intr - 110287 110183 105 1 0 114 91 -10 0.823 2.31 5.02 Intr - 110495 110331 165 2 0 104 131 207 0.999 26.76 5.01 Init - 124728 124687 42 0 0 111 96 109 0.997 14.42 5.00 Prom - 127909 127870 40 -6.46 6.00 Prom + 129718 129757 40 -4.06 6.01 Init + 139038 139158 121 2 1 107 70 33 0.796 3.76 6.02 Intr + 141447 141551 105 1 0 56 66 124 0.750 7.19 6.03 Intr + 143191 143344 154 2 1 47 73 36 0.107 -2.87 6.04 Intr + 145799 145901 103 2 1 77 53 76 0.079 3.18 6.05 Intr + 165413 165505 93 1 0 61 115 36 0.834 3.76 6.06 Term + 170642 170746 105 1 0 60 48 108 0.763 2.41 6.07 PlyA + 172403 172408 6 1.05 7.07 PlyA - 173597 173592 6 1.05 7.06 Term - 186610 186089 522 1 0 114 53 1187 0.751 112.08 7.05 Intr - 191259 191110 150 0 0 56 94 23 0.177 0.06 7.04 Intr - 192924 192873 52 2 1 64 94 30 0.261 0.11 7.03 Intr - 194970 194886 85 1 1 21 97 52 0.038 -1.72 7.02 Intr - 207357 207249 109 0 1 77 66 68 0.120 3.36 7.01 Init - 208689 208594 96 0 0 84 26 73 0.108 1.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_1|452_aa XIEDLFSSLKHIQHTLVDSQSQEDISLLLQLVQNKDFQNAFKIHNAITVHMNKASPPFPL ISNAQDLAQEALLLAHDKVAEQEMQLEPITDERVYESIGQYGGETVKIVRIEKARDIPLG ATVRNEMDSVIISRIVKGGAAEKSGLLHEGDEVLEINGIEIRGKDVNEVFDLLIHVKAHF DYDPSDDPYVPCRELGLSFQKGDILHVISQEDPNWWQAYREGDEDNQPLAGLVPGKSFQQ QREAMKQTIEEDKEPEKSGKLWCAKKNKKKRKKVLYNANKNDDYDNEEILTYEEMSLYHQ PANRKRPIILIGPQNCGQNELRQRLMNKEKDRFASAVPHTTRSRRDQEVAGRDYHFVSRQ AFEADIAAGKFIEHGEFEKNLYGTSIDSVRQVINSGKICLLSLRTQSLKTLRNSDLKPYI IFIAPPSQERLRALLAKEGKNPKPPKVLGLQV >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_1|1359_bp naaatagaagacttgttttcttcacttaaacatatccaacatactttggtagattctcag agccaggaggatatttcactgcttttacaacttgttcaaaataaggatttccagaatgca tttaagatacacaatgccatcacagtacacatgaacaaggccagtcctccatttcctctt atctccaacgcacaagatcttgctcaagaggcacttttactggcccacgataaggttgct gagcaggaaatgcagctagagcccattacagatgagagagtttatgaaagtattggccag tatggaggagaaactgtaaaaatagttcgtatagaaaaggctcgtgatattccgttgggt gctacagttcgtaatgaaatggactctgtcatcattagccggatagtaaaagggggtgct gcagagaaaagtggtctgttgcatgaaggagatgaagttctagagattaatggcattgaa attcgggggaaagatgtcaatgaggtttttgacttgttgatccatgtaaaagctcatttt gactatgacccctcagatgacccttatgttccatgtcgagagttaggtctgtcttttcaa aaaggtgatatacttcatgtgatcagtcaagaagatccaaactggtggcaggcctacagg gaaggggacgaagataatcaacctctagccgggcttgttccagggaaaagctttcagcag caaagggaagccatgaaacaaaccatagaagaagataaggagccagaaaaatcaggaaaa ctgtggtgtgcaaagaagaataaaaagaagaggaaaaaggttttatataatgccaataaa aatgatgattatgacaacgaggagatcttaacctatgaggaaatgtcactttatcatcag ccagcaaataggaagagacctatcatcttgattggtccacagaactgtggccagaatgaa ttgcgtcagaggctcatgaacaaagaaaaggaccgctttgcatctgcagttcctcataca acccggagtaggcgagaccaagaagtagccggtagagattaccactttgtttcgcggcaa gcattcgaggcagacatagcagctggaaagttcattgagcatggtgaatttgagaagaat ttgtatggaactagcatagattctgtacggcaagtgatcaactctggcaaaatatgtctt ttaagtcttcgtacacagtcattgaagactctccggaattcagatttgaaaccatatatt atcttcattgcacccccttcacaagaaagacttcgggcattattggccaaagaaggcaag aatccaaagcctcccaaagttcttggattgcaggtgtaa >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_2|214_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKTTLSQKNKAEGITLSN FKLYYKATVTKTACYWYQNRDTDQWNRTEPSEIIPHIYNHLIFDKPDKKKKWGKDALFNK WCWENWLAICRKLKLDPFLTPYTKVNSRWIKDLNVRPKTRKTLEENLGNTIQDIGMGKDF MTKTPKAMATKLKIDKWDLIKELLHSKRNYHQSE >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_2|645_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcattgccaagacaaccctaagccaaaagaataaagctgaaggcatcaccctatccaac ttcaaactatactacaaggctacagtaaccaaaacagcatgctactggtaccaaaacaga gatacagaccaatggaacagaacagagccctcagaaataataccacacatctacaaccat ctgatctttgacaaacctgacaaaaagaagaaatggggaaaggatgccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaagttaattcaagatggattaaagacttaaatgttagacctaaaaccaga aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgactaaaacaccaaaagcaatggcaacaaaactcaaaattgacaaatgggatctaatt aaagagcttctgcacagcaaaagaaactaccatcagagtgaatag >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_3|326_aa MPLEHRMTYTEPARRPTTARVVAVLSSHFPRRDGKGSNKRGQVTQPADSRRGRHCGGQFW SYCSLGCPAGDPSRKVIVRMSGKDRIEIFPSRMAQTIMKARLKGAQTGRNLLKKKSDALT LRFRQILKKIIETKMLMGEVMREAAFSLAEAKFTAGDFSTTVIQNVNKAQVKIRAKKDNV AGVTLPVFEHYHEGTDSYELTGLARGGEQLAKLKRNYAKAVELLVELASLQTSFVTLDEA IKITNRRVNAIEHVIIPRIERTLAYIITELDEREREEFYRLKKIQEKKKILKEKSEKDLE QRRAAGEVLEPANLLAEEKDEDLLFE >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_3|981_bp atgcctctggaacacaggatgacctacacggagccggcgaggaggcccacgacagcgcgg gtggtggcggtgctgtccagccacttcccgcgcagggacgggaaagggagcaacaagagg ggtcaagtgacacaaccagctgactcccgtagaggaagacactgtggaggccagttctgg agctattgcagcctcggttgcccggccggggacccgagccgaaaagttatcgtcagaatg tcgggcaaagaccgaattgaaatctttccctcgcgaatggcacagaccatcatgaaggct cgtttaaagggagcacagacaggtcgaaacctcctgaagaaaaaatctgatgccttaact cttcgatttcgacagatcctaaagaagataatagagactaaaatgttgatgggcgaagtg atgagagaagctgccttttcactagctgaagccaagttcacagcaggtgacttcagcact acagttatccaaaatgtcaataaagcgcaagtgaagattcgagcgaagaaagataatgta gcaggtgttactttgccagtatttgaacattaccatgaaggaactgacagttatgaactg actggtttagccagaggtggggaacagttggctaaattaaagaggaattatgccaaagca gtggaactactggtggaactagcttctctgcagacttcttttgttactttggatgaagct attaagataaccaacaggcgtgtaaatgccattgaacatgtcatcattccccggattgaa cgtactcttgcttatatcatcacagagctggatgagagagagcgagaagagttctatagg ttaaagaaaatacaagagaagaaaaagattctaaaggaaaaatctgagaaggacttggag caaaggagagcagctggagaggtgttggagcctgctaatcttctggctgaagagaaggac gaggatcttctatttgaataa >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_4|295_aa MVNVRSIAEMGAYVSLLEYNNIEGMILLSELSRRRIRSINKLIRIGRNECVVVIRVDKEK GYIDLSKRRVSPEEAIKCEDKFTKSKTVYSILRHVAEVLEYTKDEQLESLFQRTAWVFDD KYKRPGYGAYDAFKHAVSDPSILDSLDLNEDEREVLINNINRRLTPQAVKIRADIEVACY GYEGIDAVKEALRAGLNCSTENMPIKINLIAPPRYVMTTTTLERTEGLSVLSQAMAVIKE KIEEKRGVFNVQMEPKVVTDTDETELARQMERLERENAEVDGDDDAEEMEAKAED >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_4|888_bp atggtgaatgtcagatccattgctgaaatgggggcttatgtcagcttgctggaatacaac aacattgaaggcatgattcttcttagtgaattatccagaaggcgtatccgttctatcaac aaactcatccgaattggcaggaatgagtgtgtggttgtcattagggtggacaaagaaaaa ggatatattgatttgtcaaaaagaagagtttctccagaggaagcaatcaaatgtgaagac aaattcacaaaatccaaaactgtttatagcattcttcgtcatgttgctgaggtgttagaa tacaccaaggatgagcagctggaaagcctattccagaggactgcctgggtctttgatgac aagtacaagagacctggatatggtgcctatgatgcatttaagcatgcagtctcagaccca tctattttggatagtttagatttgaatgaagatgaacgggaagtactcattaataatatt aataggcgcttgaccccacaggctgtcaaaattcgagcagatattgaagtggcttgttat ggttatgaaggcattgatgctgtaaaagaagccctaagagcaggtttgaattgttctaca gaaaacatgcccattaagattaatctaatagctcctcctcggtatgtaatgactacgaca accctggagagaacagaaggcctttctgtcctcagtcaagctatggctgttatcaaagag aagattgaggaaaagaggggtgtgttcaatgttcaaatggagcccaaagtggtcacagat acagatgagactgaacttgcgaggcagatggagaggcttgaaagagaaaatgccgaagtg gatggagatgatgatgcagaagaaatggaagccaaagctgaagattaa >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_5|388_aa MEDGVLKEGFLVKRGHIVHNWKARWFILRQNTLVYYKLEGGRRVTPPKGRILLDGCTITC PCLEYENRPPSHLWPLWESGVGRRKSSELELKNLDSNPGSATYQLLIKLKTQTSTEYFLE ACSREERDAWAFEITGAIHAGQPGKVQQLHSLRNSFKLPPHISLHRIVDKMHDSNTGIRS SPNMEQGSTYKKTFLGSSLVDWLISNSFTASRLEAVTLASMLMEENFLRPVGVRSMGAIR SGDLAEQFLDDSTALYTFAESYKKKISPKEEISLSTVELSGTVVKQGYLAKQGHKRKNWK VRRFVLRKDPAFLHYYDPSKEENRPVGGFSLRGSLVSALEDNGVPTGVKGNVQGNLFKVI TKDDTHYYIQASSKAERAEWIEAIKKLT >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_5|1167_bp atggaggacggcgtgctcaaggagggcttcctggtcaagaggggccacattgtccacaac tggaaggcgcgatggttcatccttcggcagaacacgctggtgtactacaagcttgagggg ggtcggagagtgacccctcccaagggccggatcctcctggatggctgcaccatcacctgc ccctgcctggagtatgaaaaccgaccgccttcccacctctggcctctttgggaaagtggt gtgggtaggaggaagagctcagagttagaattgaaaaacttggattcaaatcctggctcc gccacataccagctcctcattaagctgaagactcaaacatccacggagtacttcctggag gcctgttctcgagaggagcgggatgcctgggcctttgagatcaccggggctattcatgca gggcagccggggaaggtccagcagctgcacagcctgagaaactccttcaagctgcccccg cacatcagcctgcatcgcattgtggacaagatgcacgatagcaacaccggaatccgttca agccccaacatggagcagggaagcacctataaaaagaccttcctcggctcctccctggtg gactggctcatctccaacagcttcacggccagccgtctggaggcggtgaccctggcctcc atgctcatggaggagaacttcctcaggcctgtgggtgtccgaagcatgggagccattcgc tctggggatctggccgagcagttcctggatgactccacagccctgtacacttttgctgag agctacaaaaagaagataagccccaaggaagaaattagcctgagcactgtggagttaagt ggcacggtggtgaaacaaggctacctggccaagcagggacacaagaggaaaaactggaag gtgcgtcgctttgttctaaggaaggatccagctttcctgcattactatgacccttccaaa gaagagaacaggccagtgggtgggttttctcttcgtggttcactcgtgtctgctctggaa gataatggcgttcccactggggttaaagggaatgtccagggaaacctcttcaaagtgatt actaaggatgacacacactattacattcaggccagcagcaaggctgagcgagccgagtgg attgaagctatcaaaaagctaacatga >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_6|226_aa MAVKPLKVGEQLSCALWEFHYLLPLRCWHSQCVPLKTQVAVLVIICRLLLKLSPLLPWAV GGLLEREAGVGLEEPGKKKQECFLNWQSGDRVECHINWCQWHHGDKIWRRQVATGPVEWS LGGGAGQKLMPILQGGPEKGNKAHRGILRADGWLLADAQKQFLDYKTCLETASTSETCFK YWNGYYFQGSDMLVRVLLLAKDPCGLTLPCFSSQERELKVELANPL >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_6|681_bp atggctgtgaaacctttgaaagtaggagaacagctgtcatgtgccctgtgggagtttcac tatctgctgcctcttcgctgttggcacagtcagtgtgtgccccttaaaacacaagtggcc gtattagttatcatctgccggctgctcctgaagctctccccactgctgccctgggctgta gggggcctgctggagagagaagcaggcgttggtttggaggagccaggaaagaagaagcag gagtgctttctcaactggcagagtggtgaccgagtggagtgtcacatcaactggtgtcaa tggcatcacggggataagatctggaggaggcaggtggcaacggggcctgtggagtggagt cttggtggaggggctgggcagaagttgatgcccatcttacaaggagggccagaaaaaggg aacaaggcacacagaggtatcctcagagctgatggatggctgctggcagatgcccagaaa cagtttctcgattataagacatgtctagagacagcatctacctcagagacctgttttaag tattggaatggttactatttccagggctcagatatgctggtcagagtgctgctgttggct aaggatccctgtggcctgacactcccctgcttcagcagccaggagcgggaattgaaggtg gagctggccaatccgttgtga >gi568815584r:67287332_67512059|GENSCAN_predicted_peptide_7|337_aa MATHSCDSEPEEEAVPVSEGFALELGGCDGAEPAITGCGQEGGMYLLSTDKSWSRHPLPL NPCPEYKHDFYGSLFTPLNCWPCPLPPCGTAGSESSRKGCLEAAESRGTVTLHRMLKIHS WASSGEKHRICILERLRFFYTLFHPVMSELEGRRQSWETFGVIQPAPVGAMASAEPLTAL SRWYLYAIHGYFCEVMFTAAWEFVVNLNWKFPGVTSVWALFIYGTSILIVERMYLRLRGR CPLLLRCLIYTLWTYLWEFTTGFILRQFNACPWDYSQFDFDFMGLITLEYAVPWFCGALI MEQFIIRNTLRLRFDKDAEPGEPSGALALANGHVKTD >gi568815584r:67287332_67512059|GENSCAN_predicted_CDS_7|1014_bp atggccacacacagctgtgacagtgaacctgaggaggaggctgtgcctgtgtctgagggc tttgctttggaattaggtggctgtgatggtgctgagccagccatcaccggctgtggacag gagggtgggatgtacctgctgagcactgacaagagctggtctaggcatcccctacctctc aacccgtgtccagagtataagcatgacttttatggcagcctctttacccctctcaactgc tggccatgtcccctgccaccctgtggaacagcaggctcggagagcagccggaagggctgc ctggaggctgccgagagcaggggcacagtgaccctgcatcggatgttgaagatccactca tgggcttcaagtggtgagaaacatcggatttgcattttagaaaggttgcgttttttctac acactctttcatcctgtgatgagtgaattagaaggaagaaggcaaagctgggagaccttt ggagtcatccagccagccccagtcggcgccatggcgtctgccgagcccctgacggcgctg tcccgctggtacctgtatgccatccacggctacttctgcgaggtgatgttcacagcggcc tgggagttcgtggtgaacttgaactggaagttccctggggtcacgagcgtgtgggccctc ttcatctacggcacctccatcctcatcgtggagcgcatgtacctgcggctgcgcggccgc tgcccgctgctcctgcgctgcctcatctacacgctctggacctacctgtgggagttcacc accggcttcatcctgcgccagttcaacgcctgcccctgggactactcccagttcgacttt gacttcatgggcctcatcaccctggagtacgccgtgccctggttctgcggggccctcatc atggagcagttcatcatccgcaacaccctccgcctccgcttcgacaaggacgctgagccc ggggagcccagcggcgccctagccctggccaacggccatgtcaagactgactga