GENSCAN 1.0    Date run: 4-Nov-116    Time: 09:55:39

Sequence gi568815589r:83561790_83807679 : 245890 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    242    237    6                               1.05
 1.07 Term -   3869   3788   82  1  1   44   36   128 0.236  -0.61
 1.06 Intr -   4135   3989  147  0  0   97   42    63 0.308   1.03
 1.05 Intr -   5470   5053  418  2  1   39   43   228 0.329   5.56
 1.04 Intr -   9066   8946  121  0  1  114   78    36 0.238   4.35
 1.03 Intr -  23347  23246  102  1  0  117    1    98 0.136   3.45
 1.02 Intr -  23841  23764   78  0  0   86  105    60 0.787   6.33
 1.01 Init -  29044  28988   57  1  0  107    4    65 0.367   1.36
 1.00 Prom -  32751  32712   40                              -8.05

 2.00 Prom +  33263  33302   40                              -5.15
 2.01 Init +  35053  35091   39  0  0   64   72    81 0.773   4.34
 2.02 Intr +  37144  37313  170  0  2   68   79    90 0.802   3.92
 2.03 Term +  37383  37635  253  0  1   93   47   117 0.687   2.23
 2.04 PlyA +  38733  38738    6                               1.05

 3.00 Prom +  42700  42739   40                              -9.45
 3.01 Sngl +  44741  45043  303  1  0   35   38   274 0.883  12.68
 3.02 PlyA +  45166  45171    6                               1.05

 4.00 Prom +  51564  51603   40                              -3.65
 4.01 Init +  61383  61432   50  2  2   57   50   135 0.736   5.07
 4.02 Intr +  66392  66422   31  2  1   94   88    39 0.160   1.71
 4.03 Intr +  67084  67170   87  0  0   49   92   136 0.177   9.35
 4.04 Intr +  79759  79802   44  0  2   97  116    -7 0.950  -0.88
 4.05 Term +  81640  81991  352  1  1  103   48   279 0.980  18.37
 4.06 PlyA +  82314  82319    6                               1.05

 5.16 PlyA -  86814  86809    6                               1.05
 5.15 Term - 100150  99998  153  1  0   69   43   162 0.889   6.74
 5.14 Intr - 102254 102086  169  1  1   76   69   211 0.998  17.03
 5.13 Intr - 103356 103241  116  0  2  120   61    62 0.965   5.03
 5.12 Intr - 107538 107396  143  1  2   29  109    56 0.150   0.95
 5.11 Intr - 116172 115938  235  0  1   72   69   219 0.952  14.74
 5.10 Intr - 116810 116652  159  2  0   32   58   137 0.942   4.36
 5.09 Intr - 118248 117986  263  1  2  102   55   135 0.991   7.88
 5.08 Intr - 121277 121162  116  1  2   95   97    57 0.995   6.47
 5.07 Intr - 124366 124215  152  1  2   55   80    98 0.944   3.74
 5.06 Intr - 132125 132072   54  2  0   53   92    61 0.657   1.26
 5.05 Intr - 145047 145015   33  0  0  125   69    21 0.723   1.40
 5.04 Intr - 145892 145711  182  0  2   80   94   284 0.842  26.87
 5.03 Intr - 146335 146136  200  0  2   90   54   151 0.989   9.97
 5.02 Intr - 146719 146537  183  0  0   84   81    34 0.392   0.28
 5.01 Init - 147628 147612   17  1  2   67   95     1 0.607  -1.49
 5.00 Prom - 147855 147816   40                              -6.05

 6.00 Prom + 150144 150183   40                              -9.15
 6.01 Init + 150718 151057  340  0  1   55  111   487 0.174  45.17
 6.02 Term + 151478 151638  161  0  2   14   32   152 0.553  -0.68
 6.03 PlyA + 151677 151682    6                               1.05

 7.14 PlyA - 151862 151857    6                               1.05
 7.13 Term - 177955 177908   48  1  0  128   38    32 0.814  -1.47
 7.12 Intr - 180240 180163   78  0  0   61   85    64 0.724   2.23
 7.11 Intr - 180811 180741   71  2  2   58  110    52 0.570   2.38
 7.10 Intr - 186583 186520   64  1  1   63  106    31 0.207  -0.23
 7.09 Intr - 191570 191469  102  2  0  109   68    83 0.377   7.85
 7.08 Intr - 196551 196524   28  2  1   78   87    15 0.003  -2.40
 7.07 Intr - 203854 203721  134  1  2   91   34    93 0.002   2.72
 7.06 Intr - 207181 207020  162  1  0   66    9   115 0.001   0.65
 7.05 Intr - 218615 218593   23  0  2  111  107     0 0.083   0.54
 7.04 Intr - 223049 222926  124  2  1   87   90    78 0.973   7.14
 7.03 Intr - 226889 226812   78  2  0   86   63    83 0.935   4.43
 7.02 Intr - 237539 237396  144  2  0  100   87   125 0.988  13.16
 7.01 Init - 244728 244513  216  0  0   68   77   151 0.864  10.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_1|334_aa
MGQNGEDLLTAMVVELLNERFLAQDGCLELQPSSLYSRQQEGRREAAATAHKVYNPDIAG
LRVCQSPREAPNERSDVPKVQTTIISCLDYGSRFLTGLPASALASLHVFSTQQQVFLSVS
APAVAERGQHLALTVASEGGSPKPWQLPRGVEPVGAQKSGTEVWEPLPRFQKMYGNVWMP
RQKFAAGAGPSWRTSARAVWKGNVGLEPSHRVPTGALPSGTVKRGPPSSRPQNGRSTNSL
HCVPGKASDTQCQPVRAAGWEYAEPALSVNRITSGLFDAKGDQFPNSFLKFLHWPAVLPK
IPVIGKFSHQADIEEMGLGQSDFDRALRAVSLQV

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_1|1005_bp
atgggccaaaatggagaagacctcttgactgccatggttgtagaactgcttaatgagcgt
ttcttggcccaagatggctgcttggagctccagccatcaagtctgtattccaggcagcag
gaaggaagaagagaggctgctgccaccgcccacaaagtttacaacccagacatagcagga
ctcagggtttgccagtctccccgagaggctccaaatgagagaagtgatgttcctaaggtc
caaaccaccatcatttcttgcctagattacggaagccgattcctaactggtctccctgct
tctgcccttgcctctcttcatgtgttctcaacacagcagcaagtgttcctgtcagtttct
gctccagctgtggctgaaaggggccaacatttagctctgaccgtggcttcagagggtgga
agccccaagccttggcagcttccacgtggtgttgagcctgtgggtgcacagaagtcagga
actgaggtttgggaacctctgcctagatttcagaagatgtatggaaatgtctggatgccc
aggcaaaagtttgctgcaggggcagggccctcatggagaacctctgctagggcagtgtgg
aaggggaatgtggggttggagccctcacacagagtccctactggggcactgcctagtgga
actgtgaaaagagggccaccgtcctccagaccccagaatggtagatccaccaacagcttg
cactgtgtgcctggaaaagcctcagacactcaatgccagcccgtgagagcagccggatgg
gagtatgcagagccagctctaagtgttaacagaataactagtggcttatttgatgcaaaa
ggagaccaatttcctaactcctttctcaaatttctccactggcctgccgtattgcctaaa
attcccgtaattggaaaattcagtcaccaggcagacattgaagaaatgggactgggacag
tcagattttgacagggcgctacgggcagtatctttgcaggtgtaa

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_2|153_aa
MNVKPKGDKEAKGASALKESSPLKESFIEEAMEISMRLALISAFQRLISVWSFPPPRLLH
GGRRGICWKHYGKQPSRHSQSVIHIHSQSVTHINSQSVTHICSQFVTTHICSLFGTLSSE
TALPAAVAPTPAPFEVANLDQLRLCGPTPAKGD

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_2|462_bp
atgaacgtgaagccaaagggagataaagaagccaaaggggcttcagcgttgaaagaatct
tctccactgaaggaatccttcatagaagaggccatggaaatatctatgaggcttgctctc
atatcagcattccagagactcatctctgtgtggtcctttcccccacctcgcctgctccat
ggagggaggagagggatatgttggaaacattacggcaaacaaccttccagacattcccaa
tctgtaatccacatccattcccaatctgtaactcacatcaattcccaatctgtaacccac
atctgttcccaatttgtaacaacccacatctgttccttatttggcacgcttagttccgaa
actgctcttcctgctgctgtagcccccacccctgctccatttgaagtagctaatctggat
cagcttagattgtgtggtccgaccccagccaaaggggactga

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_3|100_aa
MLVQSGVKKGATKSVDHRQGQKEVSKPNRAQNQGNAVYQCLINKKYRYKKVSVLARAIRQ
EEEIKGIQIGKVEVTLSLFADDMITYPENPKDSSKKPLGT

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_3|303_bp
atgttggtacagagcggagtgaaaaaaggggccactaagtcagtagaccacagacagggg
cagaaagaagtatctaaaccaaacagagcacaaaatcagggtaatgccgtctaccagtgt
ctgatcaacaagaaatacaggtataagaaagtatccgtcctagccagagcaatcagacaa
gaggaagaaataaagggcatccaaattggtaaagtggaagtcacattgtcactgtttgct
gatgacatgatcacatacccagaaaaccctaaagactcatccaaaaagcccctgggtaca
tga

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_4|187_aa
MAAPGALLVMGVSGSGKSTVGALLASELGWKFYDADDYHPEENRRKMGKGIPLNDQDRIP
WLCNLHDILLRDVASGQRVVLACSALKKTYRDILTQGKDGVALKCEESGKEAKQAEMQLL
VVHLSGSFEVISGRLLKREGHFMPPELLQSQFETLEPPAAPENFIQISVDKNVSEIIATI
METLKMK

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_4|564_bp
atggcggcgccgggcgcgctgctggtgatgggcgtgagcggctcggggaaatccaccgtg
ggcgccctgctggcatctgagctgggatggaaattctatgatgctgatgattatcacccg
gaggaaaatcgaaggaagatgggaaaaggcataccgctcaatgaccaggaccggattcca
tggctctgtaacttgcatgacattttactaagagatgtagcctcgggacagcgtgtggtt
ctagcctgttcagccctgaagaaaacgtacagagacatattaacacaaggaaaagatggt
gtagctctgaagtgtgaggagtcgggaaaggaagcaaagcaggctgagatgcagctcctg
gtggtccatctgagcgggtcgtttgaggtcatctctggacgcttactcaaaagagaggga
cattttatgccccctgaattattgcagtcccagtttgagactctggagcccccagcagct
ccagaaaactttatccaaataagtgtggacaaaaatgtttcagagataattgctacaatt
atggaaaccctaaaaatgaaatga

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_5|724_aa
MTGKNRAIRPLQISNAKHIKITKYLAAILSPDPPPSGERRVDREAREGKAIVTARRGEHG
PAGHEVSPSHWLRLHFGQWSGKGKGTQAELKVAPAGDDSATPPGALPGVLRAAPAKGRGG
RGREEEAVAAADVAMAESGESGGPPGSQDSAAGAEGAGAPAAAASAEPKIMKVTVKTPKE
KEEFAVPENSSVQQYVYHLLDANHWPCRISISIPDEETKAQRQFKEEISKRFKSHTDQLV
LIFAGKILKDQDTLSQHGIHDGLTVHLVIKTQNRPQDHSAQQTNTAGSNVTTSSTPNSNS
TSGSATSNPFGLGGLGGLAGLSSLGLNTTNFSELQSQMQRQLLSNPEMMVQIMENPFVQS
MLSNPDLMRQLIMANPQMQQLIQRNPEISHMLNNPDIMRQTLELARNPAMMQEMMRNQDR
ALSNLESIPGGYNALRRMYTDIQEPMLSAAQEQFGGNPFASLVSNTSSGEGSQPSRTENR
DPLPNPWAPQTSQSSSASSGTASTVGGTTGSTASGTSGQSTTAPNLVPGVGASMFNTPGM
QSLLQQITENPQLMQNMLSAPYMRSMMQSLSQNPDLAAQMQNPDTLSAMSNPRAMQALLQ
IQQGLQTLATEAPGLIPGFTPGLGALGSTGGSSGTNGSNATPSENTSPTAGTTEPGHQQF
IQQMLQALAGVNPQLQNPEVRFQQQLEQLSAMGFLNREANLQALIATGGDINAAIERLLG
SQPS

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_5|2175_bp
atgacagggaaaaacagggcaatacgacccctccaaatatcaaacgctaagcatattaaa
ataacaaaatacctcgccgctattttatcaccagaccctcctcccagcggggaacgcaga
gtagacagagaggcaagggagggcaaggcgattgtgacagcgcggaggggtgaacacggc
ccagctgggcatgaggtaagtccttcacactggcttcgcctgcactttgggcaatggagc
gggaaaggaaaaggcacgcaggctgagctcaaagtcgcgccagccggggatgactcggcg
acgccgccaggcgcgttacccggcgtgctccgcgcggcgccggcgaagggacgtggggga
aggggcagggaggaggaagcggtggctgctgcggatgtcgccatggccgagagtggtgaa
agcggcggtcctccgggctcccaggatagcgccgccggagccgaaggtgctggcgccccc
gcggccgctgcctccgcggagcccaaaatcatgaaagtcaccgtgaagaccccgaaggaa
aaggaggaattcgccgtgcccgagaatagctccgtccagcagtatgtttatcatctctta
gatgccaatcattggccctgcagaataagtatctctattcctgatgaagaaaccaaggct
cagagacagtttaaggaagaaatctctaaacgttttaaatcacatactgaccaacttgtg
ttgatatttgctggaaaaattttgaaagatcaagataccttgagtcagcatggaattcat
gatggacttactgttcaccttgtcattaaaacacaaaacaggcctcaggatcattcagct
cagcaaacaaatacagctggaagcaatgttactacatcatcaactcctaatagtaactct
acatctggttctgctactagcaacccttttggtttaggtggccttgggggacttgcaggt
ctgagtagcttgggtttgaatactaccaacttctctgaactacagagtcagatgcagcga
caacttttgtctaaccctgaaatgatggtccagatcatggaaaatccctttgttcagagc
atgctctcaaatcctgacctgatgagacagttaattatggccaatccacaaatgcagcag
ttgatacagagaaatccagaaattagtcatatgttgaataatccagatataatgagacaa
acgttggaacttgccaggaatccagcaatgatgcaggagatgatgaggaaccaggaccga
gctttgagcaacctagaaagcatcccagggggatataatgctttaaggcgcatgtacaca
gatattcaggaaccaatgctgagtgctgcacaagagcagtttggtggtaatccatttgct
tccttggtgagcaatacatcctctggtgaaggtagtcaaccttcccgtacagaaaataga
gatccactacccaatccatgggctccacagacttcccagagttcatcagcttccagcggc
actgccagcactgtgggtggcactactggtagtactgccagtggcacttctgggcagagt
actactgcgccaaatttggtgcctggagtaggagctagtatgttcaacacaccaggaatg
cagagcttgttgcaacaaataactgaaaacccacaactgatgcaaaacatgttgtctgcc
ccctacatgagaagcatgatgcagtcactaagccagaatcctgaccttgctgcacagatg
cagaatcctgatacactatcagcaatgtcaaaccctagagcaatgcaggccttgttacag
attcagcagggtttacagacattagcaacggaagccccgggcctcatcccagggtttact
cctggcttgggggcattaggaagcactggaggctcttcgggaactaatggatctaacgcc
acacctagtgaaaacacaagtcccacagcaggaaccactgaacctggacatcagcagttt
attcagcagatgctgcaggctcttgctggagtaaatcctcagctacagaatccagaagtc
agatttcagcaacaactggaacaactcagtgcaatgggatttttgaaccgtgaagcaaac
ttgcaagctctaatagcaacaggaggtgatatcaatgcagctattgaaaggttactgggc
tcccagccatcatag

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_6|166_aa
MMLDKKQIRAIFLFKFKMGHKAAETTHNISNTFGPGTANERTVQWWFKKFHKGEESLEDE
EHSGQPSEVDSDQLRAIIEADPLITTPEVAEELNIDHSTVVRHLKQIGKVKKLALGKLNE
LGYEVLPHPPYSPDLSPTDYHFFKHLDNFLQGKRFHNQQDAEHAFQ

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_6|501_bp
atgatgttagacaaaaagcaaattcgagcaattttcttattcaagttcaaaatgggtcat
aaagcagcggagacaactcacaacatcagcaacacatttggcccaggaactgctaatgaa
cgcacagtgcagtggtggttcaagaagtttcacaaaggagaggagagccttgaagatgag
gagcatagtggccagccatcggaagttgacagtgaccaattgagagcaatcatcgaagct
gatcctcttataactacacctgaagttgctgaagaactcaacattgaccattctacagtc
gttcggcatttgaagcaaattggaaaggtgaaaaagcttgcacttgggaaattgaacgaa
ttgggctacgaagttttgcctcatcctccatattcacctgacctctcgccaaccgactac
cacttcttcaagcatcttgacaactttttgcagggaaaacgcttccacaaccagcaggat
gcagaacatgctttccaatag

>gi568815589r:83561790_83807679|GENSCAN_predicted_peptide_7|423_aa
MASAVLSSVPTTASRFALLQVDSGSGSDSEPGKGKGRNTGKSQTLGSKSTTNEKKREKRR
KKKEQQQSEANELRNLAFKKIPQKSSHAVCNAQHDLPLSNPVQKDSREENWQEWRQRDEQ
LTSEMFEADLEKALLLSKLEYEEHKKEYEDAENTSTQSKVMNKKDKRKNHQGKDRPLTVS
LKDFHSEDHISKKTEELSSSQTLSHDGGFFNRLEDDVHKILIREKRREQLTEYNGTDNCT
AHEHNQACKVLLIKEMCSHGVGQLCLCGFAGYSLHLLAAFTAGIECLQLFRCKPCYILPN
QHAEVVLKDGRIERLKLELERKDAEIQKLKNVITQWEAKYKEVKARNAQLLKMLQEGEMK
DKAEILLQVDESQSIKNELTIQVTSLHAALEQERSKVKVLQAELAKYQGGRKGKRNSESD
QCR

>gi568815589r:83561790_83807679|GENSCAN_predicted_CDS_7|1272_bp
atggcctcagcagtacttagttctgttcccaccaccgcttctcgttttgccctgttacaa
gtggatagtggcagtggctctgattctgaacctggaaaaggtaaaggtcgaaatactgga
aagtctcaaactttaggaagcaagtcaactacaaatgagaaaaaaagagagaaaagaaga
aaaaagaaggaacagcaacagagtgaagcaaatgagctcaggaatcttgcttttaagaaa
attccccagaaatcctcccatgctgtttgtaacgctcaacatgatcttccattgtcaaac
ccagtacagaaggattcacgagaagaaaattggcaagagtggagacaaagagatgagcag
ctgacatctgaaatgtttgaagcagatcttgagaaggcattgttactaagtaaactagaa
tatgaagagcacaaaaaggagtatgaagatgctgaaaatacttcaactcagtccaaagtt
atgaataaaaaagataaaagaaagaatcatcagggaaaagacagacctctcacagtatca
ctaaaagattttcattcggaagatcacattagtaaaaagactgaggaattgagttcttct
cagactttatcacatgatggaggattcttcaatagactggaagatgatgttcataaaatt
cttattagagaaaaacgaagagaacagcttacagaatataatggaacagataattgtaca
gctcatgaacacaaccaggcatgtaaagtcttgctgattaaagagatgtgctcccatggt
gttgggcaactctgcctctgtggctttgcagggtacagcctccacttactggctgctttc
acggctggcattgagtgtctgcagcttttccggtgcaaaccgtgttatatcctgccaaat
caacatgctgaagtggttctgaaagatggaagaattgaaagactaaagttagagcttgaa
aggaaagatgctgaaatccagaagctgaaaaatgtaatcactcaatgggaggcaaagtat
aaggaagtaaaggcaagaaatgcacaattattgaaaatgcttcaggaaggtgaaatgaaa
gataaggcagaaatacttctgcaagttgatgaatcacaaagtatcaagaatgagctcact
attcaggtgacttcacttcatgctgcattagaacaagaaagatctaaagtgaaagtatta
caagcagagttagccaaataccagggtggcagaaaagggaaaagaaactctgaatccgac
cagtgtaggtga