GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:20:48 Sequence gi568815585r:101276700_101499126 : 222427 bp : 37.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 Intr - 2392 2195 198 1 0 65 47 98 0.055 2.03 1.12 Intr - 7320 7234 87 0 0 109 61 93 0.552 7.95 1.11 Intr - 15395 15291 105 2 0 76 101 50 0.613 4.59 1.10 Intr - 15667 15525 143 2 2 114 79 44 0.783 5.25 1.09 Intr - 25217 25127 91 2 1 29 82 92 0.006 1.35 1.08 Intr - 34544 34245 300 2 0 13 46 163 0.104 0.61 1.07 Intr - 36018 35015 1004 1 2 -5 80 488 0.033 28.25 1.06 Intr - 48975 48769 207 1 0 4 47 313 0.190 16.93 1.05 Intr - 68721 68567 155 2 2 93 48 99 0.081 5.19 1.04 Intr - 100129 100001 129 0 0 97 110 -8 0.130 1.19 1.03 Intr - 100357 100218 140 1 2 94 61 152 0.180 11.44 1.02 Intr - 118666 118484 183 1 0 32 105 185 0.922 13.66 1.01 Init - 122427 122320 108 0 0 68 100 135 0.813 12.97 1.00 Prom - 126366 126327 40 -3.85 2.00 Prom + 127733 127772 40 -6.25 2.01 Init + 131274 131278 5 2 2 61 107 0 0.571 -1.28 2.02 Intr + 131752 131875 124 1 1 69 72 149 0.345 11.07 2.03 Term + 139139 139345 207 1 0 47 50 95 0.031 -2.04 2.04 PlyA + 140637 140642 6 1.05 3.02 PlyA - 140863 140858 6 1.05 3.01 Sngl - 142778 142491 288 2 0 83 40 226 0.632 12.64 3.00 Prom - 143864 143825 40 -9.85 4.05 PlyA - 143883 143878 6 1.05 4.04 Term - 144955 144624 332 2 2 7 36 264 0.406 6.93 4.03 Intr - 147450 147321 130 0 1 73 101 5 0.408 -0.35 4.02 Intr - 155853 155689 165 0 0 102 47 45 0.236 1.04 4.01 Init - 159641 159558 84 2 0 106 43 90 0.872 7.17 4.00 Prom - 162811 162772 40 -4.05 5.07 PlyA - 163161 163156 6 1.05 5.06 Term - 171497 171222 276 2 0 75 48 119 0.841 1.08 5.05 Intr - 173336 173120 217 1 1 65 63 109 0.543 3.68 5.04 Intr - 177103 176916 188 1 2 95 43 152 0.333 8.97 5.03 Intr - 177393 177246 148 2 1 64 85 140 0.472 10.62 5.02 Intr - 179866 179573 294 0 0 20 89 169 0.403 5.40 5.01 Init - 182676 182600 77 0 2 57 99 8 0.292 -0.59 5.00 Prom - 182731 182692 40 -5.35 6.04 PlyA - 182741 182736 6 1.05 6.03 Term - 185721 185298 424 2 1 31 41 253 0.010 8.68 6.02 Intr - 187046 186915 132 1 0 75 63 83 0.013 3.34 6.01 Init - 221388 221177 212 0 2 38 93 128 0.583 6.60 6.00 Prom - 222135 222096 40 -3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 78643 79254 612 0 0 63 48 196 0.975 9.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_1|950_aa MLKRKQSSRVEAQPVTDFGPDESLSDNADILWINKPWVHSLLRICAIISVISVCMNTPMT FEHYPPLQYVTFTLDTLLMFLYTAEMIAKMHIRGIVKVFEIADIVDQMSPWGMLRIPRPL IMIRAFRIYFRFELPRTRITNILKRSGEQIWSVSIFLLFFLLLYGILGVQMFGTFTYHCV VNDTKPGNVTWNSLAIPDTHCSPELEEGYQCPPGFKCMDLEDLGLSRQELGYSGFNEIED KTDISRKKAASTSLDDEQRKEAAAEEKHILLTFEEGLRGPLPPENGTWEESLDGRQYKIK GQQLAVLYHSAIKLELRIKKLTQNRSTTWKLNNLLLNEYWVHNEMKAEIKMFFETNENKD TTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEIA KIRAELKEIETQKTFQKINESRSWFFEKINKIDRPLTRLIKKKREKNQIDAIKNDKGDIT TDPTEMQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAI INSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPG RDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQLISNFSKVSGYKINVQKS QAFLYTNNRQTESQIMSELPFTVASKRIKYLAIQLTRDVKDLFKENYKPLLNEIKEDTNK WKNIPCSWVGRINIVKMATLPMRLSGDQQYHHCPSASMEVEVQGVLTHTGYLGTSIFTVY EAASQEGWVFLMYRAIDSFPRWRSYFYFITLIFFLAWLVKNVFIAVIIETFAEIRVQFQQ MWGSRSSTTSTATTQMFHEDAAGGWQLVAVDVNKPQGRAPACLQGKFHYFRDHTSHGHFC NCALDMRDDQLCLLTENSPVKEGGSTFLAYESTYLTSRLEAALFVLLVLS >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_1|2850_bp atgctcaaaaggaagcagagttccagggtggaagcccagccagtcactgactttggtcct gatgagtctctgtcggataatgctgacatcctctggattaacaaaccatgggttcactct ttgctgcgcatctgtgccatcatcagcgtcatttctgtttgtatgaatacgccaatgacc ttcgagcactatcctccacttcagtatgtgaccttcactttggatacattattgatgttt ctctacacggcagagatgatagcaaaaatgcacatccggggcattgtcaaggtgtttgaa attgctgatatagttgatcagatgtcaccttggggcatgttgcggattccacggccactg attatgatccgagcattccggatttatttccgatttgaactgccaaggaccagaattaca aatattttaaagcgatcgggagaacaaatatggagtgtttccatttttctacttttcttt ctacttctttatggaattttaggagttcagatgtttggaacatttacttatcactgtgtt gtaaatgacacaaagccagggaatgtaacctggaatagtttagctattccagacacacac tgctcaccagagctagaagaaggctaccagtgcccacctggatttaaatgcatggacctt gaagatctgggacttagcaggcaagagctgggctacagtggctttaatgagatagaggat aaaacggatatcagccgtaagaaagcagctagcactagcctggatgatgaacaaagaaaa gaagcagctgctgaggaaaagcacattttgctgacctttgaggagggcttaagaggccct ttgccccctgaaaatgggacctgggaagaaagtcttgatggcagacagtacaaaatcaag ggacagcagctggctgttttgtaccacagtgcaatcaaactagaactcaggattaagaaa ctcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgaatactgg gtacataatgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagac acaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagca ctaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaatta aaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaatagct aaaatcagagcagaactgaaggaaatagagacacaaaaaacctttcaaaaaattaatgaa tccaggagctggttttttgaaaagatcaacaaaattgatagacctctaacaagactaata aagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcacc accgatcccacagaaatgcaaactaccatcagagaatattacaaacacctctacgcaaat aaactagaaaatctagaagaaatggataaattcctcgacacgtacaccctcccaagacta aaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaattgtggcaata atcaatagcttaccaaccaaaaaaagtccgggaccagatggattcacagccgaattctac cagaggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaa gagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggc agagacacaaccaaaaaagagaattttagaccaatatctttgatgaacattgatgcaaaa atcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccat gatcagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatca caagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactccca ttcacagttgcttcaaagagaataaaatacctagcaatccaacttacgagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccacactg cccatgcgccttagtggtgatcagcagtatcatcactgccctagtgcatccatggaagtt gaggtccagggtgtactgactcacacaggttacctaggaactagtatattcaccgtctat gaggccgcctcacaggaaggctgggtgttcctcatgtacagagcaattgacagctttccc cgttggcgttcctacttctatttcatcactctcattttcttcctcgcctggcttgtgaag aacgtgtttattgctgttatcattgaaacatttgcagaaatcagagtacagtttcaacaa atgtggggatcgagaagcagcactacctcaacagccaccacccagatgtttcatgaagat gctgctggaggttggcagctggtagctgtggatgtcaacaagccccagggacgcgcccca gcctgcctccaggggaaattccattactttagagaccataccagtcatggtcacttctgc aactgtgctttagacatgagagatgatcaattatgtttattaaccgagaacagtccagtg aaagaaggaggcagtaccttcttggcttatgaaagtacttatttgacctctcgcttagaa gctgctctgtttgtcctcctggtcttgtct >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_2|111_aa MSNTAENMLAGELRQHQKTDTLGITAAMRVSTTEQRPTPLPLQRFTQAAALVPRGQVQRC SDAQWPRPQPPAPQHLLDAQAGQARGVAGPGLPKPAQGHPAAGAPDTHAQE >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_2|336_bp atgagcaacacagcagagaacatgttagcaggagaactaagacagcatcaaaaaactgac accctgggcatcacagcggcaatgcgtgtctccacaacagaacagcggcccactcccctg cccctgcagaggtttacccaggcagcggccctggtgccgagggggcaggtgcagcgctgt agtgatgcccagtggcctcgtcctcagccacctgccccccaacacctgctagatgcccag gcggggcaggcccgaggtgtcgccggccctgggcttcccaagccagcccaggggcatccc gcggcaggcgcccccgacacccacgcgcaggaatag >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_3|95_aa MVEGERHVLHGSRREKMKTKRKGLIKPSDPMRLIHYHKNSMGETAPMIQLSPTRSLLQHM GIVGATIQDEIWMGTQTNDITSIPKHLQDHEHNFI >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_3|288_bp atggtggaaggcgaaaggcacgtcttacacggcagcagacgagagaaaatgaagaccaag agaaagggccttataaaaccatcagatcccatgagacttattcactaccacaagaacagt atgggggaaactgcccccatgattcaattatctcccaccaggtccctcctacaacacatg ggaattgtgggagctacaattcaagatgagatttggatggggacacagacaaatgatatc acatccattccaaagcacttgcaggaccatgaacacaatttcatttaa >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_4|236_aa MAAALSRFPEIRNFCWYRWDAPAPVELQAVGSGLRLVPSWLPDGCVQSKPHQLIKLTQNI LWEVKHVSDLTFLFKNEKDLSQKLISMTQLKAICQPFKDLHSDSADGFLPPTCAHLFLVL QITQLQFLLRNTFCKAIAAIDSYSSGGSGLSKLKTLWKGFTIPDAVKYIPDSWKNIKISA LTGAWINLIPTPVDDFEGFKTSVEEVTADLVEIVRELELEIEPEDVTELLQSHDKT >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_4|711_bp atggctgctgccttgtccaggttccccgagattcgcaacttctgttggtataggtgggat gcgccagctcctgtagaacttcaggctgttggctctggtctcagacttgtcccttcatgg ttaccagatggctgtgtccagtctaaacctcaccaactcataaaactcactcagaatatc ctctgggaagtaaagcatgtctctgacttaacgttcctttttaagaatgaaaaagatctt tctcagaagctgatttcaatgacacaattaaaggccatatgccaacctttcaaagacctc cactctgactcagcagatggatttttaccacctacttgcgctcacctgtttttagtgctt cagataacacagctacaattcttattaagaaatacattttgtaaggctatagctgctata gatagctattcttctggtggatctggactaagtaaattgaaaaccctctggaaaggattc accattccagatgctgttaagtacattcctgattcatggaagaatataaaaatatcagct ttaacaggggcttggataaacttgattccaacccccgtggatgactttgaggggttcaaa acttcagtggaggaagtaactgcagatttagtggaaatagtgagagaactagaattagaa atagagcctgaagatgtgaccgaattgctacaatctcatgataaaacttaa >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_5|399_aa MLNVTQNRRFFISIRILLQKKALNIRSWRGLRKLLYFWKGPTGTFMEVKAQGHINIGNLL YFHIPVIQLELSLKNKTEKLNLKHGIHRRARKAALGATCRDLRMGPTNVVRAFAGTDSAQ GPVECSRRRSRTPTHGTHTGARRTCPAQSRGRCRRRSRSGPGRGTARPPGAAQCPLQVQT YNPKNRTPEAHISLGSKRCLEGLYFPKQTGITHQRNDQGPKCVMGDLPPDTGPSSSFQNL KDAHVPRKVLSFSLLLAGDTAVGWMHNEGCLKGPGASGTEFKKIENTLVLEWSPSVLRLH NQRISAFKIPSPGPMYPPISLMTVITADESQLPLSPRLALCIRGQASQRLITQIQDSSLI STSNSIDIIKPLTYGNCRAGQLHHCASVNNAEKTQRFKN >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_5|1200_bp atgctcaatgtcacacagaatagaaggttcttcatatccataaggatactgttacagaaa aaggcactaaatatcagaagttggagaggtctccgaaagttactctatttctggaaagga cctacaggaactttcatggaggtgaaagcacaggggcacataaacataggaaacctgcta tatttccatattccagtgatacaactggaactatctcttaaaaataaaactgagaagcta aatttgaagcatggaatacatcgacgggctcgcaaggcagccctaggtgctacttgtagg gatctccgcatgggccccacaaacgtggtcagggcatttgcggggacagattctgctcaa ggccctgtagagtgctcccgtcgtaggtctcgcacacccactcatggcactcacacaggg gcccgaagaacatgcccggctcagtcacgtggcagatgcagacgccgcagtcgcagcggc cccggccgtggcacagcgcggcccccgggggctgcccagtgtcccctccaggtgcagacg tacaaccccaagaacaggacgcctgaggcccacatctctctggggtccaagcgctgcctg gaagggctgtattttcccaagcaaacaggtatcacccaccagagaaatgaccagggccca aaatgtgtaatgggtgacctgcccccggacacaggtccttcctccagttttcagaacctc aaggacgcacatgtgccaagaaaggtcctgagtttctcacttctgctggctggggacaca gcagtaggttggatgcataatgaagggtgcctgaaaggaccgggtgcatctggaactgaa ttcaaaaagattgaaaacaccctagtccttgaatggagtccttcagtcctcaggttacac aatcagagaatttcagccttcaagatcccctctccggggcccatgtatccacccatctcg ctgatgacagtaataactgctgatgagtcacagctgcccctctcaccgagacttgccctg tgcataagaggacaggcttcccagaggctgatcactcagattcaagacagttccctaatc tctacctccaacagtattgacatcattaagcctctcacatatggaaactgtagagctgga cagctccatcattgtgcttctgtcaacaatgcagaaaagactcaaagattcaaaaactga >gi568815585r:101276700_101499126|GENSCAN_predicted_peptide_6|255_aa MCIPGVPNHKTEWKTLLSEIPTQNQTLLLEKHPDQTKSPPTSAGTVDVLNCVEGTFIFES KSRECCSGPKSFATEEAVSGINLFGINNLEEIKEHPRSTGISVEGRVRSRIALIERVTML RRLWAFKDSHTLAPETCKYVTLRGKSDFADVMKFMSLGWEAYPGLSGFSLVKIHLKSGVC FPTVVRRDVMMEEALGDSGVRKVSQPVSSFEIWEWGTCKDPKVASRRQEGPQLTASKELG TTVLQPHRTKLSQHQ >gi568815585r:101276700_101499126|GENSCAN_predicted_CDS_6|768_bp atgtgcataccaggagtaccaaatcataaaactgaatggaaaaccttactgagtgaaatc ccaacacagaaccagacactactattggaaaaacatcctgatcagacaaaatctccccca acctcggctggaactgtagacgtattaaactgtgtggaaggaacattcatctttgaatcc aagagcagagaatgttgtagtggcccgaaaagttttgcaactgaagaagcagtttctggg atcaacctatttgggatcaacaatttagaggagatcaaggaacaccctagaagtacaggc atctcagtggagggaagagtcagaagcagaattgcacttatagagagggtgactatgctg agaagactatgggccttcaaagatagccatacccttgctcctgaaacctgtaaatatgtc accttacgtggcaaaagtgactttgctgatgtgatgaagtttatgagtcttggatgggaa gcttatcctggattatctgggttcagtctagttaaaatacaccttaaaagtggagtatgt tttccgacggtggtaagaagggatgtgatgatggaagaggctctgggagacagcggtgtg agaaaggtctcacaacctgtttccagttttgagatttgggagtggggtacttgtaaggac ccgaaagtagcttctagaagacaagagggaccccaactgacagccagcaaggaactgggg accacagtcttacaaccacacagaactaaattatctcaacaccaataa