GENSCAN 1.0	Date run: 6-Nov-116	Time: 03:17:16

Sequence gi568815587f:9284764_9545191 : 260428 bp : 43.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   2791   2786    6                               1.05
 1.03 Term -  10624  10480  145  0  1   77   48    65 0.122  -1.22
 1.02 Intr -  26811  26657  155  0  2  105   76    82 0.296   7.57
 1.01 Init -  29678  29373  306  2  0  118   41    87 0.354   2.44
 1.00 Prom -  38151  38112   40                              -7.76

 2.00 Prom +  41952  41991   40                              -2.26
 2.01 Init +  78209  78314  106  1  1   32   59    93 0.288   1.18
 2.02 Intr +  94333  94499  167  2  2  100   82   118 0.875  12.08
 2.03 Intr +  99965 100084  120  1  0   -7   72   166 0.298   6.19
 2.04 Intr + 109451 109521   71  1  2   63   87   -25 0.022  -7.12
 2.05 Intr + 110008 110030   23  1  2  110   94    23 0.058   2.29
 2.06 Intr + 123723 123876  154  1  1   66   91    89 0.269   6.13
 2.07 Intr + 125165 125323  159  2  0   86   80    48 0.223   2.90
 2.08 Intr + 129492 129648  157  0  1   26   90   128 0.128   6.81
 2.09 Intr + 135648 135742   95  2  2   24   84    36 0.721  -4.44
 2.10 Intr + 135851 135935   85  2  1   29  108   104 0.795   6.32
 2.11 Intr + 140383 140499  117  0  0  104   61    49 0.703   4.46
 2.12 Intr + 143777 143866   90  1  0   71   75    29 0.415   0.09
 2.13 Intr + 144268 144433  166  0  1   23   98   118 0.684   5.83
 2.14 Intr + 144911 145071  161  0  2  102   76    72 0.998   7.11
 2.15 Intr + 146112 146240  129  2  0   57   94    97 0.995   8.09
 2.16 Intr + 148807 148873   67  0  1  131   35     3 0.863  -2.22
 2.17 Intr + 148958 149083  126  0  0   66  109    60 0.899   6.55
 2.18 Intr + 153127 153211   85  2  1   43   98    -6 0.186  -5.22
 2.19 Intr + 153317 153522  206  2  2  112  107   192 0.904  22.34
 2.20 Intr + 155692 155898  207  2  0   22  106   330 0.999  27.25
 2.21 Intr + 157318 157434  117  2  0   79   80    51 0.917   3.84
 2.22 Intr + 175113 175227  115  1  1   50   94    74 0.001   3.71
 2.23 Intr + 179088 179107   20  0  2   48  121    10 0.000  -3.45
 2.24 Intr + 189178 189261   84  2  0   71   86    43 0.807   2.19
 2.25 Intr + 193627 193823  197  2  2   95   75   194 0.998  17.93
 2.26 Intr + 194709 194783   75  2  0   80   91    67 0.964   5.91
 2.27 Intr + 209883 210002  120  2  0   67   71    78 0.710   4.69
 2.28 Intr + 211540 211615   76  0  1   34   98    19 0.615  -3.41
 2.29 Intr + 212912 213037  126  0  0   81   79    41 0.735   3.15
 2.30 Intr + 223856 224083  228  0  0   65   94   250 0.913  21.14
 2.31 Intr + 227685 227833  149  1  2   79  103    63 0.945   6.85
 2.32 Intr + 231438 231599  162  2  0   67   91   116 0.851   9.97
 2.33 Intr + 240477 240623  147  2  0   55  109    22 0.098   1.43
 2.34 Term + 242767 242850   84  0  0   87   36    59 0.244  -1.75
 2.35 PlyA + 242869 242874    6                               1.05

 3.02 PlyA - 242997 242992    6                               1.05
 3.01 Sngl - 257507 256797  711  2  0   79   37   282 0.854  18.43
 3.00 Prom - 257664 257625   40                             -11.14

 4.02 PlyA - 257765 257760    6                               1.05
 4.01 Sngl - 259190 257790 1401  2  0   44   42   496 0.907  36.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 129532 129648  117  0  0   85   90    83 0.826   8.40
S.002 Init + 173108 173170   63  1  0   95   64    71 0.873   4.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:9284764_9545191|GENSCAN_predicted_peptide_1|201_aa
MAKGRVAERSQLGAHHTTPVGDGAAGTRGLAAPGSRDHQKGEWARGAELGVARGFCVSGG
EKDPRYSRRVAGKEGGLLAPRREQRPWGSWAWALAQHKAGRRAPVLLPQELRKAIQLSPQ
VGPLILCHSQGIQDANPRVPTLLHTHRLLLECLLEERVNMKVPRDMDDAKALGKVLSKYK
DTFYVQVLVAYFATYILYPFN

>gi568815587f:9284764_9545191|GENSCAN_predicted_CDS_1|606_bp
atggcgaaaggcagagtcgccgaacgatcgcagttgggcgctcaccacacgacccccgtg
ggggacggggcagcggggacgcggggtctcgcggcgcctggcagcagagaccaccagaag
ggtgagtgggcccggggagcagagctgggggtggcccgaggcttctgtgtgtcgggtggg
gaaaaggacccccgctattctcggagagtcgcggggaaagaggggggcctgctggcgccg
cggcgggagcagagaccgtgggggagttgggcttgggcgctcgcgcagcacaaggccgga
cgccgagcccctgtcctcctcccccaggagctccgcaaggcaattcagctttccccccag
gttggccccctcatcctgtgccacagccagggtatccaggatgccaatcctcgggtccct
accctcctccatacccaccgtctgctcctggaatgcctcctagaagaaagagtgaatatg
aaggttcccagagatatggatgatgccaaggctctaggaaaagttttatccaaatacaag
gacaccttttatgttcaagtacttgtagcttattttgctacatatattttgtatcctttt
aactga

>gi568815587f:9284764_9545191|GENSCAN_predicted_peptide_2|1396_aa
MFQAFCNSNRAEHNDAFENLQQQGHSAIYNIYQSAETPETVAGRHTSGCTSRGTHQQMNT
QAAGRQEEHIGEGTHRWLDIKRVALTGTGMPHPSPGFDRVRAAMDPNTIIEALRGTMDPA
LREAAERQLNESEGKAISDYSVSEWNASKGAGIRFVWRIVSTSVIYLKNMITQYWPDRET
APGDISPYTIPEEDRHCIRENIVEAIIHSPELIRVQLTTCIHHIIKHDYPSRWTAIVDKI
GFYLQSDNSACWLGILLCLYQLVKNYEYKKPEERSPLVAAMQHFLPVLKDRFIQLLSDQS
DQSVLIQKQIFKIFYALVQETLQVEEDDRPELPWWKCKKWALHILARLFERYGSPGNVSK
EYNEFAEVFLKAFAVGVQQVLQKTMGFCYQILTEPNADPRKKDGALHMIGSLAEILLKKK
IYKDQMEYMLQNHVFPLFSSELGYMRARACWVLHYFCEVKFKSDQNLQTALELTRRCLID
DREMPVKVEAAIALQVLISNQEKAKEYITPFIRPVMQALLHIIRETENDDLTNVIQKMIC
EYSEEVTPIAVEMTQHLAMTFNQVIQTGPDEEGSDDKAVTAMGILNTIDTLLSVVEDHKE
ITQQLEGICLQVIGTVLQQHVLEFYEEIFSLAHSLTCQQVSPQMWQLLPLVFEVFQQDGF
DYFTENLRFPNNVEPVTNHFITQWLNDVDCFLGLHDRKMCVLGLCALIDMEQIPQVLNQV
SGQILPAFILLFNGLKRAYACHAEHENDSDDDDEAEDDDETEELGSDEDDIDEDGQEYLE
ILAKQAGEDGDDEDWEEDDAEETALEGYSTIIDDEDNPVDEYQIFKAIFQTIQNRNPVWY
QALTHGLNEEQRKQLQDIATLADQRRAAHAQFYPLADLRNFESNTPTSSLVGLVRQRKFV
LSTFGTDYIVPFLEDAKLIDGQVIQLEDGSAAYVQHVPIPKSNSYDQSALQAVQLEDGTT
AYIHHAVQVPQSDTILAIQADGTVAGLHTGDATIDPDTISALEQYAAKVSIDGSESVAGT
GMIGENEQEKKMQIVLQGHATRVTAKSQQSGEKAFRCEYDGCGKLYTTAHHLKVHERSHT
GDRPYQCEHAGCGKAFATGYGLKSHVRTHTGEKPYRCSEDNCTKSFKTSGDLQKHIRTHT
GEKPYVCTVPGCDKRFTEYSSLYKHHVVHTHSKPYNCNHCGKTYKQISTLAMHKRTAHND
TEPIEEEQEAFFEPPPGQGEDVLKGSQITYVTGVEGDDVVSTQVATVTQSGLSQQVTLIS
QDGTQHVNISQADMQAIGNTITMVTQDGTPITVPAHDAVISSAGTHSVAMVTAEGTEGEQ
VAIVAQDLAAFHTASSEMGHQQHSHHLVTTETRPLTLVATSNGTQIAVQLGEQPSLEEAI
RIASRIQQGETPGLDD

>gi568815587f:9284764_9545191|GENSCAN_predicted_CDS_2|4191_bp
atgtttcaagccttctgtaacagcaacagagcagagcacaatgatgccttcgaaaacctg
cagcaacaaggacactctgctatatacaatatctatcagagtgctgaaacccctgagact
gtagcaggcagacacacaagcggctgtacatcaagaggaacacatcagcagatgaataca
caggcagctggacgtcaagaggagcacatcggtgaaggaacacacaggtggctggacatc
aagagggtcgcgctaacaggcactggcatgccgcacccgagccccgggtttgaccgagtc
cgcgctgcgatggaccccaacaccattatcgaggccctgcggggcactatggacccagcc
ctgcgtgaggccgcggagcgccagctcaatgaatcagaggggaaagcaatttctgattac
agtgttagtgagtggaatgctagtaagggggcagggataagatttgtctggcgcattgtc
agtactagtgttatctatctgaaaaatatgataacacagtattggcctgatcgagaaaca
gcaccaggggatatatccccttatactattccagaagaagatcgccattgtattcgagaa
aatattgtagaagccattatccattctcctgagctcatcagggtacagcttactacatgc
attcatcacatcatcaaacatgattatccaagccgctggactgccattgtggacaaaatt
ggcttttatcttcagtccgataacagtgcttgttggctaggaattcttctttgcctttat
cagcttgtgaaaaattatgagtataaaaaaccagaggagcggagtccattggtagcagca
atgcagcattttctgccagttctaaaggatcgttttatccagcttctttctgaccagtct
gatcagtctgtcctcatccagaaacagatattcaagatcttctatgctcttgttcaggaa
acacttcaagttgaagaagatgatcgacctgagttaccatggtggaaatgcaagaagtgg
gccttacatattttagcaagactttttgaaagatatggaagccctggcaatgtttccaag
gagtataatgaatttgctgaagtatttctgaaggcatttgctgttggtgtccagcaagta
ctgcaaaagactatgggattttgttaccagattcttacagaaccaaatgctgaccctcga
aaaaaagatggagccctgcatatgattggctctttagctgaaatacttctgaagaaaaag
atctataaagatcagatggaatacatgttgcagaatcatgtattccctctcttcagcagt
gaactaggctacatgagagcaagggcttgctgggtacttcactatttttgtgaagtgaag
ttcaaaagtgatcagaaccttcaaacagccttagagctaacaagaagatgtctgattgat
gatagagaaatgcctgtgaaagtggaagctgccattgcccttcaagtattgatcagcaat
caagaaaaagctaaagaatatatcacaccattcatcagacctgtaatgcaggctcttctt
cacattataagagaaacagaaaatgatgaccttaccaatgtaattcagaaaatgatctgt
gaatatagtgaagaagttactcctattgcagtagaaatgacacaacatttggcaatgaca
tttaaccaagtaatccagacggggccagatgaagaaggtagtgatgacaaagcagttact
gctatgggaattctgaatacaattgatacacttcttagtgtagttgaagatcataaagag
ataacccaacagcttgagggaatctgcttacaggtcattggtactgttttacaacagcat
gtcttagaattctatgaggagatcttctctttagcgcacagtttgacatgtcaacaagtg
tctccacagatgtggcagctactaccccttgtatttgaagtctttcagcaagatggcttt
gattactttacagaaaatcttcgcttccctaataatgttgaaccagttacaaatcatttt
attacacagtggcttaatgatgttgactgtttcttggggcttcatgacagaaagatgtgt
gttctcggactctgtgctcttattgatatggaacagataccccaagttttaaatcaggtt
tctggacagattttgccggcttttatccttttatttaacggattgaaaagagcatatgcc
tgccatgcagaacatgagaatgacagtgatgatgatgatgaagctgaagatgatgatgaa
accgaggaactggggagtgatgaagatgatattgatgaagatgggcaagaatatttggag
attctggctaagcaggctggtgaagatggagatgatgaagattgggaagaagatgatgct
gaagagactgctctggaaggctattccacaatcattgatgatgaagataaccctgttgat
gagtatcagatatttaaagctatctttcaaactattcaaaatcgtaatcctgtgtggtat
caggcactgactcacggtcttaatgaagaacaaagaaaacagttacaggacatagcaact
ctggctgatcaaagaagagcagcccatgctcaattctaccccctggccgacttaagaaat
tttgagtctaatacacccacctcatctttagtcggcctggtgcgccaacgcaaatttgta
ctgagtaccttcggaactgactacatagttccatttctggaagatgcaaaactcatagat
ggccaggtcattcagttggaagatggttctgcggcctatgttcaacatgtacccatacct
aaaagtaatagttatgaccagagtgcattacaggcggttcagctggaagatggtaccaca
gcttatatccaccatgcagtgcaagtcccgcagtctgacaccatcttggcaattcaggct
gatgggacagtggcaggtctgcacactggggatgctacaattgaccctgacaccatcagt
gctttggaacagtatgcagcaaaggtgtccattgatggaagtgaaagtgtagcaggtact
ggaatgattggagaaaatgagcaagagaaaaaaatgcagattgttttacaaggacatgct
acaagagtaactgctaaatctcaacagagtggagagaaggcatttcgatgtgaatatgat
ggatgtggaaaattatatacaacagctcatcatctcaaggtccatgagaggtcacacaca
ggagatcggccttatcagtgtgagcatgcaggctgtgggaaggcatttgcaacaggttat
ggattaaaaagtcacgtcagaactcatacaggagaaaagccatatcggtgttcggaagat
aattgtactaaatctttcaaaacttcaggagatctacagaaacacatcagaactcataca
ggagaaaagccatatgtttgtacagttcctgggtgtgacaaaaggtttacagaatattcc
agtttgtacaaacatcatgttgtccacactcattccaaaccttacaactgtaaccactgt
gggaagacatacaagcagatctccacgctggccatgcacaaacggacagcccacaacgac
actgagcccatcgaggaggagcaggaagccttctttgagccgcccccaggtcaaggtgaa
gatgttcttaaagggtcccagattacgtatgttacaggtgtagaaggggacgacgttgtt
tctacacaagtagccacagtaacccaatctggactgagtcaacaagttacactcatatcc
caggatgggactcagcatgtcaacatatctcaagctgacatgcaggccattggcaacacc
atcacaatggtaacgcaggatggcacgcccatcacagtccccgcccatgatgcagtcatc
tcctcagcaggaacgcactctgttgctatggttactgctgagggtacagaaggggaacag
gttgcaattgtagctcaagacttggcagcattccatactgcctcatcagaaatggggcac
cagcagcatagccatcacttagtaaccacagaaaccagacctctgaccttagtagcaaca
tccaatggcacccagattgcagttcagcttggagaacagccatctctggaagaagccatc
agaatagcgtctagaatccaacaaggagaaacgccagggttggatgattaa

>gi568815587f:9284764_9545191|GENSCAN_predicted_peptide_3|236_aa
MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS
KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY
RFNAIPIKLPMTFFIELEKTTLKFIWNQKRSRIAKSILSQKNKAGGITLPDFKLYYKATV
TKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKKWGKDSLFNKWCWEN

>gi568815587f:9284764_9545191|GENSCAN_predicted_CDS_3|711_bp
atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac
accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag
aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca
tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttac
agattcaatgccatccccattaagctaccaatgactttcttcatagaattggaaaaaact
actttaaagttcatatggaaccaaaaaagatcccgcatcgccaagtcaatcctgagccaa
aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta
accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag
ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaac
aagaaatggggaaaggattccctatttaataaatggtgctgggaaaactag

>gi568815587f:9284764_9545191|GENSCAN_predicted_peptide_4|466_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDFYRTLHPKSTEYTFFSAPHHTYS
KIDHIVGSKAPLSKRKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKTDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESKSWFFERINKIDRPLARLIK
KKREKNQIDAIKNDKGDITTDPTEIQTTIRECYKHVYANKLENLEEMDKFLDTYTLPRLN
QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLELFQSIEKE
GILPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKILNKILANRIQQHMKKLIHHD
QVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKI

>gi568815587f:9284764_9545191|GENSCAN_predicted_CDS_4|1401_bp
atgggagactttaacaccccactgtcaacattagacagatcaaccagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacttctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatagttggaagtaaagctcccctcagcaaacgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccgctcaactacatggaaactaaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaactgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaacgaatcc
aagagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag
aaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatgctacaaacatgtctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggaactggtaccattccttctggaactattccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagctgggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatgaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa
aaggcctttgacaaaatttaa