GENSCAN 1.0	Date run: 8-Nov-116	Time: 13:40:47

Sequence gi568815583r:89372738_89596544 : 223807 bp : 47.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5735   6139  405  1  0   40   57   197 0.008   7.34
 1.02 Intr +   6311   6532  222  1  0   63  111   155 0.618  13.62
 1.03 Intr +   6571   6670  100  0  1   77   82    83 0.225   6.28
 1.04 Intr +  15080  15144   65  0  2  119   98    51 0.543   7.64
 1.05 Intr +  24537  24718  182  2  2   33   80   161 0.351   8.47
 1.06 Intr +  27215  27287   73  2  1   54   26    34 0.414  -6.79
 1.07 Intr +  27438  27635  198  2  0   96   94    64 0.597   7.35
 1.08 Term +  35057  35215  159  1  0   62   43   136 0.493   4.44
 1.09 PlyA +  35930  35935    6                               1.05

 2.03 PlyA -  36244  36239    6                               1.05
 2.02 Term -  36465  36266  200  1  2   -3   49   130 0.669  -2.44
 2.01 Init -  37153  36925  229  1  1   98  107   153 0.986  15.59
 2.00 Prom -  43197  43158   40                              -4.26

 3.00 Prom +  55964  56003   40                              -5.46
 3.01 Init +  62738  62785   48  1  0   47   68   123 0.181   5.15
 3.02 Intr +  75119  75195   77  1  2   62  105    40 0.033   1.41
 3.03 Intr +  79075  79172   98  1  2   55   75    97 0.279   4.75
 3.04 Intr +  87912  87987   76  1  1   77   93    35 0.290   1.47
 3.05 Intr +  88707  88729   23  0  2   98   72    35 0.337   0.09
 3.06 Intr +  90325  90443  119  2  2  112   68    67 0.897   7.28
 3.07 Term +  91160  91261  102  1  0  118   44    17 0.492  -1.42
 3.08 PlyA +  94704  94709    6                               1.05

 4.15 PlyA -  98698  98693    6                               1.05
 4.14 Term - 100126  99998  129  1  0   84   44    75 0.676   0.88
 4.13 Intr - 104091 104018   74  1  2   63  105    43 0.857   2.63
 4.12 Intr - 104469 104345  125  2  2  128   80   113 0.956  14.73
 4.11 Intr - 104916 104780  137  0  2   86   61   167 0.999  13.17
 4.10 Intr - 105237 105100  138  0  0   80  103   243 0.997  25.66
 4.09 Intr - 106751 106585  167  0  2   49   89   243 0.984  20.18
 4.08 Intr - 107671 107524  148  1  1   60   77   216 0.840  17.51
 4.07 Intr - 110480 110330  151  1  1  104  105   192 0.868  22.66
 4.06 Intr - 114248 114062  187  0  1   78   55   593 0.424  53.75
 4.05 Intr - 124003 123624  380  0  2   98  108   446 0.080  42.21
 4.04 Intr - 140975 140842  134  2  2   24   34   111 0.005  -1.26
 4.03 Intr - 144700 144665   36  1  0  103   82    14 0.059   0.76
 4.02 Intr - 153747 153656   92  1  2   68   44    75 0.008   0.81
 4.01 Init - 162876 161739 1138  0  1   69   75   406 0.311  31.61
 4.00 Prom - 179196 179157   40                              -4.76

 5.03 PlyA - 179465 179460    6                               1.05
 5.02 Term - 186101 185467  635  0  2    1   45   273 0.252   8.75
 5.01 Init - 193285 193216   70  1  1   78   55    65 0.107   1.41
 5.00 Prom - 193495 193456   40                              -7.96

 6.03 PlyA - 196215 196210    6                               1.05
 6.02 Term - 196511 196363  149  0  2   37   42   178 0.984   6.16
 6.01 Init - 199853 199805   49  2  1   86   58    81 0.811   4.01
 6.00 Prom - 200720 200681   40                              -7.66

 7.00 Prom + 201056 201095   40                              -4.36
 7.01 Init + 202871 203503  633  1  0   42  116   913 0.919  82.85
 7.02 Intr + 209949 210228  280  2  1   57  121    51 0.587   2.35
 7.03 Intr + 211549 211757  209  2  2   48   99    22 0.459  -1.90
 7.04 Intr + 212971 213205  235  0  1   48  107   192 0.961  14.36
 7.05 Intr + 222656 222874  219  0  0   54   61    94 0.025   1.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +   5860   6139  280  1  1   56   57   166 0.937   7.78
S.002 Term + 154181 154281  101  0  2  104   41   102 0.896   5.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_1|467_aa
MAALAVLAPEAASWTPGVRGSLRRSVARALAGSGAAASRPRRIRPAPAGLELDPELGKLE
KQAQTRKPRYGKSLSKESFPRFTPRLHQGTAWAYGLPNPKRPQPRAGIRPSGREGRRRPQ
SAGAGGARPQDARGWPRGPQKRNARRERPVRSPQTDARLFITARPRGKSGCRLPAHSPPD
PPSRLPGPRSRSRRARLCVSWLLGRKRLPGRKRPSPRGEQWRYFGVCNRERFKLFDYFRG
VLGHALESEDSEKRSRSPGGKSLKRRSITVLGDGEQGCEWAYLHKQHVSGEKDITEQKSA
GAQECTCHPYFIPERVNGLEGVSERTEGGRKSPVPAQMGFPAIFRTKCQEPPPPAGAAPV
PWSRCPRSVPPKDLGARAGEGVEGGVEARYEPVGASPTPQSQRQTVLGRTDARAELHGGR
GLCELGPRAALHLPGFKYLGEAAQAAVLIAVSKYLLMNKHQPHLSGT

>gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_1|1404_bp
atggcggctctggcggttctagcaccggaggcggcttcttggacacccggcgttcgcggg
agcctgaggcgctcggttgctagagccttagctgggagtggcgcggctgcttcccgcccg
cgcaggatcaggccggcccccgcgggcctggagctggatccagagctagggaaactggaa aaacaggcacaaactcggaagccgcggtacggcaagagcctaagcaaagaatcctttcca agattcacacctcgtctacaccagggcaccgcctgggcctacggccttccgaacccgaag cgcccgcagcccagagctggcatcaggccatcaggccgggaaggtcgtcgcaggccccag agtgcgggcgcggggggcgcgcgcccacaggacgcccggggttggccgcgcggcccacag aagcggaacgcgcgtcgagagcgccctgtccgctcgccccagacagatgcccggttattc attaccgcgaggcctagaggaaagagtggctgccgtcttcctgcccacagcccgccggac cctccgtcgcggctgcccggtccccggagccgcagccgccgagcccggctgtgcgtgtcg tggctgctggggagaaagaggcttccggggcggaaacggccttccccgcgtggggaacaa tggcgttattttggagtctgcaaccgcgagcgcttcaagctctttgattattttcgagga gtgttaggacatgctctggagtcagaagacagcgaaaagagaagcagaagccccggtggc aagagtctgaagcgcagaagtatcaccgtgctgggggatggggaacagggctgtgaatgg gcctatttgcataagcagcatgtgtctggagagaaagacatcacagagcagaagagtgcg ggtgcccaggagtgcacttgccacccctacttcatccctgaaagagtaaatggcctggaa ggtgtctctgagaggacggaaggtggccggaagagccccgtcccagctcagatgggcttc cccgccatcttccggacaaagtgtcaagagcctcctcctcccgccggcgcggcgcctgtg ccctggtcccggtgtccccgctcggtgccaccgaaggatctaggggctcgggccggggag ggggtggaggggggagtggaggcgcgatacgaacccgtgggagcctccccaaccccgcag tcccagcggcaaactgtgcttggtcggacggatgcccgcgcggagctccacggcggccgc gggctttgtgagctggggccgcgcgccgcgctgcacctgcccggatttaagtaccttggc gaggcagcgcaggcggctgttctaatcgccgtctccaagtacctgctcatgaataaacac caaccccatttatcaggtacttag >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_2|142_aa MAQKKALVRCPAELEGIPTPRPLGGFPRPRPGALVPGLGRLGADSTGTKRACVGWRQPES HETEVSGFPAGVRFATAPSQDPEGPTQSLARVEAGFSAAACVQYKWNVLGNSDMQRFSKH PVIPETRVATAMSHARTGRQKL >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_2|429_bp atggcgcagaagaaggcgcttgtgagatgccctgcggagctggagggaatccctactccc cgccctcttggcggcttcccgcggccgcgacctggagcccttgttccaggactaggaagg ctcggggctgactccacggggactaaacgagcctgcgtgggctggaggcagcccgaaagc cacgaaacagaagtgtctggttttcccgcaggggtccgatttgcaaccgcaccgtcgcag gaccccgagggtcctacccagtcattagcaagagtcgaggctggattttcagcggccgcc tgcgttcagtataaatggaacgtgctaggaaactccgatatgcagcgattttctaaacat 
cccgttatccctgaaacccgtgtggccaccgccatgagtcatgcaaggactgggcgacag aagctctga >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_3|180_aa MQLPRLWVAGAQLAPLGLHPYIGKSRNRGTGKSTDLLKSPSRYSRWCIPGVSGTDVLLIT MAFGTLHCLVLSNSASPKGHPREQHKLDGAGIGGKKAPRRVAAPSIQGPSGDSAYAQRQL QVCGQGYSLVKTKGFYVEAPTPNCPAWTTIGNSPSIPPDSSMGHSPSISPGTLLGYILHH >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_3|543_bp atgcagctgcccaggctgtgggtggctggggcccagctggcccccctgggcctgcatcct tacattggcaagtctagaaaccgaggcacggggaagtcaactgacttgctgaagtccccc agccggtacagccgctggtgtatccctggtgtatcaggcactgatgtgctgctcataact atggcctttgggacccttcactgtcttgttctcagcaactcagccagcccaaagggacat cccagagaacaacacaagctggacggtgctggaattggtgggaaaaaagccccacgccgg gtggccgccccgagcatccagggaccctcaggagatagcgcctatgcacagaggcaactg caggtttgtggccagggctactctctggtgaaaacaaaaggtttctatgtggaagcgcct acccccaactgcccggcttggaccactataggcaactctccatcgattccacctgattcc tctatgggccactctccatctatttcacctggtaccctgcttggctacatcctccaccat tga >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_4|1011_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKINTLTSQLKELE KQEQMHSKASRRQEITKIRAELKEIETQKTLQKINESRSCFFEKINKIDRPLARLTKKKR EKNQIDAIKNDKREITTNPAEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRPITGSEIEAITNSLPTKKSPGPDGFTAEFYQRYKEELIPFLLKLFQSIEKEGIL PNSFYEASIILILKPGRDTTKKENFRPISLMNIDAKILNKILANQIQQHIKKLIHHDQVG FIPGMQGWFNVRKSINVIHHKNRIKDKNHMIISIDAEKAFDKIQQPFMLKMLNKLGIDGT YVKISSYSSHNSHVLWEGPGGSLTTVQPRRKEETKMDPTSREEKVEAEAEVYREKIQAST GQAFISFLYSVIVDGSLQWRGHMGKPYPPTPLYVHLSKPFECILSALPTRGIFWGPRLGL GPCQLGERAGGARVSCSVPHVAAPPPCPGLKSPAGLRASEPPAASPARHPCSMAWNTNLR WRLPLTCLLLQVIMVILFGVFVRYDFEADAHWWSERTHKNLSDMENEFYYRYPSFQDVHV MVFVGFGFLMTFLQRYGFSAVGFNFLLAAFGIQWALLMQGWFHFLQDRYIVVGVENLINA DFCVASVCVAFGAVLGKVSPIQLLIMTFFQVTLFAVNEFILLNLLKVKDAGGSMTIHTFG AYFGLTVTRILYRRNLEQSKERQNSVYQSDLFAMIGTLFLWMYWPSFNSAISYHGDSQHR AAINTYCSLAACVLTSVAISSALHKKGKLDMVHIQNATLAGGVAVGTAAEMMLMPYGALI IGFVCGIISTLGFVYLTPFLESRLHIQDTCGINNLHGIPGIIGGIVGAVTAASASLEVYG KEGLVHSFDFQGFNGDWTARTQGKFQIYGLLVTLAMALMGGIIVGLILRLPFWGQPSDEN 
CFEDAVYWEMPEGNSTVYIPEDPTFKPSGPSVPSVPMVSPLPMASSVPLVP >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_4|3036_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaattaacaccctaacatcacaattaaaagaactagag aagcaagagcaaatgcattcaaaagctagcagaaggcaagaaataactaagatcagagca gaactgaaggagatagagacacaaaaaactcttcaaaaaatcaatgaatccaggagctgt ttttttgaaaagatcaacaaaattgatagaccgctagcaagactaacaaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaagggagatcaccaccaatcccgca gaaatacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaat ctagaagaaatggataaattccttgacacatacaccctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggctctgaaattgaggcaataactaatagctta ccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggagctgataccattccttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcatcctgatactaaagcctggcagagacacaaca aaaaaagagaattttagaccaatatccctgatgaacattgatgcaaaaatcctcaataaa atactggcaaaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaacgtacgcaaatcaattaacgtaatccaccat aaaaacagaatcaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcagcagcccttcatgctgaaaatgctcaataaattaggtattgatgggacg tatgtcaaaatctcatcttatagctcccataattcccacgtgttgtgggagggaccagga ggaagcctcacgactgttcagccaagaaggaaagaggagactaagatggacccaacatct agagaagagaaggtggaagcagaggcagaagtttatagagagaagattcaagcctcaaca ggccaggcctttatctccttcctgtattctgtcatcgtggatggtagccttcaatggaga ggccacatgggcaagccctatcctcctactccactctatgttcatctgtcaaagcccttt gagtgcatcctcagtgctctcccgacgcggggaatcttttggggccctcggcttgggttg ggcccctgccagttgggcgagcgcgcgggcggtgcccgagtcagctgctccgtgccccac gtggccgccccgcccccctgcccgggtctcaaaagccccgccggcctccgagcgtccgaa ccgcccgctgccagcccggccaggcacccctgcagcatggcctggaacaccaacctccgc tggcggctgccgctcacctgcctgctcctgcaggtgattatggtgattctcttcggggtg ttcgtgcgctacgacttcgaggccgacgcccactggtggtcagagaggacgcacaagaac ttgagcgacatggagaacgaattctactatcgctacccaagcttccaggacgtgcacgtg 
atggtcttcgtgggcttcggcttcctcatgactttcctgcagcgctacggcttcagcgcc gtgggcttcaacttcctgttggcagccttcggcatccagtgggcgctgctcatgcagggc tggttccacttcttacaagaccgctacatcgtcgtgggcgtggagaacctcatcaacgct gacttctgcgtggcctctgtctgcgtggcctttggggcagttctgggtaaagtcagcccc attcagctgctcatcatgactttcttccaagtgaccctcttcgctgtgaatgagttcatt ctccttaacctgctaaaggtgaaggatgcaggaggctccatgaccatccacacatttggc gcctactttgggctcacagtgacccggatcctctaccgacgcaacctagagcagagcaag gagagacagaattctgtgtaccagtcggacctctttgccatgattggcaccctcttcctg tggatgtactggcccagcttcaactcagccatatcctaccatggggacagccagcaccga gccgccatcaacacctactgctccttggcagcctgcgtgcttacctcggtggcaatatcc agtgccctgcacaagaagggcaagctggacatggtgcacatccagaatgccacgctcgca ggaggggtggccgtgggtaccgctgctgagatgatgctcatgccttacggtgccctcatc atcggcttcgtctgcggcatcatctccaccctgggttttgtatacctgaccccattcctg gagtcccggctgcacatccaggacacatgtggcattaacaatctgcatggcattcctggc atcataggcggcatcgtgggtgctgtgacagcggcctccgccagccttgaagtctatgga aaagaagggcttgtccattcctttgactttcaaggtttcaacggggactggaccgcaaga acacagggaaagttccagatttatggtctcttggtgaccctggccatggccctgatgggt ggcatcattgtggggctcattttgagattaccattctggggacaaccttcagatgagaac tgctttgaggatgcggtctactgggagatgcctgaagggaacagcactgtctacatccct gaggaccccaccttcaagccctcaggaccctcagtaccctcagtacccatggtgtcccca ctacccatggcttcctcggtacccttggtaccctag >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_5|234_aa MGRARWLMPVIPALWEAEAGGSRGRINIVKIAILPKIIYRFNAIPIKLPMTFFTELEKTT LKFIWNQKRARIAKAILSQKNKAGGIMLPDFKLYYKATVTKTAGYWYWYQNRDIDQWNRT EPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAIWRRLAIWRKLKLDPFLTPY TKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDK >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_5|705_bp atgggccgggcacggtggctcatgcctgtaatcccagcactttgggaggccgaggcgggc ggatcacgaggaagaatcaatatcgtgaaaattgctatactgcccaagataatttataga ttcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactact ttaaagttcatatggaaccaaaaaagagcccgcattgccaaggcaatcctaagccaaaag aacaaagctggaggcatcatgctacctgacttcaaactgtattacaaggctacagtaacc aaaacagcagggtactggtactggtaccaaaacagagatatagaccaatggaacagaaca 
gagccctcagaaataataccgcacatctacaaccatctgatctttgacaaacctgacaaa aacaagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatggagaaggctagccatatggagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagctaaaattgacaaatag >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_6|65_aa MGFRHVGQAGLELLASATWMQLEAIILSRLMQEQKLKYRRLSLISEAKHWILMDIKMTIP DSGNY >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_6|198_bp atggggtttcgccatgttggccaggctggtctggaactcctggcctcagcaacatggatg cagctggaggccattatcttaagtcgattaatgcaggaacagaaactcaaataccgcagg ctctcgcttataagtgaagctaaacactggatacttatggacataaagatgacaatacca gacagtgggaactactaa >gi568815583r:89372738_89596544|GENSCAN_predicted_peptide_7|526_aa MLLLDTAGGAARHSRVRRAALRLLTYLSCRFGLARVHWAFKFFDSQGARSRPSRVSDFRE LGSRSWEDFEEELEARLEDRAHLPGPAPRATHTHGALMETLLDYQWDRPEITSPTKPILR SSGRRLLDVESEAKEAEAALGGLVNAVFLLAPCPHSQRELLQFVSGCEAQAQRLPPTPKQ VMEKLLPKRVREVMVARKITFYWVDTTEWSKLWESPDHLGYWTVCELLHHGGGTVLPSES FSWDFAQAGEMLLRSGIKLSSEPHLSPWISMLPTDATLNRLLYNSPEYEASFPRMEGMLF LPVEAGKEIQETWTVTLEPLAMHQRHFQKPVRIFLKGSVAQWSLPTSSTLGTDSWMLGSP EESTATQRLLFQQLVADVDPGEGRPPITGVISPLSASAMILTVCRTKEAEFQRHVLQTAV ADSPRDTASLFSDVVDSILNQTHDSLADTASAGGVPRTPVRQKMNTMCRSLKMLNVARLN VKAQKLHPDGSPDVAGEKGIQKIPSGRTVDKLEDRGRTLRSSKPKX >gi568815583r:89372738_89596544|GENSCAN_predicted_CDS_7|1578_bp atgctgctgctggacaccgcgggcggcgccgcccgccacagccgggtccggcgggccgcc ctgcgcctcctcacctatctgagttgccgattcggcctggccagggtccactgggccttc aagttctttgactcgcagggggcgcggagccggccgtcccgcgtgtctgacttccgcgag ctggggtcccgctcgtgggaggactttgaggaggagctggaggccaggctcgaggatcgc gcccacctgcccggcccggcgcccagggccacccacacgcacggcgccctgatggagacg ctgctagactaccagtgggaccggcccgagatcacgtcgcccacgaagccgatcctgcgg agcagcgggaggagactgctggacgtggagagcgaggccaaggaggccgaggccgcgctc gggggcttggtgaacgccgtcttcctcctggccccctgtccgcactcgcagagggagctg ctgcagttcgtgtctgggtgcgaggcccaggcccagcgcctgccgcccacccctaagcag 
gtgatggagaagttgttgcccaagagagtccgggaagtcatggtcgcccgaaaaatcacc ttctactgggtggataccaccgaatggtctaagttgtgggaatccccagaccaccttgga tactggactgtttgtgaactgctccaccacggaggtggcactgtcttgccatctgaatct ttcagctgggattttgctcaagctggggaaatgctgctcaggagtggaataaagctgtca agtgaacctcatctttctccgtggatttcaatgctgccaactgatgccactttaaaccgt ttgctctacaattctcctgagtatgaggcctcgtttccacgaatggaaggaatgttattt ctccctgttgaagcaggcaaagagattcaagaaacatggacagtcaccctagagcccttg gccatgcatcagagacattttcagaaaccagtcagaatttttctaaaaggctcagtggcc cagtggtctctcccaacgagcagcactttgggcactgacagctggatgctaggaagtcca gaggagagcacagcaactcaaaggctgttatttcagcagttggttgctgatgtggaccct ggtgaaggccggccccccatcactggagttatttccccactctctgccagtgctatgatc ctcactgtgtgccgcaccaaggaggctgaatttcaacgacatgttctccaaacagctgtg gctgacagcccccgggacacagcttcccttttctcagatgttgtggatagtatattgaat cagactcatgattcgcttgcagatactgcttctgctggaggggtccctcgtactccagtg agacagaagatgaataccatgtgccgttccttaaagatgttgaatgtcgcaaggctgaat gtgaaggcccagaagttacatccagatggcagtccggatgtggctggggagaaaggaatc caaaagatacctagtgggagaacagtggataaattggaagacagaggaagaacactaaga agttctaaacctaaagnn