GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:27:04

Sequence gi568815596r:20518990_20745508 : 226519 bp : 48.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    827    982  156  0  0   64   93    49 0.425   2.98
 1.02 Term +   6615   6775  161  1  2  114   49    67 0.617   3.50
 1.03 PlyA +   7535   7540    6                              -0.45

 2.13 PlyA -  10427  10422    6                               1.05
 2.12 Term -  10972  10478  495  1  0   33   42   258 0.078  10.27
 2.11 Intr -  19722  19617  106  2  1   70   57    73 0.099   2.62
 2.10 Intr -  20574  20420  155  0  2   88   61    36 0.104  -0.23
 2.09 Intr -  30422  30344   79  1  1   68   86    54 0.096   2.75
 2.08 Intr -  34956  34891   66  2  0   79  116    21 0.135   2.02
 2.07 Intr -  40144  40043  102  0  0   45   76    71 0.338   0.89
 2.06 Intr -  41404  41350   55  2  1   94   78    31 0.445   0.74
 2.05 Intr -  44847  44686  162  1  0   33   93   137 0.366   8.65
 2.04 Intr -  49809  49734   76  0  1   69  113    59 0.541   5.59
 2.03 Intr -  56203  56132   72  1  0   24   84    74 0.267   0.20
 2.02 Intr -  58259  58029  231  2  0   73   99    27 0.068   0.27
 2.01 Init -  71560  71483   78  1  0   41   60   101 0.061   3.67
 2.00 Prom -  77655  77616   40                              -4.56

 3.03 PlyA -  78746  78741    6                               1.05
 3.02 Term -  87704  87501  204  2  0   46   47   223 0.969  11.37
 3.01 Init -  92780  92700   81  2  0   75   84     8 0.403   0.07
 3.00 Prom -  93169  93130   40                              -1.56

 4.11 PlyA -  93331  93326    6                               1.05
 4.10 Term - 100256  99998  259  1  1   80   46   315 0.889  21.52
 4.09 Intr - 105042 104907  136  1  1  116   81    80 0.961   9.73
 4.08 Intr - 105903 105743  161  2  2   76  103   286 0.999  28.53
 4.07 Intr - 119663 119447  217  0  1  117   12   216 0.096  14.46
 4.06 Intr - 122191 121984  208  1  1   99   48   287 0.972  24.45
 4.05 Intr - 122711 122605  107  0  2   49   73    62 0.542   0.73
 4.04 Intr - 123342 123179  164  2  2   94   70    55 0.461   3.92
 4.03 Intr - 123764 123715   50  2  2   78   31    51 0.267  -4.12
 4.02 Intr - 126516 126351  166  2  1  130   85   290 0.999  32.86
 4.01 Init - 128600 128497  104  2  2   82   97    82 0.859   6.53
 4.00 Prom - 138239 138200   40                              -4.06

 5.00 Prom + 139116 139155   40                              -5.36
 5.01 Init + 148251 148759  509  2  2   67   59   505 0.638  37.93
 5.02 Intr + 148897 149039  143  1  2   97   65     7 0.547  -0.70
 5.03 Intr + 151475 152432  958  0  1  112   74  1140 0.406 105.14
 5.04 Intr + 152721 152823  103  0  1   54   30    87 0.233  -0.32
 5.05 Term + 161753 161836   84  1  0   70   49    87 0.489   0.65
 5.06 PlyA + 162335 162340    6                               1.05

 6.04 PlyA - 162721 162716    6                               1.05
 6.03 Term - 168105 167914  192  0  0   92   32   187 0.777  10.92
 6.02 Intr - 183214 183197   18  1  0  126   61    35 0.036   1.41
 6.01 Init - 186508 186443   66  1  0   73  111     0 0.322   1.87
 6.00 Prom - 191594 191555   40                              -2.56

 7.00 Prom + 194847 194886   40                              -5.46
 7.01 Init + 194940 195961 1022  2  2   49   72   378 0.118  25.48
 7.02 Intr + 196275 196372   98  0  2   44   40   125 0.058   2.95
 7.03 Term + 214082 214095   14  0  2  131   48    12 0.137  -0.14
 7.04 PlyA + 215121 215126    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  10987  10478  510  1  0   82   42   238 0.887  14.76
S.002 Term - 119663 119428  236  0  2  117   54   202 0.894  16.18
S.003 Term + 142805 142920  116  0  2   92   52    70 0.822   2.53
S.004 Init - 177603 177528   76  0  1   81   89    54 0.891   5.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_1|105_aa
XTVLEKPGPDPSPGISSSLHRDSECIRPLSHQRWKPKPMSSLPQGSEQAQQHWEAWEPGT
GKPLQGTARHCVAQQIWTQGLRGQPDTPLSMRAKEASESSVRGHS

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_1|318_bp
ngcacagtcctggagaagccaggaccagacccctccccagggatcagcagcagcctccac
cgggactcggaatgtattaggccattatcccaccaaagatggaagcccaagcccatgtcc
tctcttccccagggttcagaacaagcccagcaacactgggaagcatgggagccagggaca
ggaaagcctctgcagggcacagccaggcactgtgtcgcccagcagatctggacacaaggg
ctgcggggccaaccagacactccactgagcatgcgggccaaggaggcctctgagagctca
gtgagaggacacagctag

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_2|558_aa
MRLTEQQLEPEIAKGSEAVGLLRDEGLPGPPRYGNINSIVHEYQCVMPPVLNLPGTPSPS
LALGHPTSPSRPSDKPLFTLKTQLKRQFPCKFSHKFPCKYSSQDKTPEPIVPKVPEKSKQ
MPNNNMGVGTLRDKHYVLWFSILSPTQGLAYHALRGCRYRWSMNHTLKNKAVATNESPLD
STARVEGWASVKKPPVQEWQKNGLSTGSWAGVEEEAPLQIQYGESSYIDKEVGLEALQGP
LRSMQIQQTFLPAFTLLTMNLMASTRILKTTAHQFASLHKRCDLNPGADGAEVSMNDSSR
SGGLVGKPCPSICKIFDLTNIFCALPENDITKCALCPVTWSSWKHRMSLSLDTAPGLGRG
SLQPPSGKLPPWGQSLLPPVLPGVKDVLLLKGEQDLFRALWRSLSREVKEHVGTDQFGNK
YYYILQNKNWRGQTIQEKRIVEAANKKEVDYETGDIPTEWEAWIRRTRKTPPTMEEILKN
EKHREEIKIKSQDFYEKEKLLSKETSEELLPPPVQTQIKGRASAPYFGKEKPSVAPSSTG
KTFQPGSWMPRDGKSHNE

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_2|1677_bp
atgaggttgacagagcaacagctggaaccagagatcgccaaggggagcgaggctgtgggg
ctgctccgggatgaggggctcccaggccctccccgatacggaaatattaactcaattgta
catgaatatcagtgtgtcatgcccccagtccttaatctccctggaaccccaagtccttca
cttgccctgggccaccccacatctccatctaggcccagtgacaagcccttgtttactctg
aaaacccaactcaaacggcaattcccttgtaaattctcccataaatttccttgtaaatac
tcttcccaggacaaaactccagagcccattgttccaaaagttccagagaaatccaaacaa
atgcccaacaacaatatgggggtgggaactcttcgagataagcactatgtcttatggttc
tccatcctgtccccaacacagggcctggcctatcacgctctcaggggatgtcgataccgc
tggtccatgaaccacactttgaaaaacaaagctgtagctaccaatgagagccctctggat
tctactgccagggtggaggggtgggcgagtgtgaagaagccccctgtgcaggaatggcaa
aagaatggacttagtactggatcctgggctggggtggaggaggaggcgccactgcagatc
cagtatggtgagagttcctacattgacaaagaagttggactagaggcactccagggccca
ctgagatccatgcagatacagcagactttccttcctgcattcacgctcctcacgatgaat
ctcatggcttccaccagaatcctcaaaaccacagcacatcagtttgctagcttgcacaaa
aggtgtgatctcaaccctggggctgatggcgcagaagtgagcatgaacgacagctcacgt
tctggtggccttgtgggaaagccctgcccctccatctgtaagatctttgatctcacaaac
atattctgtgcccttccagagaatgacatcaccaagtgtgccctctgtcctgtcacttgg
agctcctggaagcacagaatgtccctcagcctggacactgcccctggcctggggaggggg
tccctgcagccaccatcaggaaagctcccaccctggggacagtctctcctgcctcctgtt
cttcctggggtcaaagacgtgctcctgctcaagggtgagcaggatttgttccgcgccttg
tggagatcgctgtcaagggaagtgaaggagcacgtgggcacggaccaattcgggaacaaa
tactactacatcctgcagaacaagaactggagaggacaaactattcaagagaaaagaatt
gtagaagcagcaaataaaaaagaagtagactatgaaacaggggatattccaacagaatgg
gaagcttggattagaagaacaagaaagactccacctactatggaggaaatactaaagaat
gaaaaacacagagaagaaatcaaaataaaaagccaagatttttatgaaaaagaaaaactc
cttagtaaagagaccagtgaggaactcctgcctccaccagttcaaactcaaattaaaggc
cgtgcctctgctccatacttcggaaaggaaaaaccctcagtggctcccagcagcactggt
aaaacctttcagccaggatcctggatgccacgagatggcaagagccacaatgaatga

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_3|94_aa
MGMGTYKTTGDPERGPGIEGPCVQGALRGCSVHRCSLSFAMPPKDKKKKKDAGKLARKDK
DPVSKSGDKAKKKKCSKDIVRDKPNNLVLFDKTT

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_3|285_bp
atgggaatgggcacatacaaaaccactggagacccagaaaggggccctgggattgaggga
ccctgtgtccagggggccctgcgaggctgtagcgttcaccgctgctctctgagctttgca
atgccacccaaggacaagaagaagaagaaagacgctggaaagttggccaggaaagacaaa
gacccagtgagcaaatccggggacaaggccaaaaagaagaagtgctccaaagacatagtt
cgggacaagcccaataacttagttttgtttgacaaaactacctaa

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_4|523_aa
MAGHPRPEHAFLTSSQAMLILIQGPHFENPTSRARRLQNAHTGLDLTVPQHQEVRGKMMS
GHVEYQILVVTRLAAFKSAKHRPEDVVQFLMPTQEALPEPLMDVGPGLGILGGVAGAKDV
CSRHTLHQGISPSPRASDARSPCTPALQWHTVGTVRAWEHKKRSCLTQGREQALEISVST
ACIGWPAEGHTGSRENEVSKKYSEIEEFYQKLSSRYAAASLPPLPRKVLFVGESDIRERR
AVFNEILRCVSKDAELAGSPELLEFLGTRSPGAAGLTSRDSSVLDGTDSQTGNDEEAFDF
FEEQDQVAEEGPPVQSLKGEDAEESLEEEEALDPLGIMRSKKPKKHPKVAVKAKPSPRLT
IFDEEVDPDEGLFGPGRKLSPQDPSEDVSSVDPLKLFDDPDLGGAIPLGDSLLLPAACES
GGPTPSLSHRDASKELFRVEEDLDQILNLGAEPKPKPQLKPKPPVAAKPVIPRKPAVPPK
AGPAEAVAGQQKPQEQIQAMDEMDILQYIQDHDTPAQAAPSLF

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_4|1572_bp
atggctggccaccctaggcccgaacatgcctttcttacaagttcccaggcgatgctgatc
ctgatccagggaccacactttgagaaccccacatctagggcaaggcgacttcagaatgcc
cacactggcctcgacctgactgtgccccagcaccaggaggtacggggcaagatgatgtct
ggacacgtggagtaccagatcctggtggtgacccgtctggctgcgttcaagtcggccaag
cacaggcccgaggatgtcgtccagttcttgatgcccacccaagaagccctgcctgaaccc
ctgatggacgtgggccctgggctgggcattctgggtggtgttgctggagccaaggatgtg
tgttccagacacaccctgcaccaagggatctccccgagcccgagggcgtcagatgcccgc
agcccctgcacgccagcactccagtggcacactgtgggcacagtccgggcctgggagcat
aagaagaggtcctgcctgacgcagggcagagaacaagccttggaaatatctgtctccact
gcttgcattggctggcccgccgaaggccacactggttcacgggagaatgaggtctccaaa
aagtacagcgagattgaggagttttaccagaaactgagcagtcgttatgcagcagccagc
ctccccccactacccaggaaggtcctgtttgttggggagtctgacatccgggagaggaga
gccgtgttcaatgagatcctgcgctgtgtctccaaggatgccgagttggcaggcagccca
gagctgctagagttcttaggtaccagatccccaggggctgcagggctcaccagcagagat
tcctctgtcctggatggcacagacagtcagacagggaatgatgaagaggctttcgacttt
tttgaggagcaagaccaagtggcagaagagggtccgcccgtccagagcctgaagggcgag
gatgctgaggaatccttggaggaggaggaggcgctggaccctctgggcattatgcgctcc
aagaagcccaagaaacatcccaaagtggccgtgaaagccaagccctcgccccggctcacc
atctttgacgaggaggtggaccctgatgaggggctctttggcccgggcaggaagctgtct
ccacaggacccctcggaggacgtgtcatccgtggaccccctgaagctatttgatgatcct
gacctcggcggggccatccccctgggtgactccctcctgctgccagccgcctgtgagagt
ggagggcccacacccagcctcagccacagggacgcctccaaggaactgttcagagttgaa
gaggacttggaccagattctgaacctgggagctgagcccaaacccaagccccagcttaag
cccaagccaccagtggcagctaagccggtgatacccagaaaaccagctgttccccccaaa
gcgggcccggctgaagctgtggctgggcagcagaagccgcaggagcagatccaagccatg
gacgagatggacatcttgcagtacatccaggaccacgatacaccagcccaggccgccccc
agcctcttctga

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_5|598_aa
MDLSAAAALCLWLLSACRPRDGLEAAAVLRAAGAGPVRSPGGGGGGGGGGRTLAQAAGAA
AVPAAAVPRARAARRAAGSGFRNGSVVPHHFMMSLYRSLAGRAPAGAAAVSASGHGRADT
ITGFTDQATQGTYASSVPAHPVRSWAETSPGAVPQLRYIWSRGPMRPTLRPAGPLEEAGG
GQDSLLGEWSGVASEPSEKKGGQELCSRESRREGLHGDESAAETGQSFLFDVSSLNDADE
VVGAELRVLRRGSPESGPGSWTSPPLLLLSTCPGAARAPRLLYSRAAEPLVGQRWEAFDV
ADAMRRHRREPRPPRAFCLLLRAVAGPVPSPLALRRLGFGWPGGGGSAAEERAVLVVSSR
TQRKESLFREIRAQARALGAALASEPLPDPGTGTASPRAVIGGRRRRRTALAGTRTAQGS
GGGAGRGHGRRGRSRCSRKPLHVDFKELGWDDWIIAPLDYEAYHCEGLCDFPLRSHLEPT
NHAIIQTLLNSMAPDAAPASCCVPARLSPISILYIDAANNVVYKQYEDMVVEACGCRERK
VFVLMLQLEARKSGSNNGLGNSKFQNGRTKGAKAKMPPTAVPLRVVPNVVMDACALVS

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_5|1797_bp
atggacctgagcgccgccgccgcgctgtgcctttggctgctgagcgcctgccgcccccgc
gacgggctggaagcggccgccgtgctgcgagcggcgggggctgggccggtccggagccca
gggggcggcggcggcggcggcggcggcgggcggactcttgcccaggctgcgggcgccgcg
gctgtcccggccgccgcggttccccgggcccgcgccgcgcgccgcgccgcgggctccggc
ttcaggaacggctcggtggtgccgcaccacttcatgatgtcgctttaccggagcctggcc
gggagggctccggccggggcagccgctgtctccgcctcgggccatggtcgcgcggacacg
atcaccggcttcacagaccaggcgacccaaggtacttacgcctcttctgtgcccgcccat
cccgtcaggtcctgggctgagaccagccccggagccgtgccgcagctccgttacatttgg
agccgcggccccatgcggcccaccctcaggccggctggtcccctcgaggaggcaggcgga
ggccaagattcgcttcttggggaatggtctggagtggcctcggagccctcagagaagaaa
ggagggcaagagctgtgttcccgggagtcacggcgagagggactgcacggtgacgaatcg
gcagccgaaacaggccagagcttcctgttcgacgtgtccagccttaacgacgcagacgag
gtggtgggtgccgagctgcgcgtgctgcgccggggatctccagagtcgggcccaggcagc
tggacttctccgccgttgctgctgctgtccacgtgcccgggcgccgcccgagcgccacgc
ctgctgtactcgcgggcagctgagcccctagtcggtcagcgctgggaggcgttcgacgtg
gcggacgccatgaggcgccaccgtcgtgaaccgcgccccccccgcgcgttctgcctcttg
ctgcgcgcagtggcaggcccggtgccgagcccgttggcactgcggcggctgggcttcggc
tggccgggcggagggggctctgcggcagaggagcgcgcggtgctagtcgtctcctcccgc
acgcagaggaaagagagcttattccgggagatccgcgcccaggcccgcgcgctcggggcc
gctctggcctcagagccgctgcccgacccaggaaccggcaccgcgtcgccaagggcagtc
attggcggccgcagacggaggaggacggcgttggccgggacgcggacagcgcagggcagc
ggcgggggcgcgggccggggccacgggcgcaggggccggagccgctgcagccgcaagccg
ttgcacgtggacttcaaggagctcggctgggacgactggatcatcgcgccgctggactac
gaggcgtaccactgcgagggcctttgcgacttccctttgcgttcgcacctcgagcccacc
aaccatgccatcattcagacgctgctcaactccatggcaccagacgcggcgccggcctcc
tgctgtgtgccagcgcgcctcagccccatcagcatcctctacatcgacgccgccaacaac
gttgtctacaagcaatacgaggacatggtggtggaggcctgcggctgcagggagaggaag
gtgtttgtgctgatgttgcagttagaggcacgaaaatcaggtagcaacaacgggttagga
aattcgaagttccagaatgggagaaccaaaggggccaaggcaaagatgcctccaactgct
gttccattgagggtggtgcccaacgtggtcatggatgcatgtgctcttgtgtcctag

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_6|91_aa
MEKTGEEITKEIIKENFPERKLVFDILKLTFYYGTIDPWCPKEYYEDIKKDFPEGDIRLC
EKNIPHAFITHFNQEMADMIADSLKDDLSKM

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_6|276_bp
atggagaaaacaggggaagaaattaccaaagaaataataaaagaaaattttccagaaaga
aagttggtatttgacatcttgaaacttacattttattatggtactatagatccttggtgt
ccaaaagagtactatgaagacattaagaaggattttccagaaggagacattcgactctgt
gagaaaaacatacctcatgctttcatcacccattttaaccaggaaatggcagacatgatt
gctgactccctaaaggatgacttgtccaaaatgtaa

>gi568815596r:20518990_20745508|GENSCAN_predicted_peptide_7|377_aa
MGDFNTPLSILDKSMSQKVNKHIQDLNSALHQADLIDIYRTLHPKSTEYTFFSAPHGTYS
KIDPIVGSKALLSKCKRMEITTNCLSDHSAIKLELRIKKLGQNHTTTWKLNNLLLNDYWV
NNEMKAEIKMFFETNENKDTTYQDLWDTFKAVCRGKFVALNTHTGKQERSKINTLMSQLK
ELEKQEQANSTASRRQEITKIRAELKEVETQKTLQKINESRSWFFEKISKIDRLLARLIK
KKREKNQIDAIQNDKGDITTNLTEIQTTIREYYKHFYANKLENLEEMDKFLDTYTLPRLS
QEEVESLNRPITGSEIEAIINGLPTKKSPGPDRFIAEFYQRTNDKNHMIISIDAEKAFDK
IQQPFMLKTLNKLGSSG

>gi568815596r:20518990_20745508|GENSCAN_predicted_CDS_7|1134_bp
atgggagactttaacaccccactgtcaatattagacaaatcaatgagccagaaggttaac
aagcatatccaggacttgaactcagctctgcaccaagcagacctaatagacatctacaga
acactccaccccaaatcaacagaatataccttcttctcagcaccacatggcacttattcc
aaaattgaccccatagttggaagtaaagcactcctcagcaaatgtaaaagaatggaaatc
acaacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactt
ggtcaaaatcacacaactacatggaaactgaacaacctgctcctgaatgactactgggta
aataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acgtaccaggatctctgggacacatttaaagcagtgtgtagagggaaatttgtagcacta
aatacccacacaggaaagcaggaaagatctaaaatcaacaccctaatgtcacaattaaaa
gaactagagaagcaagagcaagcaaattcgacagctagcagaaggcaagaaataactaag
atcagagcagaactgaaagaggtagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaagatcagcaaaattgatagactgctagcaagactaataaag
aagaaaagagagaagaatcaaatagatgcaatacaaaatgataaaggggatatcaccacc
aatctcacagaaatacaaactaccatcagagaatactataaacacttctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctggacacatacaccctcccaagactaagc
caggaagaagtggaatctctgaatagaccaataacaggatctgaaattgaggcaataatc
aatggcctaccaaccaaaaaaagtccaggaccagacagattcatagccgaattctaccag
agaaccaacgacaaaaaccacatgattatctcaatagatgcggaaaaggcctttgacaaa
attcaacagcccttcatgctaaaaactctcaataaactaggttcttcaggttga